Python and Javascript development frameworks | Data Science
● Owner - File Ingestion and Profile Management
● Write modules in Scala and Python that translate large volumes of JSON and XML data
into familiar schemas for other teams to analyse
● Ideate data flow pipelines and construct them on Apache Airflow for orchestration and
management ● Work on their deployment on Apache Spark using hadoop friendly clusters on Amazon
EMR
● Worked in the NextGenOps team primarily with machine log parsing
● Created Kafka connectors using Scala to ingest logs sent on the particular topic and
process them as per the design requirements
● Implemented the Streaming Parsing of System Event Logs using LCS for better analysis
of new logs
● Develop ML models over a range of problems spanning hospital readmission, molecular chemistry, retail and banking
● Compete with other researchers for top performing model submissions
● Part of the Thoreau project under Prof. Guha
● Implemented a CSV to dashboard pipeline using Django and Postgres
● Image layering on Maps using the Mapbox API
Working on finding well performing alphas and assesing their performance using scientific programming in Python and inbuilt expressions.
PCM, Computer Science
A simple implementation to fetch metrics of various departments from RSP via Internet and display them on the mobile screen. Awarded letter of recommendation.
Ranked 24/10341 (top 1%) on the Titanic problem. The initial kernel has 20+ forks and over 2000 views. Other kernels in 30%+, working on improving their performance.
Active contributor in the OPNFV Community. Enhancing testing in the college maintained AutolabCLI
Working with the Nirmaan team in helping kids in Zari learn Maths and English and compete for the JNV examination.
Helping first time git users with various issues regrading Git and Github. Earlier mentored Machine Learning Foundations on Coursera, helping others with simple ML models for sentiment analysis, deep learnig on images, etc using Graphlab in Python.