Career Journey
4+ Years of
Building at Scale
A timeline of systems built, problems solved, and scale achieved — from first commit to production incident veteran.
4+
Years Experience
200+
Datasets Scaled
4x
Cost Reduction
3
Cloud Certs
Timeline
My Journey
CURRENT
Dec 2023 – Present
Senior Data Engineer
Nasdaq · Bengaluru, India
PySpark
Databricks
Delta Lake
AWS
Airflow
- → Architected and scaled a distributed data platform supporting 200+ datasets
- → Built ETL pipelines using PySpark, Databricks, and Delta Live Tables
- → Migrated legacy pipelines to Delta Lake architecture
- → Managed and monitored pipelines using Airflow
- → Implemented CI/CD pipelines using Jenkins and GitHub Actions
- → Improved data reliability using validation frameworks and monitoring systems
- → Worked on large-scale batch and real-time data processing systems
Oct 2021 – Nov 2023
Data Consultant
Knowledge Foundry Business Solutions · Bengaluru, India
PySpark
AWS
Airflow
- → Designed and optimized ETL pipelines achieving 4x cost reduction
- → Built scalable data pipelines using PySpark and AWS
- → Developed distributed data processing systems
- → Designed cloud-based workflows using S3, EC2, and Airflow
- → Built ML pipelines for preprocessing, feature engineering, and model tuning
May 2021 – Oct 2021
Data Engineer Intern
Zep Analytics · India
Python
MySQL
- → Built ETL pipelines using Python
- → Developed API-based data ingestion systems
- → Created web scraping solutions for large-scale data collection
- → Optimized MySQL queries for better performance
- → Worked on end-to-end data workflows
2017 – 2021
B.Tech in Computer Science and Engineering
Feroze Gandhi Institute of Engineering and Technology
Contributions
Research & Publications
📄
Voice-Based Classification using ML/DL
Neuroquantology · December 2022
Published a research paper focusing on classification of patients leveraging various Machine Learning and Deep Learning architectures.
📦
Autowave: AutoML Python Library
Open Source · PyPI
Authored and published an open-source AutoML library designed specifically for automating audio analysis, training, and classification pipelines.
Certifications
AWS, Azure & Data Science
☁️
AWS Solutions Architect
Amazon Web Services · Associate
🔷
Azure Administrator
Microsoft Azure · Associate
🔴
Databricks Lakehouse
Databricks · Fundamentals
hub
Neo4j Professional
Graph Database Certification
psychology
Machine Learning Master Course
Coding Blocks · Deep Learning & NLP
model_training
Generate Synthetic Images DCGANs
Coursera · Deep Learning