Career Journey

4+ Years of
Building at Scale

A timeline of systems built, problems solved, and scale achieved — from first commit to production incident veteran.

4+
Years Experience
200+
Datasets Scaled
4x
Cost Reduction
3
Cloud Certs
Timeline

My Journey

CURRENT Dec 2023 – Present

Senior Data Engineer

Nasdaq · Bengaluru, India
PySpark Databricks Delta Lake AWS Airflow
  • Architected and scaled a distributed data platform supporting 200+ datasets
  • Built ETL pipelines using PySpark, Databricks, and Delta Live Tables
  • Migrated legacy pipelines to Delta Lake architecture
  • Managed and monitored pipelines using Airflow
  • Implemented CI/CD pipelines using Jenkins and GitHub Actions
  • Improved data reliability using validation frameworks and monitoring systems
  • Worked on large-scale batch and real-time data processing systems
Oct 2021 – Nov 2023

Data Consultant

Knowledge Foundry Business Solutions · Bengaluru, India
PySpark AWS Airflow
  • Designed and optimized ETL pipelines achieving 4x cost reduction
  • Built scalable data pipelines using PySpark and AWS
  • Developed distributed data processing systems
  • Designed cloud-based workflows using S3, EC2, and Airflow
  • Built ML pipelines for preprocessing, feature engineering, and model tuning
May 2021 – Oct 2021

Data Engineer Intern

Zep Analytics · India
Python MySQL
  • Built ETL pipelines using Python
  • Developed API-based data ingestion systems
  • Created web scraping solutions for large-scale data collection
  • Optimized MySQL queries for better performance
  • Worked on end-to-end data workflows
2017 – 2021

B.Tech in Computer Science and Engineering

Feroze Gandhi Institute of Engineering and Technology
Contributions

Research & Publications

📄
Voice-Based Classification using ML/DL
Neuroquantology · December 2022

Published a research paper focusing on classification of patients leveraging various Machine Learning and Deep Learning architectures.

📦
Autowave: AutoML Python Library
Open Source · PyPI

Authored and published an open-source AutoML library designed specifically for automating audio analysis, training, and classification pipelines.

Certifications

AWS, Azure & Data Science

☁️
AWS Solutions Architect
Amazon Web Services · Associate
🔷
Azure Administrator
Microsoft Azure · Associate
🔴
Databricks Lakehouse
Databricks · Fundamentals
hub
Neo4j Professional
Graph Database Certification
psychology
Machine Learning Master Course
Coding Blocks · Deep Learning & NLP
model_training
Generate Synthetic Images DCGANs
Coursera · Deep Learning