Building production-ready AI systems — from Generative AI and RAG pipelines to scalable ML workflows and data engineering on AWS. Experienced in LangChain, vector databases, prompt engineering, and end-to-end ML deployment. Passionate about turning raw data and documents into intelligent, reliable AI products.
// about me
I'm an AWS Certified Cloud Practitioner and MS Computer Science graduate from Pace University, New York, with a B.Tech in Artificial Intelligence from Amrita Vishwa Vidyapeetham. My focus is on designing and deploying end-to-end machine learning pipelines and cloud-native data systems on AWS that go beyond notebooks into real production infrastructure.
On the ML engineering side, I've trained and deployed models using XGBoost, TensorFlow, and Scikit-learn for fraud detection, churn prediction, and demand forecasting. I work across the full ML lifecycle — feature engineering, hyperparameter tuning, cross-validation, SageMaker managed training jobs, and real-time inference endpoints — with monitoring and automated retraining via CloudWatch and EventBridge.
On the data engineering side, I build batch and streaming ETL pipelines that process 500,000+ records, design data lake architectures on Amazon S3, and manage relational data warehouses on AWS RDS. I understand the full data journey — schema validation, data quality enforcement, transformation logic, and storage — the infrastructure that makes ML systems actually reliable.
I'm actively seeking entry-level roles in Data Engineering, ML Engineering, or Cloud Engineering at US tech companies and startups.
Download Resume ↓// tech stack
// featured projects
Production-style ML systems built on AWS — real-time pipelines, model deployment, and automated monitoring.
Python · Scikit-learn · AWS S3 · Lambda · Kinesis · RDS · SageMaker
Cloud-native, real-time fraud detection pipeline on AWS — processing insurance claims through a 4-stage distributed architecture from ingestion to inference.
Python · LangChain · OpenAI · ChromaDB · Streamlit
Production-grade RAG pipeline answering questions from custom documents with mandatory source citations — MMR retrieval, metadata-grounded responses, and strict hallucination prevention via prompt engineering.
Python · XGBoost · AWS S3 · Lambda · SageMaker · Comprehend · CloudWatch · EventBridge
End-to-end MLOps pipeline predicting SaaS customer churn with NLP feature extraction from support tickets and automated model retraining on data drift.
Python · XGBoost · Pandas · NumPy · AWS S3 · SageMaker
Scalable batch ETL pipeline processing 500,000+ retail transactions for time-series demand forecasting, deployed as a SageMaker real-time endpoint.
Python · XGBoost · TensorFlow · LSTM · CNN · Flask
Multi-model ML system benchmarking LSTM, CNN, and XGBoost for 7-day Bitcoin price forecasting, deployed as a Flask REST API with live web dashboard.
Python · JAX · TensorFlow · OpenCV · COLMAP
Implemented Deformable NeRF for photorealistic 3D face reconstruction from monocular video, with multi-stage preprocessing for camera pose estimation.
// system design
End-to-end production ML pipeline — from raw data to automated model monitoring.
Real-time claim scoring with distributed stream processing and SQL-based analyst reporting.
Automated ML lifecycle: NLP feature extraction → training → real-time endpoint → drift-triggered retraining.
500K+ transaction batch pipeline with temporal feature engineering and live prediction serving.
// what i build
// contact
🟢 Open to Machine Learning Engineer, Data Engineer, and Cloud Engineering opportunities at US tech companies and startups.