Available for full-time roles

AI / ML Engineer

Production AI —
not just notebooks.

3 live AWS deployments: investment research, document retrieval, and neural audience intelligence — all automated, auditable, and measured by the work they replace.

3 Live AWS deployments
PR #20 Submitted to Meta FAIR TRIBE v2 (CLA signed)
<30s Deterministic investment decision
Raj Kumar Nelluri
Available for Full-Time Roles AI/ML Engineer · Remote or Hybrid

Real systems.
Measured results.

Every number below comes from a production deployment that replaced a manual process — not a Kaggle notebook.

96.7%
Fraud detection accuracy across 50k+ insurance claims
91%
F1 score on churn prediction with SHAP explainability
87%
Retrieval accuracy on enterprise RAG chatbot
4.2%
RMSE on 8-week retail sales forecast — 18% better than baselines

Tools I work
with daily.

AI / ML
PyTorch TensorFlow Scikit-learn XGBoost YOLOv8 LangChain
LLM / Data
GPT-4o LangGraph ChromaDB FAISS Pandas NumPy Prophet
Infrastructure
AWS EC2 SageMaker Lambda Kinesis S3 Docker FastAPI Redis Nginx
MLOps
MLflow GitHub Actions Streamlit Weights & Biases

Built. Shipped. Measured.

Every project below replaced a manual process that was costing someone real time, money, or accuracy.

All on GitHub

Full-stack AI —
from manual process
to production system.

Designed and deployed complete AI systems — not just notebooks. A model that never leaves a researcher's laptop isn't a product — it's a cost. Every system I've built started with a process someone was doing manually. My job was to make that unnecessary. Engineered across the full stack: model architecture and training, API development, containerisation, and AWS cloud deployment.

Owned three commercially critical areas end-to-end: LLM agents and RAG pipelines (GPT-4o, LangGraph, deterministic decision systems, hallucination mitigation), computer vision systems (YOLO, ResNet, real-time inference), and predictive ML (forecasting, churn, fraud). I care less about model accuracy on holdout sets and more about what happens when the system goes live. Every system is measured against accuracy, latency, and business outcome — not validation loss.

3 Production deployments
3 AI domains
AWS Cloud infrastructure
End-to-end Full-stack ML
MS CS Pace University · NYC
AWS CCP Certified Cloud Practitioner
B.Tech AI Amrita Vishwa Vidyapeetham
01
LLM Engineering
Production-grade agents and RAG pipelines that replace manual research and document retrieval — with deterministic decision logic and hallucination guardrails built in.
02
Computer Vision
YOLO and ResNet pipelines that replace manual inspection and review at scale — real-time inference via FastAPI, streaming ingestion via Kinesis, results queryable via SQL.
03
ML Infrastructure
End-to-end MLOps on AWS — S3 ingestion, Lambda ETL, SageMaker training, real-time endpoints, CloudWatch drift detection, and CI/CD retraining. Systems that stay accurate without manual intervention.
04
Forecasting & Prediction
LSTM + Prophet ensembles and XGBoost classifiers that replace manual planning and gut-feel decisions — SHAP explainability makes every prediction auditable for non-technical stakeholders.

Every layer — so nothing blocks the deployment.

ML / DL
PyTorch & TensorFlow Scikit-learn XGBoost / LightGBM YOLOv8 / ResNet LSTM / Transformers Prophet SHAP / LIME
LLM & Data
GPT-4o / LLM APIs LangChain LangGraph ChromaDB / FAISS RAG Pipelines Pandas / NumPy SQL / PostgreSQL OpenCV
Cloud & Infra
AWS EC2 / S3 SageMaker Lambda / Kinesis RDS / CloudWatch Docker & Compose FastAPI Redis Nginx GitHub Actions
MLOps & Tools
MLflow Weights & Biases Streamlit Jupyter Python (Expert) Git / GitHub Linux / Bash
Actively looking · Available now

Got a manual workflow
that should be automated?

I'm looking for a full-time AI/ML Engineering role where there's a real problem — a bottleneck, a slow decision, a process someone is doing by hand. If that's your team, reach out directly. I reply within 24 hours.