Designed and optimized Vector and Hybrid Search pipelines for large-scale document retrieval. Built scalable Knowledge Base supporting 7+ document types. Reduced API latency by 60% (5s to 2s).
Created extensible ML model serving platform using TF-Serving, reducing latency by 99% (400ms to 5ms). Built VLM-powered product filter generation for 10k+ products using GPT-4o.
Fine-tuned Llama3-8B-Instruct for banking chatbot scenarios achieving 90%+ accuracy. Built Text-to-SQL AI Copilot for Salesforce DB, reducing query creation time by 95%.
Built Agent Scoring mechanism using distilBERT, driving 100% product revenue growth. Secured 4000+ ML deployments with IAM-based access control. Optimized Kafka pipeline reducing false alerts by 70%.
Engineer at AI71
Senior Engineer, AI71
Team Lead at Sprinklr
Engineering Manager, Sprinklr
Associate Director at Zepto
ML Platform Lead, Zepto
Engineer
Software Engineer
Insights on designing and optimizing RAG pipelines for large-scale document retrieval and knowledge orchestration.
Lessons learned from deploying 4000+ ML models in production. Infrastructure security, scalability, and reliability.
Building ReAct-based agentic orchestrators and multi-agent workflows for dynamic use-case coverage.