Skip to content

Jayant Khandebharad

I build production-grade GenAI systems end-to-end — agentic platforms, RAG pipelines, voice AI, and LLMOps on Azure.

3+ yrs
building production GenAI
10K+
documents indexed (GraphRAG)
+35%
retrieval precision
400+
calls/mo handled by voice AI
−22%
cloud cost
99.9%
uptime

Things I've built

What I work with

Languages

  • Python
  • TypeScript
  • SQL
  • C++

GenAI & Agentic

  • RAG & GraphRAG
  • Multi-agent orchestration
  • LangChain
  • LangGraph
  • LiteLLM
  • Prompt engineering
  • Evals & guardrails
  • Embeddings & vector search

LLM Providers

  • Azure OpenAI (GPT-4o)
  • Anthropic Claude
  • Google Gemini
  • Groq
  • Hugging Face

Voice & Document AI

  • Azure Communication Services
  • Azure Speech (STT/TTS, SSML)
  • Azure AI Search
  • OCR

Backend & Data

  • FastAPI
  • Azure Functions
  • Async workers & queues
  • Service Bus
  • PostgreSQL + pgvector
  • Cosmos DB
  • MySQL
  • Redis
  • REST APIs

Cloud, DevOps & LLMOps

  • Azure Container Apps
  • Container App Jobs
  • Docker
  • Azure DevOps CI/CD
  • Bicep
  • Application Insights / KQL
  • Cost & observability

Foundations

  • PyTorch
  • Transformers from scratch
  • Data Structures & Algorithms
  • System design

Where I've worked

  1. Software Engineer II, Generative AI · CloudLex

    Jan 2023Present
    • Built a GraphRAG pipeline (LiteLLM + Cosmos DB on Azure) indexing 10K+ legal documents — improving retrieval precision ~35% and cutting latency ~40%.
    • Automated model-deployment pipelines via Azure DevOps CI/CD, enabling zero-downtime rollouts across multi-tenant environments.
    • Developed multiple LLM chatbots (helpdesk, multi-tenant intake, PIP/MVA) with FastAPI and LangChain, automating client intake and cutting manual workload ~50%.
    • Created a natural-language voice/IVR system with Azure Speech and PSTN integration that handles 400+ monthly calls autonomously, reducing response time ~60%.
    • Designed multi-agent LLM systems for summarization, document review, and task generation using state-machine orchestration and Redis caching.
    • Optimized token usage, API throughput, and inference speed — cutting cloud costs ~22% — and maintained 99.9% uptime across the GenAI products.
  2. Software Developer Intern · Eaton

    Jun 2022Jul 2022
    • Built and optimized Angular UI modules for customer management and standardized REST API integration, improving render performance.

Let's build something

I'm open to AI / GenAI engineering roles where I can own systems end-to-end. The fastest way to reach me is email.