AI/LLM Technical Lead Β· Forward Deployed Engineer Β· Production AI Systems at Scale
π Manchester, UK Β· π’ Microsoft Industry Solutions Delivery
I build and ship production-grade AI systems for enterprise customers across financial services, sports, construction, and energy industries. My work sits at the intersection of customer problem solving and frontier AI deployment β from discovery and architecture through to production and evaluation.
At Microsoft's Industry Solutions Delivery team, I embed directly with enterprise customers to co-design and build AI applications.
- Multi-agent systems β parallel agent architectures, tool routing, Semantic Kernel, Microsoft Agent Framework
- RAG pipelines β hybrid search, semantic reranking, query planning, document intelligence
- MCP (Model Context Protocol) β building and integrating MCP tools and servers for real-time agent data retrieval
- Evaluation frameworks β LLM-as-judge, golden datasets, CI/CD integrated quality monitoring, Azure AI Evaluation SDK
- Production LLM deployment β prompt engineering, grounding, PII detection, content safety, session management
- LLMOps β observability, autoscaling, Redis caching, performance tuning at scale
| Area | Tools |
|---|---|
| Daily driver | Visual Studio Code , Claude models β agent mode, MCP, skills |
| LLMs | GPT models, Claude models |
| Orchestration | Semantic Kernel, Microsoft Agent Framework, LangChain |
| Search & Retrieval | Azure AI Search (hybrid + semantic), Vector DBs, Redis |
| Evaluation | Azure AI Evaluation SDK, LLM-as-judge, Prompt Flow |
Languages:
Python Β· C#/.NET Β· TypeScript Β· JavaScript Β· SQL
AI & Cloud:
Azure OpenAI Β· Azure AI Search Β· Document Intelligence Β· AI Content Safety
Container Apps Β· Azure Functions Β· APIM Β· App Gateway (WAF) Β· Cosmos DB Β· Redis
DevOps & Tools:
Azure DevOps Β· GitHub Actions Β· Docker Β· Application Insights
Sports Platform β Multi-Agent LLM System
Agentic RAG chat system handling thousands of RPS across catalogue and conversational queries. Parallel agent architecture with MCP tools for real-time player and club data retrieval. Custom evaluation framework baked into CI/CD pipeline.
Construction β AI-Powered RFP Bidding System
Intelligent document processing with geospatial RAG, winning proposal analysis, and conversational interface. Background evaluation pipeline via queue-based processing for continuous quality monitoring.
Financial Services β Enterprise RAG Chatbot
End-to-end RAG implementation with Azure AI Search and Semantic Kernel. Defined evaluation golden datasets with stakeholders and built LLM-as-judge scoring for safe enterprise deployment.
Energy β AI Timesheet Validation Workflow
AI-driven automation for procurement timesheet validation at one of the world's largest oil corporations.
- I've taken customers from "is this even possible?" to production β across PoCs, greenfield builds, and legacy integrations
- I build evaluation frameworks first β not as an afterthought
- I push back on customers when needed and define what "good" actually looks like
- I work at both ends: whiteboard and terminal
- MCP server design patterns and tool boundary architecture in agentic systems
- Evaluation strategies for text-to-SQL and open-ended LLM responses
- AI safety and interpretability in enterprise deployment contexts
- πΌ LinkedIn
- βοΈ vijaycse@live.in
β‘ Fun fact: The hardest part of enterprise AI isn't the model β it's getting the right people in the room to define what good looks like.


