SHALIN BHATT
Technical Lead & AI Systems Architect
shalin.dev@proton.me | 415-490-7852 | San Francisco Bay Area
shalinbhatt.dev | github.com/shalin-dev | linkedin.com/in/shalinkb
Professional Summary
Engineering leader with 12 years building and leading teams in regulated enterprise environments — healthcare platforms serving 50K+ daily users, HIPAA-compliant, 99.95% uptime.
Built production RAG pipelines with multi-model routing (OpenAI, Anthropic, Groq) and hallucination mitigation through source-grounded retrieval. Led AI-assisted coding pipelines and owned technical RFCs that set the quality bar across a 12-person team. Outside of work, build autonomous multi-agent systems, LLM evaluation frameworks, and AI observability infrastructure.
Managed teams of 8-15 engineers with 90% retention. Hired 10+ engineers. Served as interim Engineering Manager during leadership transitions, reporting directly to VP.
Seeking Head of AI Engineering, Engineering Manager, or Staff-level roles where autonomous systems or AI infrastructure is central to the product.
Key Achievements
15 Engineers Led & Managed (Core team + project teams) | 10 Engineers Hired (Recruited, interviewed, and made hiring decisions) | 90% Team Retention (Industry avg: 60-70%) | 5+ Engineers Promoted to Senior Roles (from mentored intern/junior) |
50K+ Daily Users Served | 40-60% Faster Development via AI Tooling |
Core Skills
Autonomous Agent Systems: Multi-Agent Architecture · Agent Orchestration · Autonomous Decision Systems · Concurrent Execution & State Management · Self-Healing & Failure Recovery · Tool Use & Function Calling
Production LLM Engineering: Retrieval-Augmented Generation (RAG) · LLM Evaluation Frameworks · LLM-as-Judge Patterns · Multi-Model Routing & Fallback · Prompt Engineering · Context Window Management
AI Reliability & Observability: Agent Workflow Tracing · Production AI Monitoring · Hallucination Mitigation & Guardrails · Custom Evaluation Pipelines · Latency Optimization · Model Benchmarking
AI Infrastructure: Vector Databases & Semantic Search · Memory-Augmented Reasoning · Knowledge Persistence · Containerized AI Deployment · API Design for LLM Systems
Leadership: Team Management (8-15 Direct Reports) · Performance Reviews · Technical Hiring (10+) · Roadmap Planning · Cross-Functional Leadership · Mentorship & Career Development
Regulated & Enterprise Systems: HIPAA Compliance · Audit Trails · Zero-Downtime Deployment · Security-First Architecture · Enterprise Scale (50K+ Users)
Technical Stack: Python · LangGraph · LangChain · OpenAI · Anthropic · Groq · Ollama · ChromaDB · FastAPI · Docker · Kubernetes · PostgreSQL · Redis
Professional Experience
Technical Lead | STERIS CORPRichmond, CA | Dec 2021 – Present
AI-Assisted Engineering & RAG
- Built production RAG pipelines for internal knowledge retrieval and document search — multi-model routing (OpenAI, Anthropic, Groq) with hallucination mitigation through source-grounded retrieval and automated response verification
- Led AI-assisted coding pipelines — automated code review, testing, and documentation generation integrated into daily development workflows
- Reduced AI development cycle time by 40-60% through AI-first workflows and agentic coding tools adopted across a 12-person team
Engineering Leadership
- Managed 8-15 direct reports with 90% retention vs 60-70% industry average — conducted performance reviews, ran 1:1s, and mentored 5+ engineers into senior roles
- Served as interim Engineering Manager during two leadership transitions — reported directly to VP on roadmap, risks, and strategic decisions
- Owned technical RFCs and architecture decisions across multiple product initiatives — set engineering quality standards, drove technical clarity, and balanced technical debt reduction with new feature delivery
- Led technical hiring: designed interview process, evaluated candidates, and made 10+ hiring decisions
Platform Engineering
- Led platform engineering serving 50K+ daily users across 500+ enterprise installations with 99.95% uptime in HIPAA-regulated environment
- Designed CI/CD pipelines with Docker and Kubernetes — 70% faster deployments across staging and production with zero-downtime releases
- Delivered 15+ major feature releases including internationalization for 4 markets — managed cross-functional initiatives with Product, Design, Finance, and Operations
Tech: Python · LangGraph · OpenAI · Anthropic · Groq · Docker · Kubernetes · PostgreSQL · Redis
Senior Software Engineer | STERIS CORPRichmond, CA | Oct 2017 – Nov 2021
- Owned backend platform architecture supporting multiple product lines — full lifecycle from technical design through production deployment
- Led Scrum process across multiple teams — sprint planning, retrospectives, and technical roadmap ownership driving 15+ feature releases
- Hired and mentored engineers — recruited talent, ran technical interviews, and grew junior developers into senior roles
- Built CI/CD pipelines and testing frameworks for containerized Linux environments — improved deployment reliability and reduced manual overhead
Tech: Ruby on Rails · PostgreSQL · JavaScript · Docker · Kubernetes · Jenkins · Linux
Selected AI Projects
Local AI Application Suite (github.com/shalin-dev/local-ai-suite)Privacy-focused 4-app AI platform running 100% locally: Personal AI Assistant (multi-model chat & image generation), Code Documentation Generator (auto-docs for any codebase), Local RAG System (semantic search with citations), and AI Image Classifier (auto-tagging & face recognition). Fully containerized with Docker. Demonstrates production-ready AI architecture for enterprise privacy requirements.
Tech: React · TypeScript · FastAPI · Ollama · ChromaDB · Docker
LLM Eval Harness (github.com/shalin-dev/llm-eval-harness)LLM evaluation framework with LLM-as-judge pattern, consistency testing, and RAG evaluation. Built 6 automated evaluators (accuracy, consistency, latency, cost, LLM-as-judge, RAG quality) with multi-provider runners for model selection at scale.
Tech: Python · Sentence Transformers · PyTorch · Click · Ollama · OpenAI API
AI Observe (github.com/shalin-dev/ai-observe)AI observability layer for production LLM applications. Decorator-based agent workflow tracing, web dashboard with timeline visualization, and framework integrations for Ollama, CrewAI, and LangChain to make AI decisions transparent and debuggable.
Tech: Python · FastAPI · SQLAlchemy · Jinja2 · SQLite · PostgreSQL
Claude Chat ExportChrome extension for exporting Claude AI conversations to markdown, JSON, or text format. Built to preserve important AI interactions for documentation and knowledge management workflows.
Tech: JavaScript · Chrome Extensions API · DOM Manipulation · Markdown
DevTab ManagerDeveloper-focused tab management Chrome extension for organizing browser sessions, saving workspace states, and quickly restoring development environments. Reduces context-switching overhead for multi-project workflows.
Tech: JavaScript · Chrome Storage API · Session Management · IndexedDB
Open Source Contributions
OpenCode — Contributed bug fixes for terminal UI stability and session management. (Go, TUI)
Multica — Contributed Linux desktop integration fixes including native file handling and window management. (TypeScript, Electron, Linux)
Zen Browser — Contributed fixes for tab management and window state handling. (TypeScript, CSS, Browser)
Education & Certifications
Project Management & Project Leadership — University of California, Berkeley (Extension) | PMI Accredited (Completed)
Bachelor of Engineering in Information Technology — Ahmedabad Institute of Technology (2009-2013)
Certified Scrum Master (CSM) — Scrum Alliance (2018, Renewed 2023)
Becoming a Manager Professional Certificate — LinkedIn Learning (2025)
Claude Code 4: Agentic Coding for Professional Developers — Anthropic (2025)
Microsoft AI Workshop — Microsoft (2025)
Building with AI: Building a Copilot with Azure AI Foundry — Microsoft (2024)
Docker Foundations Professional Certificate — Docker, Inc (2026)
Agent Skills with Anthropic — DeepLearning.AI (May 2026)