No cost, no commitment - we'll make a personal intro
Somil Shah
Mid-level AI/ML Engineer specializing in generative AI, RAG platforms, and LLM agents
INTERACT Animal LabNortheastern UniversitySan Francisco, CA4 Years ExperienceMid LevelWorks On-Site
Connect with Somil
Somil already has a relationship with Reval, so a warm intro from us gets a much better response than cold outreach.
Typically responds within 24 hours
Recommended
Already have an account?
About
AI/LLM engineer who has shipped 10+ production applications, including InvestIQ on GCP—a production-grade RAG due-diligence engine that ethically scrapes web/PDF sources, builds a ChromaDB knowledge base, and delivers analyst-style dashboards plus a citation-backed chat copilot. Deep focus on reliability (evidence-only answers, hard citations, refusal gating), retrieval tuning, and orchestration (Airflow/Cloud Composer), plus multi-agent systems (CrewAI with 7 specialized finance agents).
Hire with Reval
Find your next great hire
Our AI agents source, screen, and vet candidates for your open roles. Get qualified candidates within 48 hours.
Mid-level Software Engineer specializing in ML platforms and cloud-native backend systems
San Francisco, CA5y exp
City and County of San FranciscoSan Francisco State University
“Software engineer with experience at Google and the City and County of San Francisco building production AI systems, including a RAG-based internal support chatbot and ML-driven ticket priority tagging. Has scaled data/ML platforms with Airflow on GCP (1M+ records/day, 99.9% SLA) and deployed multi-component systems with Docker and Kubernetes (GKE), using modern LLM tooling (LangChain/CrewAI, Claude/OpenAI, Pinecone/ChromaDB, Bedrock/Ollama).”
Mid-level AI/ML Engineer specializing in production RAG systems and MLOps
Athens, GA4y exp
University of GeorgiaUniversity of Georgia
“Built and deployed a GPT-4 + Pinecone RAG system that lets users query large internal document collections with grounded, cited answers. Demonstrates strong applied LLM engineering (chunking experiments, hallucination controls, metadata recency boosting) plus production-minded evaluation/monitoring and performance tuning (rate-limit mitigation via pooling/batching). Also effective at translating complex AI concepts to non-technical stakeholders through prototypes and live demos, helping secure client sponsorship.”
Junior AI Engineer specializing in Generative AI, RAG, and NLP
Remote, US3y exp
TickerIndiana University Bloomington
“AI/LLM engineer who has shipped a production RAG platform at Ticker Inc. on GCP (Qdrant + Postgres) delivering sub-second retrieval over 550k+ items, with measurable gains in latency and answer quality (HNSW optimization, MMR re-ranking). Also built an asynchronous LangChain/LangGraph multi-agent research system (10x faster cycles) and partnered with Indiana University doctors on synthetic patient records and ML error analysis using clinician-friendly F1/loss dashboards.”