
Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation Python Docker SQL AWS CI/CD

Nikhil Soni

Screened

Junior AI/ML Engineer specializing in LLM systems and retrieval-augmented generation

New York, NY · 2y exp

Quant AI Research · NYU

“Built and deployed a production LLM-powered market intelligence and decision-support platform for noisy, real-time financial data, using a high-throughput embedding + vector DB RAG architecture to reduce hallucinations while keeping latency and cost low. Operated it at scale with GPU-backed inference (continuous batching/quantization), FastAPI on Kubernetes, and Airflow-orchestrated ingestion/embedding/retraining workflows, with strong schema-based reliability and monitoring.”

Python, SQL, C, C++, Java, HTML, +120 more


Manasa Mangipudi

Screened

Mid-level Machine Learning Engineer specializing in NLP and computer vision

3y exp

Columbia University · Rutgers University–New Brunswick

“AI/ML engineer with production experience building an LLM-powered resume-to-job matching and feedback product using RAG, with a strong focus on latency, hallucination control, and scalable deployment. Experienced orchestrating ML inference and backend services on Kubernetes and applying rigorous evaluation/guardrail practices; also partnered with business/product stakeholders at Walmart to improve an NLP-based supplier support system.”

Python, Java, R, SQL, C++, MATLAB, +106 more


Jhansi Bendi

Screened

Senior Software Engineer specializing in cloud-native microservices and event-driven systems

Antioch, CA · 18y exp

Sephora · Rashtriya Sanskrit Sansthan

“Senior engineer/tech lead with 18+ years building large-scale distributed applications, specializing in performance and reliability improvements. Recently owned multiple apps on an email personalization team, shipping major optimizations (including a push-update feature and audience-count architecture redesign) that reportedly lifted system performance from ~50% to ~99% while also leading code standards, reviews, and mentoring.”

AngularJS, Apache Kafka, API Gateway, Azure DevOps, Backend Development, ChatGPT, +197 more


Ruthvik Bacha

Screened

Mid-level Data Engineer specializing in financial data pipelines and reliability

North Carolina, USA · 7y exp

Wells Fargo · University of South Florida

“Systems/robotics-oriented software engineer focused on real-time orchestration and reliability: built a central control layer coordinating multiple concurrent agents with safe state machines, failure isolation, and recovery. Has hands-on ROS/ROS 2 integration experience in simulation (DDS/QoS, lifecycle, nodes in Python/C++) and emphasizes observability (structured JSON logs, correlation IDs) and low-latency control-loop performance under load.”

Python, Distributed systems, State management, Docker, Containerization, Debugging, +85 more


Sanjana Duvva

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

5y exp

Wells Fargo · University of North Texas

“Built and deployed an AWS-based LLM/RAG ticket triage and knowledge retrieval system (Pinecone/FAISS + Step Functions + MLflow) that cut support resolution time by 20%. Demonstrates strong production focus on hallucination reduction, PII security, and low-latency orchestration, with measurable evaluation improvements (e.g., ~25% grounding accuracy gain via re-ranking) and proven collaboration with support operations stakeholders.”

Python, SQL, Java, Scala, Shell Scripting, TypeScript, +153 more


Nishantkumar Asodariya

Screened

Mid-level Supply Chain Analyst specializing in global logistics automation and forecasting

USA · 4y exp

Honeywell · Indiana Wesleyan University

“Built and shipped a production LLM-powered recruiting workflow that ranks resumes against job descriptions, generates evidence-based justifications, and finds "hidden fit" candidates using embeddings + RAG. Demonstrates strong production engineering around hallucination control, latency, and predictable LLM cost management (budget checks, top-K pruning, tenant caps), plus orchestration experience with Airflow/Prefect/Kubernetes and a structured evaluation/monitoring methodology for AI agents.”

Automation, Communication, Contract Negotiation, Cross-Functional Collaboration, Data Analysis, Forecasting, +101 more


Sri Harshitha Yannam

Screened

Junior Software Engineer specializing in AI/ML and cloud platforms

Austin, TX · 2y exp

Amazon · University of Wisconsin–Milwaukee

“LLM/agent engineer who shipped a production "Memory Assistant" at HydroX AI, building a LangChain/LlamaIndex RAG memory pipeline on ChromaDB/FAISS with robust fallbacks (BERT/BART), prompt-injection mitigation, and 99.9% uptime monitoring. Also built a multi-step customer support agent using Rasa + OpenAI Assistants API with structured tool calling, guardrails, and human-in-the-loop escalation, and has experience hardening agents against messy ERP data via Pydantic validation, idempotency, and transactional outbox patterns.”

Python, Java, TypeScript, JavaScript, HTML, CSS, +177 more


Ishaan Nanal

Screened

Intern-level Software Engineer specializing in backend systems and AI/ML

Ithaca, NY · 1y exp

QuorAgra · Cornell University

“Built and shipped an LLM-powered RAG research copilot used by 20+ users across biology, physics, and ML, cutting literature review from days to minutes. Strong focus on production reliability—iterated on chunking/retrieval/prompting, added validation and modular pipelines for debuggability, and is now containerizing and scaling the system with Docker and GCP.”

Python, SQL, JavaScript, Java, C, C++, +75 more


Harrishkumar Loganathan

Screened

Mid-level AI/Machine Learning Engineer specializing in FinTech and Generative AI

Remote, USA · 3y exp

Socure · Arizona State University

“AI/ML engineer with hands-on ownership of enterprise LLM deployments at Freshworks, including a large-scale RAG chatbot serving 15,000+ users across six departments. Stands out for combining deep production engineering skills—AWS microservices, Kubernetes, observability, retrieval quality, and faithfulness evaluation—with strong cross-functional stakeholder leadership and prior large-scale fraud data pipeline experience at Socure.”

Python, R, PySpark, Node.js, JavaScript, TypeScript, +135 more


Drew Dunn

Screened

Senior AI Engineer specializing in generative AI and production ML systems

Aledo, TX · 14y exp

Elevance Health · Texas Tech University

“ML/AI engineer with hands-on ownership of production computer vision, speech, and legal RAG systems. Notably improved a key-duplication CV pipeline enough to unblock commercial launch and remove specialist manual measurement, and also shipped a live Quran recitation detection feature for a product with 1M+ users.”

Large Language Models, Generative AI, PyTorch, TensorFlow, FAISS, Transformers, +113 more


Rajeev Sai Nitturu

Screened

Mid-level Software Engineer specializing in cloud-native backend and AI systems

Long Beach, CA · 4y exp

JPMorgan Chase · California State University, Long Beach

“Candidate takes a disciplined, developer-in-the-loop approach to AI-assisted coding, using AI primarily for brainstorming, suggestions, and optimization while retaining full ownership of architecture and final code decisions. They also actively stay current on AI developments through research papers, communities, and emerging tools.”

Java, Python, TypeScript, JavaScript, SQL, Data Structures & Algorithms, +113 more


Vedant Jagtap

Screened

Junior AI/NLP Engineer specializing in LLM systems and applied research

New York, NY · 2y exp

NYU’s Center for Social Media, AI, and Politics · NYU

“LLM/agent engineer who shipped a two-stage AI recruitment screening platform at Foursquare that automated resume ingestion through behavioral assessment, delivering an 85% reduction in screening time across 5,000+ applications with auditability and confidence-gated decisions. Also built a multi-agent benchmarking framework using MCP tool interfaces and a RAGAS + LangSmith evaluation/observability stack, including async re-architecture that cut production latency by 50%.”

Python, JavaScript, TypeScript, SQL, R, Java, +162 more


Pooja Shindd

Screened

Mid-level Full-Stack Software Engineer specializing in scalable web and AI systems

Illinois, USA · 4y exp

University of Illinois Chicago Technology Solutions · University of Illinois Chicago

“Full-stack engineer who has built both a TypeScript-based HR/payroll platform and a production agentic AI support system end to end. Stands out for combining strong product judgment with deep LLM systems thinking: RAG architecture, confidence-based routing, evals, observability, and human-in-the-loop design in a greenfield environment.”

Java, Scala, Python, JavaScript, TypeScript, SQL, +108 more


Jash Shah

Screened

Mid-level Data Scientist specializing in LLMs, MLOps, and predictive analytics in healthcare and finance

New Jersey, USA · 4y exp

Johnson & Johnson · Stevens Institute of Technology

“Built and deployed a production LLM/RAG clinical decision support system that enables real-time semantic search over unstructured EHR notes and delivers patient risk insights. Strong in healthcare-grade MLOps and compliance (HIPAA, PHI handling, encryption, RBAC, audit logs) and scaled embedding/retrieval pipelines using Spark/Databricks and Airflow. Partnered with clinicians via Power BI dashboards and explainability, contributing to an 18% reduction in patient readmissions.”

A/B Testing, API Integration, Apache Airflow, Apache Hadoop, Apache Kafka, Apache Spark, +102 more


Susendranath Musani

Screened

Mid-level AI/ML Engineer specializing in GenAI, NLP, and MLOps

Connecticut, USA · 5y exp

Pfizer · University of New Haven

“Built and deployed an enterprise GenAI knowledge assistant over thousands of internal PDFs/reports using a RAG stack (GPT-4 + Hugging Face embeddings + vector DB) to reduce manual search and SME escalations. Uses LangGraph/LangChain to orchestrate modular agent workflows with relevance filtering and fallback handling, and applies rigorous evaluation (golden datasets, edge cases, A/B tests) with production monitoring metrics.”

A/B Testing, Agile, Apache Kafka, Apache Spark, AWS Lambda, BERT, +103 more


Pruthvik Elemati

Screened

Mid-level Software Engineer specializing in distributed systems and cloud-native backends

Dallas, USA · 5y exp

T-Mobile · Purdue University

“AI/LLM engineer with production experience at Charles Schwab building a RAG-based assistant to help 5,000+ reps answer complex financial policy questions. Implemented a multi-layer anti-hallucination approach (GNN-driven ontology/graph retrieval + citation-only answers) and compliance-focused guardrails (Azure AI Content Safety) in partnership with audit/compliance stakeholders.”

Python, Java, Go, TypeScript, Apache Kafka, Prometheus, +140 more


Shouhardik Saha

Screened

Junior Software Engineer specializing in ML, distributed systems, and LLM applications

Austin, TX · 1y exp

Zonda · UC San Diego

“Interned at Zonda where he built an AI-driven semantic search solution over ~280M housing/builder records. Iterated from local LLMs via llama.cpp quantization to a vector-embedding retrieval system, then boosted semantic accuracy with a custom spaCy NER layer and re-ranking, optimizing for latency through precomputation. Collaborated with economics-focused stakeholders to reduce manual document/paperwork time by enabling natural-language search over internal data.”

Python, Java, C, C++, C#, SQL, +100 more


Bhuvan Chandi

Screened

Mid-level Data Engineer specializing in AI/ML data platforms

NY, NY · 6y exp

BlackRock · Webster University

“Built and productionized an LLM-powered PDF document Q&A system to eliminate manual searching through long documents, focusing on scalability and answer reliability. Implemented semantic chunking (using headings/paragraphs/tables), overlap, and preprocessing/quality checks to reduce hallucinations, and orchestrated the end-to-end pipeline with Airflow using retries, alerts, and parallel tasks.”

Python, SQL, Shell Scripting, Apache Spark, PySpark, Apache Hadoop, +103 more


Sravani Kasaraneni

Screened

Mid-level Machine Learning Engineer specializing in NLP and cloud MLOps

CT, USA · 4y exp

ServiceNow · Rivier University

“Built and deployed a production LLM-powered internal documentation assistant using embeddings, a vector database, and a RAG pipeline to reduce time spent searching PDFs/manuals. Experienced in orchestrating end-to-end LLM workflows with Airflow/LangChain, improving reliability via monitoring/error handling, and driving measurable quality through retrieval and hallucination-focused evaluation metrics.”

SDLC, Agile, Waterfall, Python, R, Java, +104 more


Min-Han Shih

Screened

Junior Machine Learning Engineer specializing in speech and multimodal AI

Taipei, Taiwan · 2y exp

Furbo · USC

“New grad who has shipped a production vision-language recommendation feature for a pet camera/mobile app, including building a tagged video dataset with human annotators and optimizing inference by FPS downsampling under device compute limits. Also built a multimodal MLLM benchmark using an LLM-as-judge (GPT-5-thinking) with a feedback loop, validated against human scoring, and measured post-feedback quality gains (12% average score improvement).”

Python, C, C++, MySQL, Go, Apache Spark, +61 more


John Greenough

Screened

Junior Software Engineer specializing in AI, security, and cloud systems

Trondheim, Norway · 1y exp

Norwegian University of Science and Technology · University of Waterloo

“Built and deployed an LLM + RAG + memory system on a Furhat social robot, adding continuous face/voice recognition embeddings over WebSockets to enable persistent, natural conversations across sessions. Experienced working around real-world hardware/latency constraints and uses Datadog plus structured debugging/rollback practices for stabilizing customer-facing LLM workflows.”

Python, JavaScript, TypeScript, Java, Kotlin, SQL, +73 more


Sathwik Varikoti

Screened

Mid-level AI/ML Engineer specializing in Generative AI and Conversational AI

Remote · 5y exp

Infosys · University at Buffalo

“GenAI Engineer at Infosys who built and deployed a production multi-agent RAG system for a top-tier bank, scaling to ~50,000 queries/day with 99.9% uptime. Drove measurable gains (45% accuracy improvement, 30% API cost reduction) through open-source LLM fine-tuning, Pinecone indexing/retrieval optimization, and AWS-based MLOps/monitoring, and has experience enabling adoption via developer workshops and customer-facing collaboration.”

A/B Testing, Amazon Bedrock, Amazon EC2, Amazon S3, AWS Glue, AWS IAM, +99 more


Shanmukh Sai Madhu

Screened

Mid-level Data Engineer specializing in real-time pipelines and cloud analytics

Chicago, IL · 5y exp

JPMorgan Chase · University of South Dakota

“Researcher from the University of South Dakota who built a production medical RAG system to help interpret model predictions by retrieving relevant clinical notes and medical literature, overcoming retrieval accuracy and imaging-dataset challenges through semantic chunking and metadata-driven indexing. Also has hands-on orchestration experience with Airflow and Azure Data Factory, plus a pragmatic approach to LLM evaluation and stakeholder-driven iteration.”

Agile, Apache Airflow, Apache Kafka, Apache Spark, AWS, AWS Lambda, +122 more


Ramtin Khorrami

Screened

Principal Software Engineer specializing in AI/ML and cloud-native backend systems

New York, NY · 16y exp

McKinsey & Company · NJIT

“McKinsey data/ML practitioner who led production deployment of an entity resolution + semantic search platform for unstructured finance and healthcare data, integrating with legacy systems under HIPAA constraints. Deep hands-on stack across transformers (spaCy/HF BERT), embeddings + FAISS, and production MLOps/workflow tooling (Airflow, Docker, CI/CD, Prometheus/Grafana), with reported gains of +30% decision speed and +25% search relevance.”

Python, SQL, R, Ruby, Java, JavaScript, +124 more


