Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation Python Docker SQL AWS CI/CD

Manpreet Kour

Screened

Senior Data Scientist specializing in Generative AI and NLP

Seattle, USA6y exp

SOTIDr. B. R. Ambedkar National Institute of Technology, Jalandhar

“ML/NLP engineer with recent Scotiabank experience building production-grade indexing automation over large-scale emails and customer databases, combining LLM fine-tuning (Mistral, XLM-R) with fuzzy matching to exceed 95% accuracy under strict banking constraints. Also built a RAG-based chat agent using Gecko embeddings, Vertex AI Search, Gemini, and cross-encoder reranking, and delivered a text-to-SQL chatbot at SOTI through iterative fine-tuning and benchmark-driven experimentation.”

Machine Learning Deep Learning Generative AI Computer Vision PyTorch PySpark+92

View profile

Sharanya Rao

Screened

Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG for finance and healthcare

Remote, USA3y exp

Ally FinancialUniversity of Maryland, Baltimore County

“Built an AI lending assistant (RAG + DeBERTa) used by credit analysts to retrieve policies and past loan decisions, tackling real production issues like hallucinations, document quality, and sub-second latency. Deployed a modular, Dockerized AWS architecture (ECS/EMR + load balancer) with load testing, caching/precomputed embeddings, and CloudWatch monitoring, and used Airflow to automate scheduled data/embedding/vector DB refresh pipelines with retries and alerts.”

Python PySpark SQL Pandas NumPy Scikit-learn+133

View profile

Yogita Adari

Screened

Mid-level AI Engineer specializing in generative and multimodal systems

San Francisco, CA4y exp

Handshake AISyracuse University

“Built and productionized an agentic LLM automation system for an insurance client to determine medication eligibility, using prompt-chaining plus a RAG pipeline over policy rules and deploying on AWS (Lambda/Step Functions, Bedrock) with a serverless architecture. Addressed major data/schema mismatch issues via a semantic matching pipeline and validated performance through human agreement scoring, A/B testing, KPI monitoring, and confidence-based human-in-the-loop review.”

Python Java C MySQL PostgreSQL MongoDB+161

View profile

Aaron Lao

Screened

Intern Software Engineer specializing in agentic RAG and full-stack web development

San Francisco, CA1y exp

ConnectionLoopsUniversity of San Francisco

“Entry-level software engineer who built an agentic AI backend in Python/FastAPI, including APIs for conversation history retrieval and user data storage, and worked through async/concurrency challenges for multiple agents querying simultaneously. Also has practical AWS experience using S3 for static hosting with Lambda and RDS for backend/data access.”

Python Java SQL PostgreSQL JavaScript Node.js+67

View profile

Surya Pavan

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and LLM applications

Baltimore, MD5y exp

AcerCalifornia State University, Northridge

“GenAI engineer who has deployed production LLM/RAG chatbots for internal document search, focusing on reliability (hallucination reduction via prompt guardrails + retrieval filtering) and performance (latency improvements via caching). Experienced with LangChain/LangGraph orchestration for multi-step agent workflows and iterates using monitoring/logs and benchmark-driven evaluation while partnering closely with product and business teams.”

Agentic AI Amazon EC2 Amazon S3 AWS IAM AWS Lambda Azure Blob Storage+153

View profile

Shree Gopalakrishnan

Screened

Entry-Level AI/ML Engineer specializing in LLM apps, RAG pipelines, and production ML systems

1y exp

iFrog Marketing SolutionsUC San Diego

“AI/LLM practitioner at iFrog Marketing Solutions who drove a RAG chatbot from prototype to production in a legacy, AI-resistant environment by validating customer needs and building a business case. Implemented production-grade LLM practices (CI/CD eval gating, rollbacks, prompt/context engineering) and led internal workshops to bring non-AI-native developers up to speed while partnering with sales on tailored demos to drive adoption.”

Machine Learning LangChain Retrieval-Augmented Generation (RAG)OpenAI API Hugging Face PyTorch+87

View profile

Yogendra Nalam

Screened

Mid-level Data Scientist specializing in ML, NLP, and Generative AI

Michigan, USA3y exp

Ally FinancialUniversity of Michigan-Dearborn

“GenAI/ML engineer with production experience at Cognizant and Ally Financial, building end-to-end LLM/RAG systems and ML pipelines. Delivered a domain chatbot trained from 90k tickets and 45k docs, improving intent accuracy (65%→83%), scaling to 800+ concurrent users with 99.2% uptime and sub-150ms latency, and driving +14% customer satisfaction. Strong in Azure ML + DevOps CI/CD, Dockerized deployments, and explainable/PII-safe modeling using SHAP/LIME to satisfy stakeholder trust and GDPR needs.”

Agile Anomaly Detection API Development AWS Azure DevOps Azure Machine Learning+107

View profile

Ashutosh Gupta

Screened

Senior Backend Engineer specializing in AI/LLM and Healthcare Claims

8y exp

UnitedHealth GroupIndiana University Bloomington

“JavaScript/React performance-focused engineer who contributed upstream to an open-source virtualization/pagination library, fixing overlapping-fetch race conditions and introducing prefetch/deduping patterns that cut load times from ~3s to <900ms and reduced render thrash ~35%. Also built healthcare automation systems (clinical summary and claims triage), including a FastAPI + RAG pipeline that retrieved CPT/ICD evidence, improving decision accuracy from 67% to 86% and reducing turnaround time by 40%.”

Python Java JavaScript TypeScript SQL Bash+130

View profile

Anudeep Nayak

Screened

Junior Software Engineer specializing in cloud APIs, security testing, and AI web apps

Boulder, Colorado3y exp

ActualizeUniversity of Colorado Boulder

“Software engineer with experience delivering customer-facing and internal tools across GE Renewables, GE Healthcare (supply chain/production systems), and a Boulder-based event app startup. Recently focused on scaling backend performance using Redis and RabbitMQ, and has hands-on experience resolving hard-to-reproduce production issues in legacy authentication/session systems; also deployed a personal project (Journal Buddy) publicly.”

Python Java JavaScript TypeScript SQL MySQL+56

View profile

Harsh Tripathi

Screened

Mid-level Machine Learning Engineer specializing in LLMs, agentic AI, and risk/fraud modeling

San Francisco, CA3y exp

The Research Foundation for SUNYUniversity at Buffalo

“Built and productionized an agentic LLM workflow during a summer internship to transform unstructured clinical reports into analytics-ready structured data, using a LangChain multi-agent design plus an LLM-as-a-judge layer to control quality in a regulated setting. Also has experience orchestrating ML pipelines at Piramal Capital using AWS Step Functions/EventBridge/CloudWatch, with strong emphasis on observability, evaluation rigor, and measurable impact (80–90% reduction in manual data entry).”

Python C++SQL Java Large Language Models (LLMs)Multi-Agent Systems+97

View profile

Sathwik Pattem

Screened

Mid-Level Full-Stack Software Engineer specializing in cloud-native microservices and data pipelines

New York, NY4y exp

DeloitteSaint Louis University

“Engineer with Deloitte experience building real-time analytics products and scalable Kafka/Go/Postgres pipelines, plus production LLM features using RAG and embeddings. Demonstrates strong focus on performance, reliability, and guardrails/evaluation loops to reduce hallucinations and improve real-world AI system quality.”

Go Python TypeScript JavaScript Java SQL+76

View profile

Samuel Park

Screened

Mid-level Software Engineer specializing in backend systems, DevOps/SRE, and AI workflows

Los Angeles, CA5y exp

xAIUC Santa Barbara

“Built an end-to-end automated trading system for Polymarket, including Go/Python execution services, Terraform-scheduled ETL/feature pipelines, and monitoring on modest hardware. Also shipped a production LLM+RAG signal verifier/explainer that grounds trade decisions in external context (news/social) with vector DB retrieval and guardrails, plus a lightweight RAGAS-style eval loop on ~50 resolved markets that improved signal faithfulness by ~15%.”

Go Python JavaScript TypeScript Node.js React+73

View profile

Harshitha Mittapalli

Screened

Mid-Level Full-Stack Software Engineer specializing in cloud-native and GenAI solutions

Remote, USA5y exp

Capital OneUniversity of North Carolina at Charlotte

“Built and shipped production RAG-based LLM agents automating multi-step document query workflows, emphasizing reliability via monitoring, retries, structured exception handling, and fallback retrieval (alternative embeddings/keyword search). Demonstrated measurable gains (18% latency improvement, 25% retrieval efficiency, 12% precision) and has experience integrating agents with messy tax and transaction data at RSM using validation/cleaning and idempotent design.”

Large Language Models (LLMs)LangChain Retrieval-Augmented Generation (RAG)Prompt Engineering Generative AI Google Gemini+90

View profile

Yukta Chikate

Screened

Mid-level Machine Learning Engineer specializing in safety-critical and uncertainty-aware ML systems

Brooklyn, NY5y exp

MTech DistributorsNortheastern University

“Built and productionized an LLM-powered assistant for company documents and support questions, focused on reducing time spent searching PDFs/policies/tickets while preventing hallucinations by grounding answers in approved sources. Demonstrates strong production engineering (Kubernetes/orchestration, caching, monitoring, fallbacks) plus security-minded permissioning and close collaboration with operations/support stakeholders.”

Machine Learning Predictive Modeling Multi-Agent Systems Root-Cause Analysis Statistical Analysis Anomaly Detection+102

View profile

Rohan Varma Bandari

Screened

Mid-level Data Scientist & Generative AI Engineer specializing in LLMs and RAG

USA4y exp

Wells FargoUniversity of North Texas

“Built production LLM + hybrid RAG and multi-agent orchestration systems at Wells Fargo to automate complaint document/audio transcript understanding and categorization, addressing vocabulary drift via embedding + vector index updates instead of frequent retraining. Strong in LLM workflow reliability (testing/benchmarks/observability) and stakeholder-facing delivery with explainability (citations/SHAP-style justifications) and Tableau dashboards.”

Python SQL Jupyter Notebook Amazon SageMaker Visual Studio Code NumPy+128

View profile

Ramcharan Reddy

Screened

Mid-level Full-Stack Developer specializing in FinTech platforms and cloud-native microservices

Texas, USA6y exp

Morgan StanleyUniversity of Central Missouri

“Backend engineer focused on AI-enabled systems, having built a production-style RAG pipeline (vector search + LLM) exposed via Python/Flask endpoints with strong observability and hallucination-reduction techniques. Demonstrates deep performance work in PostgreSQL/SQLAlchemy (5x faster analytics queries) and high-throughput optimization using Celery + Redis (800ms to 120ms latency, 3x throughput), plus schema-per-tenant multi-tenancy with tenant-aware middleware and logging.”

Java Python JavaScript SQL Bootstrap React+125

View profile

Harideep Balusa

Screened

Mid-level AI/ML Engineer specializing in FinTech risk, fraud detection, and GenAI/RAG systems

USA6y exp

Freddie MacUniversity of Wisconsin

“Built and productionized Azure-based LLM/RAG systems for regulatory/compliance use cases, including automating analyst research and compliance report generation across large unstructured document sets. Demonstrates strong practical depth in hallucination mitigation, hybrid retrieval tuning (BM25 + embeddings), and production MLOps (Databricks, Cognitive Search, AKS, Airflow/MLflow), plus proven ability to deliver auditable, explainable solutions with non-technical compliance teams.”

Python R SQL Scala Machine Learning Deep Learning+125

View profile

Nagaveda Sai Kumar Reddy Gajjala

Screened

Mid-Level Full-Stack Software Engineer specializing in distributed systems and cloud integrations

Richardson, TX4y exp

PaycomUniversity of Texas at Dallas

“Backend engineer with enterprise SaaS experience (Zoho) who owned an end-to-end cloud integration between Endpoint Central and ServiceDesk Plus, redesigning device onboarding across 64+ scenarios and building a fault-tolerant sync engine that recovered 100% failed transactions. Also built and operated production systems across the stack—FastAPI services with strong testing/observability, React+TypeScript portals, PostgreSQL performance tuning, and AWS deployments with real incident response (RDS CPU saturation resolved with zero downtime).”

.NET Agile Algorithms Azure App Service Azure Blob Storage Bash+112

View profile

Ojasmitha Pedirappagari

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms

Jersey City, NJ5y exp

Nurture HoldingsUC Santa Cruz

“Built and shipped a production RAG-based assistant that lets parents ask natural-language questions about their child’s learning progress, using pgvector retrieval (child-id filtered) and Redis caching to hit ~180ms latency. Implemented real-world guardrails and compliance (Llama Guard, COPPA, retrieval thresholds, fallbacks) with 99.5% uptime, and ran human-in-the-loop eval loops that improved satisfaction from 3.8 to 4.2 while serving 60k+ monthly users and reducing costs significantly.”

Python SQL C#TypeScript JavaScript AWS+83

View profile

Zubair Shaik

Screened

Mid-level Full-Stack Developer specializing in AI-driven FinTech platforms

Remote, USA4y exp

Bank of AmericaIndiana Wesleyan University

“Built and productionized an LLM-powered loan decisioning agent at Bank of America, integrating RAG with microservices to automate creditworthiness assessment and recommendations. Emphasizes real-world reliability and governance (EKS autoscaling, observability, SOC2/PCI security controls), and drove measurable outcomes including 20% faster loan decisions and a reduction in agent failures/fallbacks to under 2% through schema enforcement and confidence-based routing.”

Python Java Spring Boot Flask FastAPI TypeScript+134

View profile

Tara Worrell

Screened

Senior Software Engineer specializing in backend systems, AI/LLM integration, and cloud infrastructure

Brooklyn, NY6y exp

AuraCodeBinghamton University

“Backend engineer with experience in highly regulated and high-stakes systems, including an airline crew messaging platform requiring near-zero-error real-time operations and a HIPAA-compliant mental health application built from an early-stage concept. They also show strong operational maturity, having owned a GoDaddy production incident through resolution and then led deployment pipeline improvements that reduced build failures by 40% and doubled deployment frequency.”

TypeScript Python Node.js AWS Full-Stack Development LLM Integration+124

View profile

Sai Vivek Reddy Gankidi

Screened

Mid-level Generative AI Engineer specializing in LLMs and RAG systems

5y exp

Summit Design and TechnologyNorthwest Missouri State University

“Built and shipped a production RAG-based enterprise knowledge assistant to replace slow/inaccurate search across millions of documents, using LangChain orchestration with GPT-4/LLaMA and vector databases. Strong focus on production constraints—latency, hallucination control, and cost—using hybrid retrieval, guardrails, LLM-as-judge validation, and model routing, and has experience translating non-technical stakeholder pain points into measurable outcomes.”

Python PyTorch TensorFlow Keras Hugging Face Transformers+82

View profile

BHEEMA SABILLA

Screened

Mid-level Data Engineer specializing in Lakehouse, Streaming, and ML/LLM data systems

Remote, USA3y exp

DiscoverUniversity of South Dakota

“Built and productionized an enterprise retrieval-augmented generation platform for internal knowledge over large unstructured corpora, emphasizing trust via strict citation/grounding and hybrid retrieval (BM25 + FAISS + cross-encoder re-ranking). Demonstrates strong scaling and cost/latency optimization through incremental indexing/embedding and index partitioning, plus disciplined evaluation/observability practices. Has experience operationalizing pipelines with Airflow/Databricks/GitHub Actions and partnering closely with risk & compliance stakeholders on auditability requirements.”

Python PySpark SQL Scala Pandas NumPy+157

View profile

SASI PAILA

Screened

Mid-level AI/ML Engineer specializing in Generative AI and production ML systems

PA, USA4y exp

BNY MellonFranklin University

“Built and deployed a production SecureAIChatBot (RAG-based) for secure internal information retrieval, using embeddings/vector search, GPT models, monitoring, and safety filters. Focused on real-world production challenges like latency and output consistency, applying caching, retrieval scoping, smaller models, and controlled prompting, and used LangChain to orchestrate the end-to-end workflow.”

CI/CD Cross-Functional Collaboration Data Analytics Data Science Docker Documentation+56

View profile

Software Engineers Machine Learning Engineers Data Scientists AI Engineers Research Assistants Software Developers AI & Machine Learning Engineering Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?