Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation Python Docker SQL AWS CI/CD

Hamad Alajeel

Screened

Intern Machine Learning & AI Automation Engineer specializing in ML workflows and AI hardware

Fort Lauderdale, FL0y exp

Revscale Technologies Inc.UC San Diego

“ML practitioner with hands-on experience adapting diffusion models (DDPM + U-Net in PyTorch) to improve low-dose CT medical imaging quality via denoising and upsampling against high-dose ground truth. Also built a RAG workflow during a recent internship by cleaning client survey data, embedding with OpenAI text-embedding-3-large, and indexing in Pinecone with MD5 deduplication, alongside a strong emphasis on production-grade Python practices.”

Azure DevOps C C++Data Pipelines Deep Learning Error Handling+116

View profile

Chirag Bellara

Screened

Mid-level Full-Stack Software Engineer specializing in cloud and AI-enabled applications

San Francisco, CA4y exp

One CommunityPurdue University

“Product-focused full-stack engineer (70/30 app vs infra) with Accenture experience and recent AI workflow work, shipping end-to-end systems from React/TypeScript UIs through FastAPI backends to Postgres. Built an AI-driven data extraction platform with async job APIs, strict schema validation, and strong observability, and has operated AWS ECS-based deployments with real incident mitigation (DB connection exhaustion/latency under traffic spikes).”

Python Java TypeScript JavaScript React Node.js+148

View profile

Rifat Mahfuz

Screened

Junior Backend Software Engineer specializing in microservices and API platforms

New York City, NY1y exp

ShareTripUniversity of Illinois Urbana-Champaign

“Backend engineer with strong performance and security instincts: built a Flask API for readability metrics with clean, testable modular design; optimized SQLAlchemy/Postgres to eliminate N+1 issues (800ms to 120ms). Also implemented an LLM-powered natural-language travel search using Claude Sonnet + Amadeus with RAG and anti-exploitation safeguards, plus multi-tenant isolation via Postgres RLS and Redis caching that cut search latency from ~20s to ~4–5s while reducing storage costs.”

TypeScript SQL Python Go C++Java+87

View profile

Kranthi Kumar Karupati

Screened

Mid-level Generative AI Engineer specializing in LLM apps, RAG, and MLOps

Remote, United States6y exp

AccentureEastern Illinois University

“LLM/GenAI engineer with US Bank experience building a production financial-document intelligence platform using LangChain/LangGraph, GPT-4, and Amazon OpenSearch. Delivered a RAG-based assistant for compliance/audit teams with grounded, cited answers, focusing on reducing hallucinations and latency, and deployed securely on AWS (SageMaker/EKS) with CI/CD and evaluation tooling (LangSmith, RAGAS).”

Amazon API Gateway Amazon Bedrock Amazon CloudWatch Amazon DynamoDB Amazon EKS Amazon ECS+168

View profile

Ramesh Jasti

Screened

Mid-level AI/ML & MLOps Engineer specializing in cloud AI infrastructure and GenAI

San Jose, USA5y exp

HPEWestern Illinois University

“At HPE, led and deployed an enterprise-grade LLM document intelligence platform for an insurance client, automating extraction from highly variable PDFs/scans/emails and raising field accuracy from 74% to 93%. Built a LangChain/Pinecone/OpenSearch RAG framework to cut hallucinations by 37% and operationalized LangSmith evals in CI, driving a 41% triage accuracy lift and >33% fewer incorrect resolutions while partnering closely with claims operations via HITL workflows.”

Python Bash PowerShell Go TensorFlow PyTorch+144

View profile

Varsha Hemakumar

Screened

Mid-level ML/AI Engineer specializing in NLP, RAG pipelines, and financial risk & fraud systems

USA3y exp

FintaUniversity at Buffalo

“Built and shipped LLM/RAG systems in finance and startup settings, including a Goldman Sachs document intelligence platform that indexed ~8TB of regulatory filings and delivered cited, conversational answers with <2s latency—cutting compliance research by ~4.5 hours per batch. Also developed LangChain-based agent workflows at Finta to automate CRM enrichment and investor lookup with strong testing, tracing (LangSmith), privacy guardrails, and auditability.”

Python R SQL MongoDB Pandas NumPy+95

View profile

Kamal Ede

Screened

Mid-level Data Engineer specializing in cloud data platforms, Spark, and streaming pipelines

MO, USA4y exp

S&P GlobalUniversity of Central Missouri

“Data/MLOps engineer (Cognizant background) who owned an AWS/Airflow/Snowflake healthcare transactions pipeline processing ~8–10M records/day and cut pipeline/data-quality incidents by ~33%. Also built and deployed a production FastAPI model-inference service on Kubernetes (Docker, HPA) with strong observability (Prometheus/Grafana), versioned endpoints, and resilient backfill/idempotent external data ingestion patterns.”

Python PySpark SQL Scala Batch Processing Data Transformation+119

View profile

Tejaswi Rao

Screened

Mid-level Machine Learning Engineer specializing in MLOps and GenAI analytics

Jersey City, New Jersey7y exp

MediacomStevens Institute of Technology

“ML/LLM practitioner who has deployed a production RAG-based trouble-call identifier using multiple datasets (device, network, past complaints). Experienced in end-to-end MLOps (FastAPI + Docker + Kubernetes with HPA) and in evaluating/monitoring LLM behavior to reduce hallucinations, with additional applied work in forecasting/anomaly detection and churn prediction for retention campaigns.”

Apache Airflow BigQuery C++CI/CD Classification Data Science+54

View profile

Sushma Sri B

Screened

Mid-level Full-Stack Engineer specializing in cloud-native microservices (FinTech/Healthcare)

Charlotte, NC5y exp

ADPUniversity of North Carolina at Charlotte

“Built and shipped production systems spanning real-time operational dashboards and an LLM-powered internal documentation assistant using RAG (embeddings + vector DB). Demonstrates strong focus on reliability and iteration: implemented guardrails and evaluation loops (human review, hallucination tracking, regression prevention) and improved performance/scalability through query optimization, caching, and retrieval tuning.”

Java Python JavaScript TypeScript Spring Boot Hibernate+99

View profile

Ansh Krishna

Screened

Intern Data Scientist specializing in ML systems and LLM-powered analytics

Noida, India1y exp

Data Security Council of IndiaUSC

“Built an autonomous decision analytics LLM agent for end-to-end tabular binary classification, using RAG (FAISS) to retain context across multi-step queries. Deployed as a FastAPI service with production-style reliability features (schema-aware validation, fallbacks, retries, structured outputs) plus offline/online evaluation and monitoring to reduce analysis time and improve consistency versus stateless approaches.”

A/B Testing Artificial Intelligence Backend Development C++Cloud Computing Data Structures and Algorithms+76

View profile

SASIREKHA GULIPALLI

Screened

Mid-level Data Analyst specializing in procurement, supply chain analytics, and applied machine learning

Alpharetta, GA4y exp

MotrexGeorgia State University

“Strategic sourcing professional specializing in seasonal apparel supply chains, combining Coupa/JD Edwards analytics with Excel/Python modeling and Power BI dashboards to drive cost reduction and OTIF gains. Notable for rapid mitigation of a 10-day factory delay affecting 12 holiday SKUs (preserved 95% of revenue) and for automating PO workflows to cut cycle time by 4.2 days and improve OTIF by 15%.”

A/B Testing Amazon EC2 Amazon S3 Bash BigQuery Classification+113

View profile

Rushir Bhavsar

Screened

Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training

1y exp

Cadence Design SystemsArizona State University

“Founding AI engineer (June 2024) at Talon Labs who built and productionized an LLM-powered chatbot for interacting with proprietary supply-chain documents, deployed at large scale (25–100,000 users). Experienced with RAG/LLM orchestration (LangChain, LlamaIndex, Groq AI) and production ops tooling (Kubernetes, Docker, Kubeflow, Airflow), with a metrics-driven approach to evaluation, observability, and stakeholder alignment.”

AI Agents Angular Apache Spark AWS AWS CloudFormation AWS Lambda+121

View profile

Joshua Hewitt

Screened

Senior Software Engineer specializing in Generative AI product development

San Francisco, USA9y exp

PadletUniversity of Sydney

“AI product builder at Padlet who shipped multiple production LLM features for education workflows, including an AI document generator (AI Recipes) and a RAG-enabled in-product chat assistant. Built an AI microservice layer (LangChain) to swap model providers easily and created automated + human-in-the-loop evaluation systems (including ~100-test runs) to iterate on prompts and quality.”

Python Ruby TypeScript JavaScript PHP FastAPI+66

View profile

Jax Diagana

Screened

Senior AI Engineer specializing in forward-deployed voice agents and incident-response automation

San Francisco, CA7y exp

AnaplanUniversity of St. Thomas

“FDE at Bland.ai and founder of Fi (incident-response agent) who routinely takes LLM/agentic concepts from prototype to production. Has hands-on experience reverse-engineering undocumented systems to deliver integrations, building LLM testbeds for voice-agent reliability, and rapidly shipping RAG/semantic search solutions (e.g., Confluence runbooks) after deep customer discovery with DevOps/SRE teams.”

A/B Testing AI Agents Automation Confluence Data Science dbt+66

View profile

Aniket Janrao

Screened

Junior Data Scientist specializing in healthcare ML and clinical NLP/LLMs

Houma, LA2y exp

Objective Medical Systems LLCUniversity at Buffalo

“Healthcare-focused LLM engineer who has built two production clinical applications: an automated structured clinical report generator from physician-patient conversations and a RAG-based chatbot for retrieving patient history (procedures, allergies, etc.). Demonstrates strong applied RAG expertise (overlapping chunking, entity dependency graphs, temporal filtering, graph RAG) to reduce hallucinations/omissions and partners closely with clinicians to automate hospital workflows.”

BERT C++Data preprocessing Data visualization Deep learning Docker+125

View profile

Manasa Reddy Nagendla

Screened

Mid-level Full-Stack Java Engineer specializing in microservices, cloud, and event-driven systems

Cincinnati, OH6y exp

Procter & GambleUniversity of Cincinnati

“Software engineer at Procter & Gamble focused on warehouse/operations systems, building near-real-time order/inventory visibility using Java/Spring Boot, React, Kafka, PostgreSQL, and Redis with measurable latency and load-time gains. Also shipped internal LLM/RAG knowledge assistants grounded in company runbooks and workflows, implementing guardrails and an evaluation loop that drove concrete retrieval improvements (document chunking) and regression prevention.”

Java Python Go Node.js C#SQL+161

View profile

Arya Mane

Screened

Junior Full-Stack & AI/ML Engineer specializing in LLMs and multimodal document processing

Dallas, Texas1y exp

Receptro.AIUniversity of Texas at Dallas

“Built a production RAG-based NBA player scouting assistant that embeds player profiles into FAISS, orchestrates retrieval and LLM recommendations with LangChain, and surfaces results via embedded Tableau dashboards. Demonstrates strong focus on evaluation/monitoring (batch tests, LLM-as-judge, latency/failure/token metrics) and has experience translating non-technical founder goals into DAPT + fine-tuning plans on curated data.”

Python SQL PyTorch TensorFlow scikit-learn Hugging Face+83

View profile

Sana Khan

Screened

Mid-level AI/ML Engineer specializing in MLOps, LLMs, and real-time inference in FinTech

Oklahoma, USA4y exp

Capital OneOklahoma Christian University

“ML/LLM engineer who has deployed a production LLM-powered assistant for intent classification and query routing (order recommendation/support deflection), combining BERT fine-tuning with an embedding-based retrieval layer and optimizing for low-latency inference. Experienced with end-to-end reliability practices—Airflow-orchestrated ETL, data validation/alerting, MLflow experiment tracking, and iterative improvements driven by user feedback and monitoring.”

Python SQL NumPy Pandas Bash PySpark+97

View profile

Saikrishna Vallala

Screened

Mid-level QA Automation Engineer / SDET specializing in Financial Services and Healthcare

USA5y exp

Morgan StanleyDePaul University

“Fintech-focused engineer who built an end-to-end KYC verification pipeline for advisor onboarding using Flask microservices, Celery/Redis, and AWS (Lambda/ECS/EC2) with CloudWatch-driven scaling and latency optimizations. Also shipped a production internal knowledge assistant using RAG + embeddings/vector search with guardrails (similarity-based fallback, prompt-injection protections) and an evaluation loop with compliance specialist review that drove measurable retrieval improvements.”

Playwright Cypress Cucumber TDD TestNG PyTest+110

View profile

Jake Lee

Screened

Junior Full-Stack/Systems Engineer specializing in AI, embedded systems, and healthcare apps

Boston, MA3y exp

SolstisBoston University

“Led architecture for “Solstice/Solstis,” a safety-aware, hands-free AI medical assistant that guides users through minor emergencies with a structured, state-machine-driven LLM agent integrated with device hardware. Built RAG grounded in Red Cross procedures plus guardrails, fallbacks, and emergency escalation, and improved real-world usability by shifting from open-ended chat to a deterministic step-by-step workflow measured via completion rate, repeat prompts, and latency.”

C C++C#Java JavaScript Python+89

View profile

Ankita A Khartmol

Screened

Junior Backend Software Engineer specializing in conversational AI and cloud APIs

Bangalore, India1y exp

HarmanUSC

“Backend/ML-focused software engineer who built and evolved a Python/FastAPI backend for a large-scale conversational AI platform, decoupling API and inference services to improve stability and deployment velocity. Experienced in production hardening (timeouts/fallbacks/monitoring), secure multi-tenant systems (JWT/RBAC/RLS), and low-risk migrations using shadow deployments and incremental traffic ramp-ups.”

Python Java JavaScript SQL REST APIs WebSockets+83

View profile

Chandra Shekar Akkandra

Screened

Mid-level AI/ML Engineer specializing in fraud detection and risk analytics in Financial Services

Newark, CA5y exp

JPMorgan ChaseUniversity of Missouri-Kansas City

“Finance-domain ML/LLM engineer who has shipped production systems including a RAG-based financial insights assistant with a custom post-generation validation layer that verifies atomic claims against retrieved source text to prevent hallucinations in compliance-critical workflows. Also built large-scale MLOps automation on AWS using Kubeflow + MLflow + CI/CD for fraud detection and credit risk models processing 500M+ transactions/day with a 99.99% uptime goal, and partnered closely with JP Morgan risk/compliance stakeholders on NLP-driven compliance monitoring.”

A/B Testing Amazon DynamoDB Amazon EC2 Amazon ECS Amazon EKS Amazon Kinesis+136

View profile

Abdul Mohammed

Screened

Mid-level Data Analyst specializing in healthcare and financial analytics

USA3y exp

Cardinal HealthIndiana Tech

“Built and productionized an LLM-powered clinical documentation and insights pipeline at Cardinal Health using LangChain + GPT-4 with RAG to summarize long clinical notes, extract medication/dosage entities, and generate structured SQL-ready outputs for downstream analytics. Emphasizes clinical reliability via labeled benchmarking (precision/recall/F1), shadow deployments, clinician human-in-the-loop review, and ongoing monitoring/orchestration with Airflow, Lambda, S3, Postgres, and Power BI.”

SQL Python R HTML JSON Microsoft Excel+105

View profile

Nidhish Rao Bairineni

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and MLOps

5y exp

Wells FargoSouthern Methodist University

“Built and deployed a production RAG-based internal knowledge assistant that let analysts query company documents in natural language, using LangChain/LangGraph with Pinecone and a FastAPI service for integration. Emphasizes reliability in production through hallucination mitigation (retrieval tuning + prompt guardrails) and measurable evaluation/monitoring (accuracy, latency, task completion, hallucination rate), iterating based on user feedback.”

Artificial Intelligence Machine Learning Generative AI Large Language Models OpenAI Claude+173

View profile

Software Engineers Machine Learning Engineers Data Scientists AI Engineers Research Assistants Software Developers AI & Machine Learning Engineering Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?