Vetted Hugging Face Transformers Professionals

Pre-screened and vetted.

Hugging Face Transformers, Python, Docker, SQL, PyTorch, CI/CD

Kevin Cruz

Screened

Senior Gen AI Engineer specializing in agentic LLM systems

Tempe, AZ · 15y exp

Opendoor · USC

“Built and owned end-to-end production systems for a healthcare platform, including a predictive task recommendation feature (React + FastAPI + ML on AWS ECS) that cut backlog 20% and saved coordinators ~10 hours/week. Also productionized an AI-native RAG system (vector DB + LLM) delivering 40% faster query resolution, and led phased modernization of a monolithic FastAPI service into async microservices using feature flags and canary releases.”

Generative AI, Multi-Agent Systems, Prompt Engineering, Vector Databases, LangChain, LangGraph, +396 more

Ming-Kai Liu

Screened

Junior AI Engineer specializing in LLM pipelines, RAG, and computer vision

Raleigh, NC · 2y exp

Citrus Oncology · UC San Diego

“Built and deployed an on-prem, HIPAA-compliant LLM pipeline for oncology-focused clinical note generation and decision support, emphasizing grounded differential diagnosis and explainable reasoning via RAG to reduce hallucinations. Also created a LangGraph-based multi-agent academic paper search system integrating Tavily, arXiv, and Semantic Scholar with an orchestrator that routes tasks to specialized sub-agents.”

Linux, C, C++, Python, Java, SQL, +81 more

Jeevan Aher

Screened

Junior AI Engineer specializing in fraud detection, credit risk, and LLMs in FinTech

Remote, USA · 3y exp

JPMorgan Chase · University of Illinois Urbana-Champaign

“AI engineer with production experience building a high-accuracy (98%) fraud detection system operating at real-time latency (1–2s) over millions of transactions, using a multi-model pipeline approach to meet performance constraints. Also implemented Airflow-orchestrated workflows (DAGs, retries, alerts) to replace brittle cron scripts and is currently pursuing a master’s project on real-time ASL-to-text conversion.”

Python, R, SQL, JavaScript, Bash, C, +107 more

Harsh Chaudhari

Screened

Intern Software Engineer specializing in ML/NLP and LLM applications

Boulder, CO · 0y exp

Splunk · University of Colorado Boulder

“Full-stack AI/LLM engineer who has deployed a production LLM backend (Mistral 14B) on GKE to auto-transform datasets and generate runnable ML training pipelines, addressing hallucinations, schema mismatch, latency, and burst scaling with caching/prompt compression and HPA. Also has internship experience (Splunk, BlackOffer) delivering data automation and 10+ Power BI dashboards for non-technical stakeholders with measurable efficiency gains.”

C++, Data Pipelines, Data Preprocessing, Docker, Embeddings, FAISS, +70 more

Praveen Nutulapati

Screened

Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems

New York, NY · 6y exp

JPMorgan Chase · University of Central Missouri

“Built and deployed a production multi-agent RAG system at JPMorgan Chase to automate regulated credit analysis and compliance clause discovery across large internal policy/document libraries. Implemented LangGraph-based supervisor orchestration with structured state management (Azure OpenAI) to support long-running, resumable workflows, plus hybrid retrieval + re-ranking and guardrails for reliability. Strong at evaluation/observability (trace logging, LLM-judge, HITL) and at communicating results to non-technical stakeholders via Power BI embeds and Streamlit prototypes.”

A/B Testing, Agile, Amazon Bedrock, Amazon EC2, Amazon EMR, Amazon RDS, +184 more

Prakhar Srivastava

Screened

Senior Software Engineer specializing in backend infrastructure, cloud automation, and reliability

Mountain View, CA · 8y exp

Oracle · Stony Brook University

“End-to-end deployment owner for Oracle document delivery/print services in a hospital-like production environment, focused on reliability/performance at scale (thousands of systems). Also describes implementing event-driven RAG/agentic LLM workflows with attention to embeddings/index consistency, latency, and measurable improvements in response relevance and operational efficiency.”

Python, C, C++, Java, C#, Bash, +155 more

Sirisha Maddikunta

Screened

Mid-level Generative AI Engineer specializing in enterprise LLM and healthcare AI solutions

O'Fallon, MO · 6y exp

Mastercard · University of Texas at Arlington

“Built and owned an end-to-end LLM-powered fraud investigation assistant that automated case summaries and risk analysis, cutting analyst investigation/documentation time by 40%. Stands out for translating RAG concepts into a production-grade internal platform with strong evaluation, monitoring, and reusable Python service architecture that improved both analyst trust and engineering velocity.”

Generative AI, Natural Language Processing, Computer Vision, Prompt Engineering, Retrieval-Augmented Generation, LoRA, +234 more

Manvir Singh

Screened

Senior Full-Stack & Mobile Software Engineer specializing in cloud-based applications

Englewood, NJ · 10y exp

Cobalt Brands · University of Washington

“Data/ML backend engineer with hands-on production experience spanning RAG services (LlamaIndex/OpenAI) and AWS data platforms. Has delivered Terraform-managed AWS architectures (Lambda + ECS Fargate) with secure secrets handling, built Glue-to-Redshift ETL with schema evolution controls, modernized SAS reporting into Python microservices, and achieved major Redshift query speedups (2+ hours to under 15 minutes).”

React, React Native, TypeScript, Next.js, Redux, Tailwind CSS, +117 more

Vamshi Saggurthi

Screened

Mid-Level Software Engineer specializing in LLM agents and real-time data streaming

8y exp

Amazon · Rutgers University–New Brunswick

“Software engineer with experience at Striim and Amazon who ships end-to-end production systems across UI, backend, ML, and operations. Built a real-time PII detection capability for a streaming data platform by integrating Python ML inference into a Java monolith via gRPC sidecars, achieving ~3M events/hour throughput and ~93% accuracy, and helped drive enterprise adoption (Fiserv, CVS). Also modernized internal Amazon tooling for multi-region scale with modularization and fully automated deployments.”

Python, Java, R, JavaScript, Apache Airflow, Apache Kafka, +110 more

Raghav Konduri

Screened

Mid-level AI/ML Engineer specializing in Generative AI, Conversational AI, and RAG systems

NJ, USA · 4y exp

Scale AI · Rowan University

“Built and shipped a production enterprise RAG knowledge assistant that returns grounded, cited answers and uses confidence-based fallbacks (clarifying questions/abstention) with monitoring and compliance controls for sensitive data. Implemented end-to-end agent orchestration (function calling, structured JSON, state, retries/rate limits) plus eval/feedback loops, and achieved a reported 30–40% improvement in knowledge-task completion time while reducing hallucinations via retrieval improvements.”

A/B Testing, Agile, Amazon CloudWatch, Amazon EC2, Amazon EKS, Amazon Kinesis, +151 more

Binita Chourasia

Screened

Mid-level GenAI Engineer specializing in RAG, LLMs, and enterprise AI

4y exp

Cardinal Health · Rivier University

“Built and shipped production LLM agents that automate document processing and decision workflows, with a strong focus on reliability, guardrails, and measurable business impact. Stands out for combining RAG, tool calling, evals/monitoring, and ERP integration to deliver 30-35% manual effort reduction and higher throughput without additional headcount.”

Python, SQL, Generative AI, Large Language Models, Prompt Engineering, Retrieval-Augmented Generation, +142 more

Akhil Chippalthurthy

Screened

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and predictive analytics

New Jersey, USA · 5y exp

JPMorgan Chase · Stevens Institute of Technology

“GenAI/LLM engineer who architected and deployed a production RAG ‘research assistant’ for JPMorgan Chase’s regulatory compliance team, focused on safety-critical behavior (mandatory citations, refusal when evidence is missing). Deep hands-on experience with LlamaIndex, Pinecone, Hugging Face embeddings, LangGraph agent workflows, and metric-driven evaluation (golden sets, TruLens), including a reported 28% relevancy lift via cross-encoder re-ranking.”

Python, R, SQL, Jupyter Notebook, LightGBM, XGBoost, +172 more

Tianai Shi

Screened

Intern Full-Stack Software Engineer specializing in test analytics platforms

La Jolla, CA · 2y exp

Nutanix · UC San Diego

“Software engineer intern at Nutanix who independently shipped and maintained an internal smoke-test/failure-analysis dashboard, integrating failure data from multiple upstream systems (e.g., Jira, Jenkins, CircleCI) via REST APIs. Also has prior data-science experience building Postgres-based asset management analytics with automated reporting and indexing for faster time-series retrieval.”

API Design, Asynchronous Processing, Backend Development, BERT, CI/CD, C, +94 more

Shreya Andela

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG, and enterprise data platforms

5y exp

JPMorgan Chase · University of North Texas

“Built and shipped a production LLM-powered RAG assistant for enterprise internal document search (PDFs, knowledge bases, structured data), addressing real-world issues like noisy documents, hallucinations, and latency with grounded prompting, retrieval-confidence fallbacks, and performance optimizations. Also partnered with compliance and business teams at JPMorgan Chase to deliver a solution aligned with regulatory constraints, supported by monitoring, feedback loops, and systematic evaluation.”

Python, R, SQL, FastAPI, ETL Pipelines, Unit Testing, +156 more

Vishnu Varma

Screened

Senior AI/ML Engineer specializing in LLMs, GenAI, and MLOps

Milpitas, California · 8y exp

Databricks · Campbellsville University

“AI/ML engineer (Cognizant) who built a production, real-time credit card fraud detection platform combining deep-learning anomaly detection with an LLM-based explanation layer. Strong focus on regulated deployment: addressed class imbalance and feature drift, and added guardrails (SHAP/structured inputs, fine-tuning on analyst reports, rule-based validation) to keep explanations accurate and compliant. Orchestrated the full pipeline with Airflow + Databricks/Spark and used MLflow/Prometheus plus A/B and shadow deployments for measurable reliability.”

Python, SQL, PySpark, Bash, TensorFlow, PyTorch, +106 more

Keerthana Tammina

Screened

Mid-level Data Scientist specializing in machine learning and generative AI

Saint Louis, MO · 5y exp

DoorDash · Saint Louis University

“ML/LLM engineer who has shipped a production transformer-based document understanding system on AWS, owning the full pipeline from domain fine-tuning to Dockerized CI/CD deployment. Demonstrates strong production rigor—latency optimization (distillation/quantization, async batching, autoscaling), orchestration with Airflow/Step Functions/Azure Data Factory, and monitoring/drift detection—plus experience translating ops stakeholder needs into adopted AI automation via dashboards.”

Agile, Amazon Redshift, Amazon S3, Amazon SageMaker, Anomaly Detection, Apache Hadoop, +157 more

Prathik Goud Makthala

Screened

Mid-Level Backend Software Engineer specializing in FinTech and scalable APIs

California, USA · 5y exp

Affirm · Rochester Institute of Technology

“Backend/microservices engineer with fintech loan-lifecycle experience operating low-latency (sub-250ms) services in production using Kafka, idempotent transaction design, and Datadog observability. Also built an end-to-end LLM chatbot (React + Flask) with a decoupled model integration layer (FLAN-T5 via Hugging Face) and has experience designing partner-facing REST APIs with OAuth2/JWT and Swagger documentation.”

Python, FastAPI, Flask, Java, Spring Boot, JavaScript, +110 more

Akhilesh Padala

Screened

Mid-level AI/ML Engineer specializing in Generative AI, NLP, and Computer Vision

USA · 4y exp

Databricks · Gannon University

“ML/AI engineer with strong end-to-end production ownership across predictive ML and Generative AI use cases. They built a churn prediction platform that cut churn 12% and preserved about $1.2M in annual revenue, and also shipped a RAG-based support assistant that reduced ticket resolution time 30% while improving agent satisfaction and onboarding speed.”

Python, Java, R, SQL, PySpark, Apache Spark, +130 more

Moh Abdullah

Screened

Senior AI/ML Engineer specializing in Generative AI, LLMs, and production ML systems

New York, USA · 9y exp

Luma AI

“ML/AI engineer with hands-on ownership of both classical ML and GenAI systems in production. They built an end-to-end churn prediction service on AWS and also shipped RAG-based document search/summarization features, with clear experience in monitoring, hallucination reduction, cost/latency optimization, and creating shared Python/LLM infrastructure used across teams.”

Python, SQL, Scala, Java, C, C++, +335 more
