Vetted Data Preprocessing Professionals

Pre-screened and vetted.

DR

Entry-Level Software Engineer specializing in full-stack development and machine learning

College Station, TX | 0y exp
NatWest | Texas A&M University

Master’s CS candidate with backend internship experience modernizing live operational workflows at NatWest, focusing on reliability improvements, safer CI/CD deployments, and incremental refactors using feature flags and rollback paths. Built FastAPI-based APIs with strong security patterns (JWT + 2FA/TOTP, centralized authorization, RLS) and demonstrated attention to edge cases like idempotency and data consistency in a Netflix-clone project.

Sahithi Reddy

Screened

Mid-level Machine Learning Engineer specializing in LLM-powered products

Dallas, TX | 4y exp
Verizon | University of Massachusetts Dartmouth

Verizon engineer who productionized an LLM-based personalization capability for a customer-facing digital platform, owning the path from success metrics through scalable APIs, A/B validation, and post-launch monitoring (latency/accuracy/drift). Experienced in diagnosing and fixing real-time LLM/RAG workflow issues under peak load, and in enabling adoption via tailored technical demos/workshops and sales support materials.

PK

Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI

USA | 4y exp
GE HealthCare | Franklin University

LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.

PV

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

New York City, NY | 6y exp
Avanade | University of North Texas

Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.

BB

Mid-level Data Analyst specializing in healthcare and finance analytics

New Jersey, USA | 5y exp
Omada Health | Rowan University

Built an end-to-end Alexa smart-home IoT application controlling a Wi-Fi bulb, including ESP32 firmware (MQTT) and an AWS serverless backend (IoT Core/Device Shadow, Lambda, DynamoDB) with a REST API. Demonstrates strong real-time scalability patterns (streaming ingestion, stateless processing, partition-key design) and full-stack delivery with Spring Boot + React (JWT auth, CORS, data-heavy dashboards).

KS

Mid-level AI/ML Engineer specializing in Generative AI and LLMOps

USA | 6y exp
UnitedHealth Group | Kent State University

Built and deployed a GPT-based RAG enterprise search system for healthcare clinicians, emphasizing low-latency performance and reduced hallucinations while maintaining end-to-end HIPAA compliance. Demonstrates deep applied experience with PHI-safe data governance (detection/redaction/de-identification), secure Azure ML deployment patterns, and orchestration of production LLM workflows using LangChain and Airflow.

Junjie Gao

Screened

Intern Full-Stack/Frontend Engineer specializing in data pipelines and analytics dashboards

San Francisco, CA | 2y exp
Association for Computing Machinery | UC San Diego

Backend engineer with experience at Roche and Jarsy focused on API and data-layer performance. Re-architected slow generalized endpoints into more efficient APIs (30% faster lookups) and led a schema refactor/migration with feature-flag rollout, dual writes, rollback scripts, and automated integrity checks; also addressed pipeline duplicate-entry issues via deduplication.

Saketh Reddy

Screened

Mid-Level Software Development Engineer specializing in full-stack and LLM/AI systems

CA, USA | 4y exp
JPMorgan Chase | University of Central Missouri

AI engineer with hands-on production experience building an end-to-end RAG system that reduced document-answering time from hours to minutes, improving accuracy through chunk overlap and hybrid BM25+semantic retrieval. Also built a LangGraph-based agent that researches company financial news via web search (Google Serper), using Pydantic structured outputs and checkpointing for reliability; experienced collaborating with non-technical stakeholders at JPMC and communicating ROI.

Sai Chatrathi

Screened

Mid-level AI/ML Engineer specializing in healthcare analytics and MLOps

NY, USA | 4y exp
Humana | Syracuse University

Built and deployed a production LLM-powered lesson adaptation platform for K–12 educators that personalizes content for multilingual and neurodiverse students using RAG and content transformation. Owned the full stack from FastAPI backend and OpenAI integration through reliability/safety controls, latency/cost optimization, and weekly shippable modular APIs, iterating directly with curriculum stakeholders to reduce hallucinations and improve educator trust.

Naveena Musku

Screened

Mid-level AI/ML Engineer specializing in agentic AI and LLM systems

5y exp
Western Union | Jawaharlal Nehru Technological University

Backend engineer focused on productionizing LLM systems: built a FastAPI-based RAG and multi-agent automation platform deployed with Docker/Kubernetes, prioritizing safe execution and reduced hallucinations. Experienced in refactoring monolithic ML services with feature-flagged incremental rollouts, and implementing JWT/RBAC plus row-level security (e.g., Supabase) for secure, scalable APIs.

NT

Mid-level Software Engineer specializing in full-stack cloud-native systems

New York, NY | 7y exp
Dune Security | NYU

Backend/platform engineer from Dune Security with strong experience turning messy, fragmented workflows into reusable production systems. They’ve built a shared database abstraction layer, integrated multiple enterprise security platforms into a unified workflow, and shipped AWS Bedrock-powered security insight features with guardrails and human review.

Krisha Patel

Screened

Entry-Level Software Engineer specializing in AI/ML and Full-Stack Development

United States | 0y exp
Target | University at Albany

Backend engineer who built an NL-to-SQL system at Target, using a multi-step LLM pipeline with vector-store schema retrieval and SQL validation to safely answer business questions. Strong in production FastAPI systems (async, Pydantic, Docker/Uvicorn, load balancing) and security (OAuth2/JWT, scopes, and database row-level security), with experience migrating Flask apps to FastAPI + PostgreSQL using strangler/feature-flagged canary rollouts.

Sai Garipally

Screened

Mid-level AI/ML Engineer specializing in GenAI, LLMs, and computer vision

USA | 5y exp
UiPath | Sacred Heart University

Built and productionized a multi-agent, LLM-powered document understanding system to replace manual review of long documents, using LangGraph orchestration plus RAG to reduce hallucinations. Implemented layered reliability controls (structured templates, checker agent, and human-in-the-loop feedback) and reported ~40% speed improvement after orchestration; also has hands-on Airflow experience for scheduled data pipelines.

Ashwitha E

Screened

Junior Data Scientist specializing in fraud analytics and cloud data platforms

Dallas, TX | 3y exp
Bank of America | University of North Texas

Built and deployed production LLM-powered document summarization/classification systems using embeddings, vector databases (RAG-style retrieval), and automated evaluation (BERTScore/ROUGE), with a focus on monitoring and scalable cloud pipelines. Also partnered with a fraud analytics team to deliver a transaction anomaly detection solution, translating model outputs into Power BI dashboards and actionable KPIs while iterating on thresholds and alerts based on stakeholder feedback.

Ansh Krishna

Screened

Intern Data Scientist specializing in ML systems and LLM-powered analytics

Noida, India | 1y exp
Data Security Council of India | USC

Built an autonomous decision analytics LLM agent for end-to-end tabular binary classification, using RAG (FAISS) to retain context across multi-step queries. Deployed as a FastAPI service with production-style reliability features (schema-aware validation, fallbacks, retries, structured outputs) plus offline/online evaluation and monitoring to reduce analysis time and improve consistency versus stateless approaches.

Rushir Bhavsar

Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training

1y exp
Cadence Design Systems | Arizona State University

Founding AI engineer (June 2024) at Talon Labs who built and productionized an LLM-powered chatbot for interacting with proprietary supply-chain documents, deployed at large scale (25–100,000 users). Experienced with RAG/LLM orchestration (LangChain, LlamaIndex, Groq AI) and production ops tooling (Kubernetes, Docker, Kubeflow, Airflow), with a metrics-driven approach to evaluation, observability, and stakeholder alignment.

Aniket Janrao

Screened

Junior Data Scientist specializing in healthcare ML and clinical NLP/LLMs

Houma, LA | 2y exp
Objective Medical Systems LLC | University at Buffalo

Healthcare-focused LLM engineer who has built two production clinical applications: an automated structured clinical report generator from physician-patient conversations and a RAG-based chatbot for retrieving patient history (procedures, allergies, etc.). Demonstrates strong applied RAG expertise (overlapping chunking, entity dependency graphs, temporal filtering, graph RAG) to reduce hallucinations/omissions and partners closely with clinicians to automate hospital workflows.

Arya Mane

Screened

Junior Full-Stack & AI/ML Engineer specializing in LLMs and multimodal document processing

Dallas, Texas | 1y exp
Receptro.AI | University of Texas at Dallas

Built a production RAG-based NBA player scouting assistant that embeds player profiles into FAISS, orchestrates retrieval and LLM recommendations with LangChain, and surfaces results via embedded Tableau dashboards. Demonstrates strong focus on evaluation/monitoring (batch tests, LLM-as-judge, latency/failure/token metrics) and has experience translating non-technical founder goals into DAPT + fine-tuning plans on curated data.

Ankita A Khartmol

Junior Backend Software Engineer specializing in conversational AI and cloud APIs

Bangalore, India | 1y exp
Harman | USC

Backend/ML-focused software engineer who built and evolved a Python/FastAPI backend for a large-scale conversational AI platform, decoupling API and inference services to improve stability and deployment velocity. Experienced in production hardening (timeouts/fallbacks/monitoring), secure multi-tenant systems (JWT/RBAC/RLS), and low-risk migrations using shadow deployments and incremental traffic ramp-ups.

PK

Junior Software Engineer specializing in AI/LLM backend systems

Los Angeles, CA | 2y exp
Easley-Dunn Productions | USC

Built production AI systems in high-stakes domains, including a medical RAG chatbot focused on reducing hallucinations and a document-processing workflow that automated manual PDF extraction. Demonstrates strong end-to-end ownership across backend services, APIs, LLM integration, and iterative reliability improvements based on real usage and failure analysis.

YK

Entry-level Software Developer specializing in full-stack web and machine learning applications

California, USA | 1y exp
Easley-Dunn Productions | USC

Early-career candidate with a thoughtful, engineering-first approach to AI-assisted development: they use AI to accelerate implementation while retaining human ownership of architecture and final code quality. They recently built a speech-to-text workflow using Groq Whisper and showed practical judgment by designing around imperfect transcription accuracy with checks and fallback handling.

KP

Mid-level Data Analytics & ML Engineer specializing in NLP, LLMs, and cloud data platforms

Dallas, TX | 5y exp
Mattel | Kennesaw State University

At KPMG, built and productionized a secure RAG-based LLM assistant that lets business and risk stakeholders query data warehouses in natural language, reducing dependence on data engineers for ad-hoc analysis. Demonstrates strong production rigor (Airflow orchestration, CI/CD, containerization), retrieval/embedding tuning (rechunking, semantic abstraction for structured data), and reliability controls (confidence thresholds, refusal behavior, monitoring and canary evals).

Manav Bhasin

Screened

Junior Full-Stack Machine Learning Engineer specializing in production ML systems

San Jose, CA | 2y exp
AgroFocal Technologies Inc | San José State University

Software engineer who owned end-to-end delivery of customer-facing agricultural forecast reporting (crop yield/health) and iterated quickly via rigorous edge-case testing and customer feedback. Also built an internal ML training platform (TypeScript/React + Flask/Python + MongoDB) used by every developer, with architecture designed to stay responsive under heavy compute load.

Yogi Makadiya

Screened

Mid-Level Full-Stack Software Engineer specializing in cloud-native microservices and DevSecOps

Seattle, WA | 3y exp
CuraJoy | University of Maryland, College Park

Backend-leaning product engineer with DevSecOps depth who has shipped real-time, Kafka-driven data pipelines and AI-enabled customer-facing features to production on AWS. Built a Spring Boot API layer serving real-time predictions at 100K+ requests/day, improving latency by 35% and user task completion by ~25%, and delivered a React/TypeScript dashboard plus a Postgres audit/history model optimized for search and large event volumes.
