
Vetted Data Preprocessing Professionals

Pre-screened and vetted.

Data Preprocessing, Python, Docker, SQL, AWS, CI/CD

Dipanwita Rano

Screened

Entry-Level Software Engineer specializing in full-stack development and machine learning

College Station, TX | 0y exp

NatWest | Texas A&M University

“Master’s CS candidate with backend internship experience modernizing live operational workflows at NatWest, focusing on reliability improvements, safer CI/CD deployments, and incremental refactors using feature flags and rollback paths. Built FastAPI-based APIs with strong security patterns (JWT + 2FA/TOTP, centralized authorization, RLS) and demonstrated attention to edge cases like idempotency and data consistency in a Netflix-clone project.”

Agile, Artificial Intelligence, C, C++, CI/CD, Cybersecurity, +99 more


Sahithi Reddy

Screened

Mid-level Machine Learning Engineer specializing in LLM-powered products

Dallas, TX | 4y exp

Verizon | University of Massachusetts Dartmouth

“Verizon engineer who productionized an LLM-based personalization capability for a customer-facing digital platform, owning the path from success metrics through scalable APIs, A/B validation, and post-launch monitoring (latency/accuracy/drift). Experienced in diagnosing and fixing real-time LLM/RAG workflow issues under peak load, and in enabling adoption via tailored technical demos/workshops and sales support materials.”

Machine Learning, Artificial Intelligence, Deep Learning, PyTorch, TensorFlow, Keras, +110 more


PHANINDRA KETHAMUKKALA

Screened

Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI

USA | 4y exp

GE HealthCare | Franklin University

“LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.”

A/B Testing, Agile, AI Agents, Apache Kafka, Apache Spark, AWS Glue, +170 more


PAVAN VARMA PENMETHSA

Screened

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

New York City, NY | 6y exp

Avanade | University of North Texas

“Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.”

Machine Learning, Generative AI, Large Language Models (LLMs), Prompt Engineering, Retrieval-Augmented Generation (RAG), Embeddings, +131 more


BHARATH BHOOTHPUR

Screened

Mid-level Data Analyst specializing in healthcare and finance analytics

New Jersey, USA | 5y exp

Omada Health | Rowan University

“Built an end-to-end Alexa smart-home IoT application controlling a Wi-Fi bulb, including ESP32 firmware (MQTT) and an AWS serverless backend (IoT Core/Device Shadow, Lambda, DynamoDB) with a REST API. Demonstrates strong real-time scalability patterns (streaming ingestion, stateless processing, partition-key design) and full-stack delivery with Spring Boot + React (JWT auth, CORS, data-heavy dashboards).”

Python, SQL, R, NumPy, Pandas, Matplotlib, +113 more


Koti Sai venkata Bhargav Edupuganti

Screened

Mid-level AI/ML Engineer specializing in Generative AI and LLMOps

USA | 6y exp

UnitedHealth Group | Kent State University

“Built and deployed a GPT-based RAG enterprise search system for healthcare clinicians, emphasizing low-latency performance and reduced hallucinations while maintaining end-to-end HIPAA compliance. Demonstrates deep applied experience with PHI-safe data governance (detection/redaction/de-identification), secure Azure ML deployment patterns, and orchestration of production LLM workflows using LangChain and Airflow.”

A/B Testing, Agile, AWS, Bash, BigQuery, CI/CD, +131 more


Junjie Gao

Screened

Intern Full-Stack/Frontend Engineer specializing in data pipelines and analytics dashboards

San Francisco, CA | 2y exp

Association for Computing Machinery | UC San Diego

“Backend engineer with experience at Roche and Jarsy focused on API and data-layer performance. Re-architected slow generalized endpoints into more efficient APIs (30% faster lookups) and led a schema refactor/migration with feature-flag rollout, dual writes, rollback scripts, and automated integrity checks; also addressed pipeline duplicate-entry issues via deduplication.”

AngularJS, C, C++, Caching, CSS, D3.js, +70 more


Saketh Reddy

Screened

Mid-Level Software Development Engineer specializing in full-stack and LLM/AI systems

CA, USA | 4y exp

JPMorgan Chase | University of Central Missouri

“AI engineer with hands-on production experience building an end-to-end RAG system that reduced document-answering time from hours to minutes, improving accuracy through chunk overlap and hybrid BM25+semantic retrieval. Also built a LangGraph-based agent that researches company financial news via web search (Google Serper), using Pydantic structured outputs and checkpointing for reliability; experienced collaborating with non-technical stakeholders at JPMC and communicating ROI.”

Agile, Angular, Apache Airflow, Apache Kafka, AWS, Bitbucket, +138 more


Sai Chatrathi

Screened

Mid-level AI/ML Engineer specializing in healthcare analytics and MLOps

NY, USA | 4y exp

Humana | Syracuse University

“Built and deployed a production LLM-powered lesson adaptation platform for K–12 educators that personalizes content for multilingual and neurodiverse students using RAG and content transformation. Owned the full stack from FastAPI backend and OpenAI integration through reliability/safety controls, latency/cost optimization, and weekly shippable modular APIs, iterating directly with curriculum stakeholders to reduce hallucinations and improve educator trust.”

Python, Pandas, NumPy, Scikit-learn, SQL, TensorFlow, +77 more


Naveena Musku

Screened

Mid-level AI/ML Engineer specializing in agentic AI and LLM systems

5y exp

Western Union | Jawaharlal Nehru Technological University

“Backend engineer focused on productionizing LLM systems: built a FastAPI-based RAG and multi-agent automation platform deployed with Docker/Kubernetes, prioritizing safe execution and reduced hallucinations. Experienced in refactoring monolithic ML services with feature-flagged incremental rollouts, and implementing JWT/RBAC plus row-level security (e.g., Supabase) for secure, scalable APIs.”

Python, FastAPI, Flask, Django, LangChain, Hugging Face, +163 more


NITEESHA THOTTEMPUDI

Screened

Mid-level Software Engineer specializing in full-stack cloud-native systems

New York, NY | 7y exp

Dune Security | NYU

“Backend/platform engineer from Dune Security with strong experience turning messy, fragmented workflows into reusable production systems. They’ve built a shared database abstraction layer, integrated multiple enterprise security platforms into a unified workflow, and shipped AWS Bedrock-powered security insight features with guardrails and human review.”

Python, Go, Java, C, C++, JavaScript, +177 more


Krisha Patel

Screened

Entry-Level Software Engineer specializing in AI/ML and Full-Stack Development

United States | 0y exp

Target | University at Albany

“Backend engineer who built an NL-to-SQL system at Target, using a multi-step LLM pipeline with vector-store schema retrieval and SQL validation to safely answer business questions. Strong in production FastAPI systems (async, Pydantic, Docker/Uvicorn, load balancing) and security (OAuth2/JWT, scopes, and database row-level security), with experience migrating Flask apps to FastAPI + PostgreSQL using strangler/feature-flagged canary rollouts.”

.NET, Algorithms, Angular, API Testing, Bootstrap, C, +97 more


Sai Garipally

Screened

Mid-level AI/ML Engineer specializing in GenAI, LLMs, and computer vision

USA | 5y exp

UiPath | Sacred Heart University

“Built and productionized a multi-agent, LLM-powered document understanding system to replace manual review of long documents, using LangGraph orchestration plus RAG to reduce hallucinations. Implemented layered reliability controls (structured templates, checker agent, and human-in-the-loop feedback) and reported ~40% speed improvement after orchestration; also has hands-on Airflow experience for scheduled data pipelines.”

AWS, AWS Lambda, CI/CD, Containerization, Data Preprocessing, Deep Learning, +91 more


Ashwitha E

Screened

Junior Data Scientist specializing in fraud analytics and cloud data platforms

Dallas, TX | 3y exp

Bank of America | University of North Texas

“Built and deployed production LLM-powered document summarization/classification systems using embeddings, vector databases (RAG-style retrieval), and automated evaluation (BERTScore/ROUGE), with a focus on monitoring and scalable cloud pipelines. Also partnered with a fraud analytics team to deliver a transaction anomaly detection solution, translating model outputs into Power BI dashboards and actionable KPIs while iterating on thresholds and alerts based on stakeholder feedback.”

Python, SQL, R, Machine Learning, Predictive Modeling, Feature Engineering, +105 more


Ansh Krishna

Screened

Intern Data Scientist specializing in ML systems and LLM-powered analytics

Noida, India | 1y exp

Data Security Council of India | USC

“Built an autonomous decision analytics LLM agent for end-to-end tabular binary classification, using RAG (FAISS) to retain context across multi-step queries. Deployed as a FastAPI service with production-style reliability features (schema-aware validation, fallbacks, retries, structured outputs) plus offline/online evaluation and monitoring to reduce analysis time and improve consistency versus stateless approaches.”

A/B Testing, Artificial Intelligence, Backend Development, C++, Cloud Computing, Data Structures and Algorithms, +76 more


Rushir Bhavsar

Screened

Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training

1y exp

Cadence Design Systems | Arizona State University

“Founding AI engineer (June 2024) at Talon Labs who built and productionized an LLM-powered chatbot for interacting with proprietary supply-chain documents, deployed at large scale (25–100,000 users). Experienced with RAG/LLM orchestration (LangChain, LlamaIndex, Groq AI) and production ops tooling (Kubernetes, Docker, Kubeflow, Airflow), with a metrics-driven approach to evaluation, observability, and stakeholder alignment.”

AI Agents, Angular, Apache Spark, AWS, AWS CloudFormation, AWS Lambda, +121 more


Aniket Janrao

Screened

Junior Data Scientist specializing in healthcare ML and clinical NLP/LLMs

Houma, LA | 2y exp

Objective Medical Systems LLC | University at Buffalo

“Healthcare-focused LLM engineer who has built two production clinical applications: an automated structured clinical report generator from physician-patient conversations and a RAG-based chatbot for retrieving patient history (procedures, allergies, etc.). Demonstrates strong applied RAG expertise (overlapping chunking, entity dependency graphs, temporal filtering, graph RAG) to reduce hallucinations/omissions and partners closely with clinicians to automate hospital workflows.”

BERT, C++, Data preprocessing, Data visualization, Deep learning, Docker, +125 more


Arya Mane

Screened

Junior Full-Stack & AI/ML Engineer specializing in LLMs and multimodal document processing

Dallas, Texas | 1y exp

Receptro.AI | University of Texas at Dallas

“Built a production RAG-based NBA player scouting assistant that embeds player profiles into FAISS, orchestrates retrieval and LLM recommendations with LangChain, and surfaces results via embedded Tableau dashboards. Demonstrates strong focus on evaluation/monitoring (batch tests, LLM-as-judge, latency/failure/token metrics) and has experience translating non-technical founder goals into DAPT + fine-tuning plans on curated data.”

Python, SQL, PyTorch, TensorFlow, scikit-learn, Hugging Face, +83 more


Ankita A Khartmol

Screened

Junior Backend Software Engineer specializing in conversational AI and cloud APIs

Bangalore, India | 1y exp

Harman | USC

“Backend/ML-focused software engineer who built and evolved a Python/FastAPI backend for a large-scale conversational AI platform, decoupling API and inference services to improve stability and deployment velocity. Experienced in production hardening (timeouts/fallbacks/monitoring), secure multi-tenant systems (JWT/RBAC/RLS), and low-risk migrations using shadow deployments and incremental traffic ramp-ups.”

Python, Java, JavaScript, SQL, REST APIs, WebSockets, +83 more


Paritosh Kadam

Screened

Junior Software Engineer specializing in AI/LLM backend systems

Los Angeles, CA | 2y exp

Easley-Dunn Productions | USC

“Built production AI systems in high-stakes domains, including a medical RAG chatbot focused on reducing hallucinations and a document-processing workflow that automated manual PDF extraction. Demonstrates strong end-to-end ownership across backend services, APIs, LLM integration, and iterative reliability improvements based on real usage and failure analysis.”

Python, JavaScript, SQL, React, HTML, CSS, +77 more


Yashvi Kishor Vaghela

Screened

Entry-level Software Developer specializing in full-stack web and machine learning applications

California, USA | 1y exp

Easley-Dunn Productions | USC

“Early-career candidate with a thoughtful, engineering-first approach to AI-assisted development: they use AI to accelerate implementation while retaining human ownership of architecture and final code quality. They recently built a speech-to-text workflow using Groq Whisper and showed practical judgment by designing around imperfect transcription accuracy with checks and fallback handling.”

Python, C++, Java, MySQL, MongoDB, SQL, +124 more


Keerthana Priya

Screened

Mid-level Data Analytics & ML Engineer specializing in NLP, LLMs, and cloud data platforms

Dallas, TX | 5y exp

Mattel | Kennesaw State University

“At KPMG, built and productionized a secure RAG-based LLM assistant that lets business and risk stakeholders query data warehouses in natural language, reducing dependence on data engineers for ad-hoc analysis. Demonstrates strong production rigor (Airflow orchestration, CI/CD, containerization), retrieval/embedding tuning (rechunking, semantic abstraction for structured data), and reliability controls (confidence thresholds, refusal behavior, monitoring and canary evals).”

SQL, Python, R, PySpark, Apache Spark, Pandas, +123 more


Manav Bhasin

Screened

Junior Full-Stack Machine Learning Engineer specializing in production ML systems

San Jose, CA | 2y exp

AgroFocal Technologies Inc | San José State University

“Software engineer who owned end-to-end delivery of customer-facing agricultural forecast reporting (crop yield/health) and iterated quickly via rigorous edge-case testing and customer feedback. Also built an internal ML training platform (TypeScript/React + Flask/Python + MongoDB) used by every developer, with architecture designed to stay responsive under heavy compute load.”

Python, SQL, JavaScript, TypeScript, C, C++, +65 more


Yogi Makadiya

Screened

Mid-Level Full-Stack Software Engineer specializing in cloud-native microservices and DevSecOps

Seattle, WA | 3y exp

CuraJoy | University of Maryland, College Park

“Backend-leaning product engineer with DevSecOps depth who has shipped real-time, Kafka-driven data pipelines and AI-enabled customer-facing features to production on AWS. Built a Spring Boot API layer serving real-time predictions at 100K+ requests/day, improving latency by 35% and user task completion by ~25%, and delivered a React/TypeScript dashboard plus a Postgres audit/history model optimized for search and large event volumes.”

Python, Java, JavaScript, TypeScript, SQL, Shell Scripting, +101 more

