Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Data Cleaning Professionals

Pre-screened and vetted.

Data Cleaning Python SQL pandas AWS Docker

Nitin Shivakumar

Screened

Senior Data Scientist specializing in healthcare ML, LLMs, and responsible AI

Morris Plains, NJ4y exp

CignaUniversity at Buffalo

“Clinical data scientist who has built an agentic LLM-powered literature review assistant (with RAG-style storage/retrieval) to identify predictors for downstream predictive modeling. Also delivered a patient-focused progression analysis model using Databricks + Airflow orchestration, partnering closely with clinicians to define targets and validate that model insights aligned with clinical expectations.”

A/B Testing AWS Classification Computer Vision Databricks Data Analysis+72

View profile

Venkatarama Sai Teja Dasarathi

Screened

Mid-level Machine Learning Engineer specializing in deep learning and generative AI

San Jose, CA5y exp

MetLifeUniversity of Alabama at Birmingham

“ML/NLP engineer with hands-on experience building production systems for unstructured insurance claims and customer data linking. Delivered measurable impact at scale (millions of documents), combining transformer-based NLP, vector search (FAISS/Pinecone), and human-in-the-loop validation, and has strong production workflow/observability practices (Airflow, AWS Batch, Grafana/Prometheus).”

Python R SQL MATLAB TensorFlow Keras+126

View profile

Alexander Conn

Screened

Principal Data Scientist specializing in cybersecurity ML and MLOps

New York, NY15y exp

Beyond IdentityIowa State University

“ML/NLP engineer (Beyond Identity) who built production semantic search and entity-resolution systems over internal security documentation, using LDA + BERT embeddings with FAISS/Pinecone to cut search time by 30%. Also scaled a real-time anomaly detection pipeline to millions of events/day with Spark and AWS Lambda, with strong emphasis on measurable validation (Precision@k, MRR, F1, ARI).”

Machine Learning Artificial Intelligence Supervised Learning Unsupervised Learning Deep Learning Computer Vision+118

View profile

PremKumar Gandla

Screened

Mid-level AI/ML Engineer specializing in MLOps, NLP, and scalable model deployment

Texas, USA4y exp

BlackbaudSouthern Arkansas University

“Built and deployed a production autonomous AI data analyst agent (LangChain + GPT + Streamlit on AWS) that turns natural-language questions into validated SQL, visualizations, and insights, cutting manual analysis time by ~50%. Emphasizes reliability and MLOps: schema-aware validation/guardrails to prevent hallucinations, scalable large-data processing, and Azure DevOps CI/CD + MLflow for automated deployment and experiment tracking.”

Python SQL R TensorFlow PyTorch Scikit-learn+87

View profile

Sai sri mylavarapu

Screened

Junior Data Analyst specializing in BI, ETL, and reporting

Fort Lauderdale, FL3y exp

AdaniFlorida Atlantic University

“Analytics professional with hands-on experience building SQL and Python workflows across SAP, Oracle, and internal operational systems, processing roughly 5 million records per month. They combine strong data quality rigor with stakeholder-friendly Power BI reporting, and cite a concrete impact of cutting reporting turnaround time from four days to two while surfacing cost anomalies for business teams.”

Python SQL Pandas NumPy Data Cleaning Power BI+68

View profile

Ramya Sree Kanijam

Screened

Mid-level Software Engineer specializing in backend systems, cloud, and AI pipelines

Remote, USA3y exp

NetomiTexas A&M University-Corpus Christi

“Built and owned an end-to-end AI-driven content enrichment pipeline for a news workflow, using n8n, LLM agents, and external APIs to automate ingestion, deduplication, categorization, and approval routing. Stands out for production-minded AI systems work: they improved reliability with schema validation, retries, idempotency, and monitoring, while automating 90% of processing and cutting duplication errors by 95%+.”

Python SQL Bash FastAPI Celery Apache Airflow+231

View profile

Aditya Anil Raut

Screened

Junior Software Engineer specializing in AI/ML, data pipelines, and cloud APIs

San Jose, CA3y exp

TCSCalifornia State University, Chico

“Hands-on AI/LLM practitioner who built a RAG-based customer support chatbot and tackled production issues like data chunking complexity and response-time lag. Uses techniques such as overlapping chunks, semantic search, context engineering, and query routing, and has experience presenting technical demos/workshops to developer audiences.”

AWS AWS Lambda Bootstrap C C++Data cleaning+106

View profile

Vaidik Vyas

Screened

Mid-Level AI Backend Engineer specializing in Python, LLM/RAG, and healthcare/insurance platforms

Franklin, NJ5y exp

MetLifeNJIT

“AI Backend Engineer in MetLife’s claims technology group who built and deployed a production LLM-based decision support system that helps claim adjusters quickly find relevant policy rules from long PDFs and historical notes. Designed it as multiple production-grade services with retrieval-first guardrails, continuous validation, and Airflow-orchestrated pipelines for ingestion, embeddings, and vector index updates to keep the system reliable as policies and data evolve.”

Amazon DynamoDB Amazon ECS Amazon RDS Amazon S3 AWS Lambda Bash+106

View profile

Abdelrahman Zeidan

Screened

Mid-Level Software Engineer specializing in Generative AI and LLM applications

Johnston, Iowa4y exp

CortevaNortheastern University

“Built and deployed a production RAG-based AI assistant for sales reps to unify access to product info, pricing, and internal documents across multiple systems. Implemented ETL pipelines for normalization/chunking/embeddings, integrated the assistant into internal React/TypeScript UIs with user-specific context, and enforced security with private vector storage and permission-filtered retrieval.”

Agile Amazon EC2 Amazon S3 Amazon SageMaker Angular API Development+73

View profile

Thilak P

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and big data pipelines

5y exp

W. R. BerkleySacred Heart University

“Backend/data engineer who builds Python (FastAPI) data-processing API services for internal analytics/reporting, emphasizing modular architecture, async performance tuning, and reliability patterns (health checks, retries, observability). Also migrated legacy on-prem ETL pipelines to Azure using ADF/Data Lake/Functions and implemented a near-real-time ingestion flow with Event Hubs plus watermarking to handle late events and deduplication.”

Python SQL R C HTML CSS+153

View profile

Yash Pankhania

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and data engineering

Boston, MA5y exp

Humanitarians.AINortheastern University

“AI Engineer Co-Op at Northeastern University who built a production Patient Persona Chat Bot to help nursing students practice clinical interactions, fine-tuning Llama 3 and integrating a LangChain + Pinecone RAG pipeline deployed on Amazon Bedrock. Emphasizes clinical accuracy and reliability with guardrails, retrieval filtering, and continuous evaluation, and also brings strong data engineering/orchestration experience (Airflow, EMR/PySpark, ADF, dbt, Databricks, Snowflake).”

Agile Amazon Bedrock Amazon DynamoDB Amazon RDS Amazon Redshift Amazon S3+127

View profile

Krishna Rajput

Screened

Mid-level AI Engineer & Data Scientist specializing in LLMs, RAG, and multimodal systems

Tempe, AZ5y exp

HCLTechArizona State University

“LLM/GenAI engineer who built a production AI-powered credit risk policy summarization and compliance alerting platform at HCL Tech, focused on factual accuracy and auditability for a financial client. Implemented a multi-retriever LangChain RAG architecture with citations-only prompting, fallback agents, and human-in-the-loop legal review—cutting manual review time by 35% and scaling to 12 teams.”

A/B Testing AI Agents Anomaly Detection AWS Glue AWS Lambda Azure Machine Learning+126

View profile

nitesh bommisetty

Screened

Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions

Tampa, FL4y exp

LumenUniversity of South Florida

“AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail Maven dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.”

Python R SQL TensorFlow PyTorch Keras+123

View profile

saran palle

Screened

Mid-level Applied AI Engineer specializing in agentic LLM workflows

North Carolina4y exp

Acentrik Technology SolutionsUniversity at Buffalo

“AI engineer with production experience building a LangGraph-based, stateful multi-agent system at MetLife to automate complex insurance claims adjudication, integrating document discovery, Azure Document Intelligence OCR/extraction, and health data analysis. Strong in agent orchestration and production deployment (Docker + FastAPI REST APIs), with a structured approach to reliability, evaluation, and stakeholder-driven requirements.”

Python FastAPI Flask TypeScript REST APIs System Design+101

View profile

Sam Wiley

Screened

Junior Full-Stack Software Engineer specializing in web, mobile, and cloud infrastructure

New York, NY1y exp

Omega BlackLehigh University

“Built a demo-live LangGraph/LangSmith LLM agent that translates natural language into SQL against a self-built MLB statistics database, using a vector-store knowledge base of example queries. Focused on predictable orchestration via conditional nodes, YAML-driven behaviors, and tool-gated function calling, with testing via LangSmith and Python scripts.”

API Development Bash C C++Cybersecurity Data Visualization+44

View profile

Santhoshi Priya Sunchu

Screened

Mid-level Data Scientist specializing in NLP and predictive modeling

Massachusetts, USA5y exp

Blue Cross Blue Shield of MassachusettsUniversity of Massachusetts Dartmouth

“AI/ML practitioner in healthcare/insurance (Blue Cross Blue Shield) who built and deployed a production NLP system to classify patient risk from unstructured clinical notes. Experienced in end-to-end pipeline orchestration (Airflow, AWS Step Functions/Lambda/SageMaker) and real-time optimization (BERT to DistilBERT on AWS GPUs), with strong clinician collaboration to drive adoption.”

Python SQL R NumPy Pandas Scikit-learn+147

View profile

Mohith Venkata

Screened

Mid-level Full-Stack Developer specializing in cloud-native APIs and data workflows

Tukwila, WA4y exp

Reshmi’s Group Inc.Seattle University

“Built and owned end-to-end ordering and inventory/order management systems for a wholesale distributor, delivering an MVP quickly and iterating based on direct observation of daily users. Experienced with TypeScript/React + Node.js layered architectures and microservices using RabbitMQ, including real-world scaling issues (duplicates, backpressure) and observability practices (correlation IDs, structured logging).”

Python Java JavaScript TypeScript C++C#+147

View profile

Surya Danturty

Screened

Intern AI/ML Engineer specializing in computer vision and time-series forecasting

Riverside, CA0y exp

University of California, RiversideUC Riverside

“Undergrad who built a production RAG chatbot for a messy college website using OpenAI embeddings + FAISS, overcoming hard-to-crawl/non-selectable site content and strict API budget limits. Applies information-retrieval best practices (section-based chunking with overlap, precision/recall evaluation) and reliability techniques (edge-case testing, similarity thresholds, fallback responses), and has experience scaling similar indexing work to ~300,000 Wikipedia pages.”

C Python Java JavaScript SQL HTML+74

View profile

Tejasri Alla

Screened

Mid-level Full-Stack Python Developer specializing in banking microservices

Northridge, CA5y exp

Simmons BankCalifornia State University, Northridge

“Built and led production LLM-agent systems in enterprise environments (Simmons Bank, Mindtree), automating support ticket triage end-to-end with strong reliability engineering (99.9% uptime, Prometheus/Grafana, ECS autoscaling, CI/CD rollback). Demonstrated clear business impact (55% faster handling, SLA compliance 72%→96%, 800+ hours saved/month, +18% CSAT) and mature eval/feedback loops that improved extraction accuracy by 21%.”

Python TypeScript SQL Data Structures & Algorithms Object-Oriented Programming (OOP)Design Patterns+101

View profile

Abhiroopsudansh Karengula

Screened

Mid-level AI/ML Engineer specializing in production ML, RAG systems, and MLOps

KS, USA4y exp

Black & VeatchUniversity of Central Missouri

“Built and shipped a widely adopted, production-grade RAG internal search assistant that unified scattered engineering knowledge, deployed as a FastAPI service on Kubernetes with FAISS + LangChain. Demonstrates deep practical expertise in retrieval tuning (chunking, hybrid search, re-ranking) and in making LLM workflows reliable in production via guardrails, monitoring, and evaluation, plus strong cross-functional delivery with non-technical operations teams.”

Python R C++Java SQL Bash+161

View profile

Keerthi Basam

Screened

Mid-level Software Engineer specializing in AI/ML for FinTech and Healthcare

United States4y exp

IBMWright State University

“Built and deployed an end-to-end fintech product, FinSight, for bank statement analysis and financial Q&A using a production-style RAG architecture. Stands out for combining FastAPI, OpenAI embeddings, FAISS, hybrid SQL/vector retrieval, and practical reliability work like chunking optimization, validation, and low-latency performance tuning.”

Python Java SQL JavaScript TypeScript LangChain+134

View profile

Austin Pierce-Ptak

Screened

Mid-level Software Engineer specializing in full-stack and ETL systems

Seattle, WA5y exp

BroadridgeLoyola University Maryland

“Backend engineer with end-to-end ownership experience across enterprise SaaS and high-volume data systems, including PostgreSQL/.NET services at Visual Lease and ETL pipelines at Broadridge processing millions of records for Fortune 500 clients. Stands out for combining production support, observability thinking, and pragmatic architecture tradeoffs, while also experimenting with LLM-powered job application automation using Claude.”

Python Java C#C C++JavaScript+80

View profile

sahithi A

Screened

Mid-level AI Engineer specializing in LLM agents and RAG for health-tech

Remote6y exp

Milton AITexas Tech University

“Backend engineer with health-tech AI platform experience who designed a modular FastAPI/PostgreSQL architecture supporting real-time user data and swap-in AI workflows. Has hands-on production experience with observability (CloudWatch, structured logging, LangSmith/LangGraph/LangChain tracing), secure auth (OAuth2/JWT, RBAC, RLS), and careful data-pipeline migrations using parallel runs and rollback planning.”

Agile AI Agents API Integration AWS Backend Development CI/CD+121

View profile

Billy Y

Screened

Junior Software Engineer specializing in Full-Stack and GenAI/LLM applications

San Jose, CA2y exp

ZymebalanzBoston University

“LLM/RAG practitioner building clinician-facing AI search and Q&A inside EHR workflows, focused on trust, latency, and safety (grounded answers with citations, PHI controls, encryption/audit logs). Demonstrated real-time incident response for production LLM systems (e.g., fixing a metadata-filter deployment regression to prevent irrelevant results/cross-patient leakage) and strong demo/enablement skills for mixed technical and clinical stakeholders; also shipped a multi-model RAG tool at OrbeX Labs with upload/search/audit features for day-to-day adoption.”

Python C++Java C HTML JavaScript+174

View profile

Software Engineers Machine Learning Engineers Data Analysts Data Scientists Data Engineers Research Assistants Data & Analytics Engineering AI & Machine Learning Education

Need someone specific?

AI Search

Related

Need someone specific?