Vetted Data Cleaning Professionals

Pre-screened and vetted.

JC

Mid-level Machine Learning Engineer specializing in LLMs, NLP, and MLOps

USA | 5y exp
McKesson | SUNY

Built a production LLM-RAG system at McKesson to let internal healthcare operations teams query large volumes of unstructured operational documents via natural language with source-backed answers, designed with HIPAA/FHIR compliance in mind. Demonstrated strong production engineering across hallucination mitigation, retrieval quality tuning, and latency/scalability optimization, using LangChain/LangGraph and Airflow plus rigorous evaluation/monitoring practices.

Manish Vemula

Screened

Mid-level Machine Learning Engineer specializing in real-time pipelines and NLP/GenAI

TX, USA | 4y exp
Discover | Central Michigan University

ML/MLOps practitioner from Discover Financial who built and deployed a real-time AI fraud detection platform (LSTM + VAE) on AWS SageMaker with Docker/FastAPI and Jenkins-driven CI/CD. Demonstrated measurable impact (30% accuracy lift, 25% fewer false alerts) and deep expertise in class-imbalance mitigation, drift monitoring, and orchestration (Airflow/Kubeflow), plus strong stakeholder adoption via Power BI dashboards for fraud/compliance teams.

TW

Senior Data Analytics & Data Science professional specializing in Financial Services

4y exp
Infosys | Georgia State University

Worked on large financial analytics datasets combining complaint text, transaction logs, and demographics; built end-to-end NLP/ML pipelines (TF-IDF + Random Forest) and data integration in BigQuery with Tableau reporting, citing ~95–98% accuracy. Also implemented entity resolution with fuzzy matching and semantic linking using BERT sentence-transformer embeddings stored in FAISS, including fine-tuning on labeled pairs to improve search/linking relevance.

Dhairya Desai

Screened

Senior AI/ML Engineer specializing in healthcare NLP and predictive analytics

Chicago, IL | 13y exp
Optum | University of Texas at Dallas

ML/NLP engineer with healthcare and industrial IoT experience: built an Optum pipeline that converted 2M+ physician notes into structured entities and linked them with claims/pharmacy data to create an actionable patient timeline. Deep hands-on expertise in production NER, entity resolution, and hybrid search (Elasticsearch + embeddings/FAISS), plus robust data engineering practices (Airflow, Spark, data contracts, auditability) and experimentation-to-production rollout via shadow mode and feature flags.

Meghana Nandivada

Junior Machine Learning Engineer specializing in production ML systems and MLOps

2y exp
TCS | Stevens Institute of Technology

ML/AI engineer (TCS) who built and productionized a customer segmentation and personalized-offer recommendation pipeline end-to-end (data cleaning/feature engineering/clustering through Flask API deployment in Docker with monitoring). Emphasizes reliability and operational rigor via validation checks, periodic retraining, model/API versioning, and latency optimization, and has experience translating marketing KPIs into usable dashboards for non-technical teams.

Ponugoti Sushma

Mid-level Machine Learning Engineer specializing in IoT, edge AI, and enterprise ML

Texas, USA | 5y exp
Allstate | Texas A&M University-Corpus Christi

Built and productionized an LLM/RAG question-answering service over technical documentation, focusing on retrieval quality (reranking + IR metrics), latency, and scaling. Experienced orchestrating end-to-end ETL/ML workflows with Airflow/Prefect/AWS Step Functions and improving reliability via parallelism, retries, and shadow testing. Also delivered an explainable healthcare risk-flagging classifier with a stakeholder-friendly dashboard for a non-technical program manager.

Akhil Bharadwaj Mateti

Mid-level Software Engineer specializing in Data Science and Machine Learning

Arlington, Virginia | 4y exp
ElevateMe | George Washington University

Robotics/AV perception engineer who built a semantic-segmentation road detection system and integrated it into a ROS-based real-time pipeline (ROS bag camera feed to live monitor) achieving ~12 FPS. Strong in practical deployment work: solved multi-library versioning issues (ROS/OpenCV/TensorFlow), containerized the stack with Docker, and optimized inference by shifting runtime to C++ for large latency gains on NVIDIA hardware.

Andrew Clayman

Senior Data Scientist specializing in ML, NLP, and production AI systems

Remote | 8y exp
Appstem | University of Southampton

Machine learning/NLP engineer with deep Azure stack experience (Data Factory, Databricks/Spark, Delta Lake, Azure OpenAI, Azure AI Search) who built end-to-end production systems for semantic clustering, entity resolution, and hybrid search. Demonstrated measurable gains from embedding fine-tuning (~15% retrieval precision, ~10–12% nDCG@10) and designed scalable, quality-checked pipelines with MLOps best practices.

Sreedivya Nagalli

Junior AI/ML Engineer specializing in deep learning and full-stack ML applications

2y exp
Amrita Vishwa Vidyapeetham | University at Buffalo

Built and operated a RAG-based AI study planner (GPT-4 + FAISS) that ran in production and handled 250+ concurrent users, with real-world reliability engineering (caching, fallbacks, schema validation, Redis state, monitoring). Also has healthcare data integration experience at Medinet Analytics, standardizing messy EHR/practice-management data with canonical schemas, idempotency hashing, and compliance-grade audit trails.

Sravya Chunduri

Mid-level AI/ML Engineer specializing in LLM, NLP, and MLOps

Virginia, USA | 4y exp
Blackhawk Network | University of Maryland, Baltimore

AI/ML Engineer with 3+ years of experience spanning RAG pipelines, MLOps, large-scale data workflow automation, and resilient Playwright-based UI automation. At Blackhawk Network and Wipro, they describe shipping production systems with strong observability and compliance controls, including reducing flaky automation failures from 30% to under 2% and automating 3+ TB/day reconciliation workflows.

Abheesht Roy

Screened

Junior Software Engineer specializing in AI and distributed systems

San Francisco, CA | 2y exp
Agent-Techs AI | Arizona State University

Built and shipped a production LLM-driven data harmonization/record-matching pipeline for pharmaceutical datasets, combining normalization, embeddings/vector search, and an LLM validation step. Emphasizes production reliability via guardrails, confidence thresholds, idempotent/retryable stages, and human-in-the-loop fallbacks, with monitoring focused on manual review and error rates to reduce false positives.

GP

Mid-level Solutions Engineer specializing in enterprise SaaS and FinTech

Charlotte, NC | 4y exp
KPI Solutions | University of Cincinnati

Engineer with a solutions-engineering profile who has operated at the intersection of enterprise SaaS architecture, customer-facing technical discovery, and implementation in logistics and fintech environments. He has supported high-scale warehouse management systems processing 500,000+ daily transactions, led integration and security discussions, and improved release efficiency by 50% through CI/CD automation.

AA

Junior Software Engineer specializing in cloud, DevOps, and applied AI security

West Lafayette, Indiana | 3y exp
Freight Pins | Purdue University

Founding engineer who built a multi-tenant AWS backend from scratch focused on ultra-fast, configuration-driven client onboarding and low operational cost. Automated tenant provisioning/deployments with Terraform + GitHub Actions (new client infra in ~13 minutes) and scaled to 62 production clients handling ~75k requests/day without a major rewrite. Hands-on with migrations (DynamoDB->MongoDB), reliability/observability, and performance tuning (indexes, Redis, queueing, connection management).

AB

Mid-level Customer/Technology Development Engineer specializing in AI and data-driven solutions

Oakland, CA | 6y exp
HarbisonWalker International | Université de Sherbrooke

Application/security-focused customer-facing implementer who has secured multi-customer data aggregation apps using per-tenant isolation, short-lived/scoped tokens, and vault-based secrets management. Troubleshoots production issues via API gateway logs and performance tuning, and runs repeatable onboarding playbooks with strong customer-specific and cross-project documentation. Emphasizes AWS least-privilege IAM and secure agent deployment patterns, plus container scanning practices that catch vulnerabilities pre-production and build developer trust.

GM

Mid-level Data Engineer specializing in Azure, Spark, and scalable ETL/ELT pipelines

Charleston, IL | 4y exp
Eastern Illinois University | Eastern Illinois University

Data engineer with banking FP&A experience who led an end-to-end migration of 10+ TB from Teradata to Azure (ADF + Data Lake + Databricks/PySpark + Synapse). Emphasizes reliability (multi-stage validation, monitoring/alerts) and performance (Spark tuning, incremental loads, autoscaling), reporting ~99.5% pipeline reliability while supporting downstream consumers with stable schemas and clear change management.

Praharsha Jandhyala

Mid-level Data Scientist/Data Analyst specializing in ML, BI dashboards, and ETL pipelines

Dallas, TX | 4y exp
Humana | Arizona State University

Data/ML practitioner with experience at Humana and Hexaware, focused on turning messy, semi-structured datasets into production-ready pipelines. Built an age-prediction model from book ratings using heavy feature engineering and multiple regression models, and has hands-on entity resolution (deterministic + fuzzy matching) plus embeddings/vector DB approaches for linking and search relevance.

Prashanth Kedri

Mid-level Machine Learning Engineer specializing in MLOps, NLP, and predictive maintenance

AL, USA | 4y exp
General Motors | Auburn University at Montgomery

ML engineer with General Motors experience deploying production AI systems, including a BERT-based sentiment classifier for over a million customer support call transcripts (reported ~91% precision), served at sub-200 ms latency through FastAPI/Docker optimization. Also built predictive maintenance models and automated retraining/monitoring workflows using Airflow and MLflow, collaborating closely with non-technical customer support stakeholders.

Sachin Dulla

Screened

Mid-level AI/ML Engineer specializing in NLP, fraud detection, and MLOps

Kentwood, MI | 3y exp
Fifth Third Bank | California State University, San Bernardino

Built and deployed a domain-specific LLM chatbot for research/support, cutting manual effort by ~50%. Demonstrates strong applied LLM engineering: RAG, prompt grounding with citations and fallbacks, embedding/top-k tuning, and production monitoring (confidence, latency, feedback loops). Experienced orchestrating agent workflows with LangChain-style pipelines and continuous evaluation to maintain reliability.

Jeet Patel

Screened

Junior AI and Backend Engineer specializing in LLM systems

Massachusetts, USA | 3y exp
Boston Wholesale Outlet Inc | Northeastern University

AI/LLM engineer who has shipped production RAG copilots and multi-agent workflows, including a real-time Llama3 (Ollama) copilot backend handling 12k+ concurrent queries at 99.9% uptime. Experienced in orchestration (Langflow/Airflow/Kubernetes), reliability evaluation (hallucination detection, p95 latency, token cost), and monitoring (Prometheus/Grafana), and has delivered stakeholder-facing analytics in Tableau.

SR

Mid-level Business Analyst specializing in healthcare data and application consulting

Houston, TX | 5y exp
Capgemini | University of Florida

Analytics professional with University of Florida experience in occupational health reporting, including bloodborne pathogen and needlestick exposure programs. Stands out for turning messy healthcare operational data into trusted, analysis-ready reporting assets using SQL and Python, while partnering closely with stakeholders to define reliable metrics and improve operational oversight.

Bryan Jones

Screened

Mid-level Data Analyst specializing in analytics, budgeting, and sports data systems

Maryland, USA | 5y exp
Department of Homeland Security | McDaniel College

Baseball advisor/recruiter with a player-development lens shaped by his own injury experience, combining TrackMan-driven analytics with deep coach and program relationships. He has helped athletes navigate high-stakes draft, rehab, and college decisions, including identifying under-scouted talent like John Klein and supporting his path to the Twins' 40-man roster.

SM

Mid-level Business Analyst specializing in healthcare data and reporting

Sacramento, CA | 3y exp
CVS Health | California State University

Worked on a CVS Health project transforming large healthcare claims data from databases and APIs into clean reporting tables and Power BI dashboards. Brings hands-on experience in SQL, Python automation, data validation, and stakeholder-driven metric definition for analytics workflows.

SC

Mid-level Software Engineer specializing in Python backend and AI/GenAI

Jersey City, NJ | 4y exp
PTC | St. Francis College

Backend/infrastructure-focused engineer building AI-agent products for small businesses, including a customer-service agent platform with intent routing, RAG over Pinecone, and external booking API integration. Has shipped Python/FastAPI services with JWT auth, versioned APIs, Docker deployments to AWS EC2 via GitHub Actions, and production monitoring with Prometheus/Grafana.
