Vetted Data Cleaning Professionals

Pre-screened and vetted.

NS

Senior Data Scientist specializing in healthcare ML, LLMs, and responsible AI

Morris Plains, NJ4y exp
CignaUniversity at Buffalo

Clinical data scientist who has built an agentic LLM-powered literature review assistant (with RAG-style storage/retrieval) to identify predictors for downstream predictive modeling. Also delivered a patient-focused progression analysis model using Databricks + Airflow orchestration, partnering closely with clinicians to define targets and validate that model insights aligned with clinical expectations.

View profile
VS

Mid-level Machine Learning Engineer specializing in deep learning and generative AI

San Jose, CA5y exp
MetLifeUniversity of Alabama at Birmingham

ML/NLP engineer with hands-on experience building production systems for unstructured insurance claims and customer data linking. Delivered measurable impact at scale (millions of documents), combining transformer-based NLP, vector search (FAISS/Pinecone), and human-in-the-loop validation, and has strong production workflow/observability practices (Airflow, AWS Batch, Grafana/Prometheus).

View profile
AC

Principal Data Scientist specializing in cybersecurity ML and MLOps

New York, NY15y exp
Beyond IdentityIowa State University

ML/NLP engineer (Beyond Identity) who built production semantic search and entity-resolution systems over internal security documentation, using LDA + BERT embeddings with FAISS/Pinecone to cut search time by 30%. Also scaled a real-time anomaly detection pipeline to millions of events/day with Spark and AWS Lambda, with strong emphasis on measurable validation (Precision@k, MRR, F1, ARI).

View profile
PremKumar Gandla - Mid-level AI/ML Engineer specializing in MLOps, NLP, and scalable model deployment in Texas, USA

Mid-level AI/ML Engineer specializing in MLOps, NLP, and scalable model deployment

Texas, USA4y exp
BlackbaudSouthern Arkansas University

Built and deployed a production autonomous AI data analyst agent (LangChain + GPT + Streamlit on AWS) that turns natural-language questions into validated SQL, visualizations, and insights, cutting manual analysis time by ~50%. Emphasizes reliability and MLOps: schema-aware validation/guardrails to prevent hallucinations, scalable large-data processing, and Azure DevOps CI/CD + MLflow for automated deployment and experiment tracking.

View profile
SS

Junior Data Analyst specializing in BI, ETL, and reporting

Fort Lauderdale, FL3y exp
AdaniFlorida Atlantic University

Analytics professional with hands-on experience building SQL and Python workflows across SAP, Oracle, and internal operational systems, processing roughly 5 million records per month. They combine strong data quality rigor with stakeholder-friendly Power BI reporting, and cite a concrete impact of cutting reporting turnaround time from four days to two while surfacing cost anomalies for business teams.

View profile
Ramya Sree Kanijam - Mid-level Software Engineer specializing in backend systems, cloud, and AI pipelines in Remote, USA

Mid-level Software Engineer specializing in backend systems, cloud, and AI pipelines

Remote, USA3y exp
NetomiTexas A&M University-Corpus Christi

Built and owned an end-to-end AI-driven content enrichment pipeline for a news workflow, using n8n, LLM agents, and external APIs to automate ingestion, deduplication, categorization, and approval routing. Stands out for production-minded AI systems work: they improved reliability with schema validation, retries, idempotency, and monitoring, while automating 90% of processing and cutting duplication errors by 95%+.

View profile
AA

Junior Software Engineer specializing in AI/ML, data pipelines, and cloud APIs

San Jose, CA3y exp
TCSCalifornia State University, Chico

Hands-on AI/LLM practitioner who built a RAG-based customer support chatbot and tackled production issues like data chunking complexity and response-time lag. Uses techniques such as overlapping chunks, semantic search, context engineering, and query routing, and has experience presenting technical demos/workshops to developer audiences.

View profile
VV

Vaidik Vyas

Screened

Mid-Level AI Backend Engineer specializing in Python, LLM/RAG, and healthcare/insurance platforms

Franklin, NJ5y exp
MetLifeNJIT

AI Backend Engineer in MetLife’s claims technology group who built and deployed a production LLM-based decision support system that helps claim adjusters quickly find relevant policy rules from long PDFs and historical notes. Designed it as multiple production-grade services with retrieval-first guardrails, continuous validation, and Airflow-orchestrated pipelines for ingestion, embeddings, and vector index updates to keep the system reliable as policies and data evolve.

View profile
AZ

Mid-Level Software Engineer specializing in Generative AI and LLM applications

Johnston, Iowa4y exp
CortevaNortheastern University

Built and deployed a production RAG-based AI assistant for sales reps to unify access to product info, pricing, and internal documents across multiple systems. Implemented ETL pipelines for normalization/chunking/embeddings, integrated the assistant into internal React/TypeScript UIs with user-specific context, and enforced security with private vector storage and permission-filtered retrieval.

View profile
TP

Thilak P

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and big data pipelines

5y exp
W. R. BerkleySacred Heart University

Backend/data engineer who builds Python (FastAPI) data-processing API services for internal analytics/reporting, emphasizing modular architecture, async performance tuning, and reliability patterns (health checks, retries, observability). Also migrated legacy on-prem ETL pipelines to Azure using ADF/Data Lake/Functions and implemented a near-real-time ingestion flow with Event Hubs plus watermarking to handle late events and deduplication.

View profile
YP

Mid-level AI Engineer specializing in LLMs, RAG, and data engineering

Boston, MA5y exp
Humanitarians.AINortheastern University

AI Engineer Co-Op at Northeastern University who built a production Patient Persona Chat Bot to help nursing students practice clinical interactions, fine-tuning Llama 3 and integrating a LangChain + Pinecone RAG pipeline deployed on Amazon Bedrock. Emphasizes clinical accuracy and reliability with guardrails, retrieval filtering, and continuous evaluation, and also brings strong data engineering/orchestration experience (Airflow, EMR/PySpark, ADF, dbt, Databricks, Snowflake).

View profile
KR

Mid-level AI Engineer & Data Scientist specializing in LLMs, RAG, and multimodal systems

Tempe, AZ5y exp
HCLTechArizona State University

LLM/GenAI engineer who built a production AI-powered credit risk policy summarization and compliance alerting platform at HCL Tech, focused on factual accuracy and auditability for a financial client. Implemented a multi-retriever LangChain RAG architecture with citations-only prompting, fallback agents, and human-in-the-loop legal review—cutting manual review time by 35% and scaling to 12 teams.

View profile
NB

Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions

Tampa, FL4y exp
LumenUniversity of South Florida

AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail Maven dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.

View profile
SP

saran palle

Screened

Mid-level Applied AI Engineer specializing in agentic LLM workflows

North Carolina4y exp
Acentrik Technology SolutionsUniversity at Buffalo

AI engineer with production experience building a LangGraph-based, stateful multi-agent system at MetLife to automate complex insurance claims adjudication, integrating document discovery, Azure Document Intelligence OCR/extraction, and health data analysis. Strong in agent orchestration and production deployment (Docker + FastAPI REST APIs), with a structured approach to reliability, evaluation, and stakeholder-driven requirements.

View profile
SW

Sam Wiley

Screened

Junior Full-Stack Software Engineer specializing in web, mobile, and cloud infrastructure

New York, NY1y exp
Omega BlackLehigh University

Built a demo-live LangGraph/LangSmith LLM agent that translates natural language into SQL against a self-built MLB statistics database, using a vector-store knowledge base of example queries. Focused on predictable orchestration via conditional nodes, YAML-driven behaviors, and tool-gated function calling, with testing via LangSmith and Python scripts.

View profile
Santhoshi Priya Sunchu - Mid-level Data Scientist specializing in NLP and predictive modeling in Massachusetts, USA

Mid-level Data Scientist specializing in NLP and predictive modeling

Massachusetts, USA5y exp
Blue Cross Blue Shield of MassachusettsUniversity of Massachusetts Dartmouth

AI/ML practitioner in healthcare/insurance (Blue Cross Blue Shield) who built and deployed a production NLP system to classify patient risk from unstructured clinical notes. Experienced in end-to-end pipeline orchestration (Airflow, AWS Step Functions/Lambda/SageMaker) and real-time optimization (BERT to DistilBERT on AWS GPUs), with strong clinician collaboration to drive adoption.

View profile
Mohith Venkata - Mid-level Full-Stack Developer specializing in cloud-native APIs and data workflows in Tukwila, WA

Mid-level Full-Stack Developer specializing in cloud-native APIs and data workflows

Tukwila, WA4y exp
Reshmi’s Group Inc.Seattle University

Built and owned end-to-end ordering and inventory/order management systems for a wholesale distributor, delivering an MVP quickly and iterating based on direct observation of daily users. Experienced with TypeScript/React + Node.js layered architectures and microservices using RabbitMQ, including real-world scaling issues (duplicates, backpressure) and observability practices (correlation IDs, structured logging).

View profile
Surya Danturty - Intern AI/ML Engineer specializing in computer vision and time-series forecasting in Riverside, CA

Intern AI/ML Engineer specializing in computer vision and time-series forecasting

Riverside, CA0y exp
University of California, RiversideUC Riverside

Undergrad who built a production RAG chatbot for a messy college website using OpenAI embeddings + FAISS, overcoming hard-to-crawl/non-selectable site content and strict API budget limits. Applies information-retrieval best practices (section-based chunking with overlap, precision/recall evaluation) and reliability techniques (edge-case testing, similarity thresholds, fallback responses), and has experience scaling similar indexing work to ~300,000 Wikipedia pages.

View profile
Tejasri Alla - Mid-level Full-Stack Python Developer specializing in banking microservices in Northridge, CA

Tejasri Alla

Screened

Mid-level Full-Stack Python Developer specializing in banking microservices

Northridge, CA5y exp
Simmons BankCalifornia State University, Northridge

Built and led production LLM-agent systems in enterprise environments (Simmons Bank, Mindtree), automating support ticket triage end-to-end with strong reliability engineering (99.9% uptime, Prometheus/Grafana, ECS autoscaling, CI/CD rollback). Demonstrated clear business impact (55% faster handling, SLA compliance 72%→96%, 800+ hours saved/month, +18% CSAT) and mature eval/feedback loops that improved extraction accuracy by 21%.

View profile
AK

Mid-level AI/ML Engineer specializing in production ML, RAG systems, and MLOps

KS, USA4y exp
Black & VeatchUniversity of Central Missouri

Built and shipped a widely adopted, production-grade RAG internal search assistant that unified scattered engineering knowledge, deployed as a FastAPI service on Kubernetes with FAISS + LangChain. Demonstrates deep practical expertise in retrieval tuning (chunking, hybrid search, re-ranking) and in making LLM workflows reliable in production via guardrails, monitoring, and evaluation, plus strong cross-functional delivery with non-technical operations teams.

View profile
KB

Keerthi Basam

Screened

Mid-level Software Engineer specializing in AI/ML for FinTech and Healthcare

United States4y exp
IBMWright State University

Built and deployed an end-to-end fintech product, FinSight, for bank statement analysis and financial Q&A using a production-style RAG architecture. Stands out for combining FastAPI, OpenAI embeddings, FAISS, hybrid SQL/vector retrieval, and practical reliability work like chunking optimization, validation, and low-latency performance tuning.

View profile
Austin Pierce-Ptak - Mid-level Software Engineer specializing in full-stack and ETL systems in Seattle, WA

Mid-level Software Engineer specializing in full-stack and ETL systems

Seattle, WA5y exp
BroadridgeLoyola University Maryland

Backend engineer with end-to-end ownership experience across enterprise SaaS and high-volume data systems, including PostgreSQL/.NET services at Visual Lease and ETL pipelines at Broadridge processing millions of records for Fortune 500 clients. Stands out for combining production support, observability thinking, and pragmatic architecture tradeoffs, while also experimenting with LLM-powered job application automation using Claude.

View profile
SA

sahithi A

Screened

Mid-level AI Engineer specializing in LLM agents and RAG for health-tech

Remote6y exp
Milton AITexas Tech University

Backend engineer with health-tech AI platform experience who designed a modular FastAPI/PostgreSQL architecture supporting real-time user data and swap-in AI workflows. Has hands-on production experience with observability (CloudWatch, structured logging, LangSmith/LangGraph/LangChain tracing), secure auth (OAuth2/JWT, RBAC, RLS), and careful data-pipeline migrations using parallel runs and rollback planning.

View profile
BY

Billy Y

Screened

Junior Software Engineer specializing in Full-Stack and GenAI/LLM applications

San Jose, CA2y exp
ZymebalanzBoston University

LLM/RAG practitioner building clinician-facing AI search and Q&A inside EHR workflows, focused on trust, latency, and safety (grounded answers with citations, PHI controls, encryption/audit logs). Demonstrated real-time incident response for production LLM systems (e.g., fixing a metadata-filter deployment regression to prevent irrelevant results/cross-patient leakage) and strong demo/enablement skills for mixed technical and clinical stakeholders; also shipped a multi-model RAG tool at OrbeX Labs with upload/search/audit features for day-to-day adoption.

View profile

Need someone specific?

AI Search