Reval Logo

Vetted Data Cleaning Professionals

Pre-screened and vetted.

KS

Mid-level AI/ML Engineer specializing in Generative AI and LLMOps

USA6y exp
UnitedHealth GroupKent State University

Built and deployed a GPT-based RAG enterprise search system for healthcare clinicians, emphasizing low-latency performance and reduced hallucinations while maintaining end-to-end HIPAA compliance. Demonstrates deep applied experience with PHI-safe data governance (detection/redaction/de-identification), secure Azure ML deployment patterns, and orchestration of production LLM workflows using LangChain and Airflow.

View profile
SP

Mid-level Data Analyst specializing in AI/ML and advanced analytics

USA3y exp
AccentureMurray State University

Accenture data/ML practitioner who deployed a retail churn prediction and BERT-based sentiment analysis system to production, integrating behavioral + feedback data and operationalizing it with ETL automation, orchestration, and CI/CD. Experienced managing 2TB+ multi-source data, monitoring drift in Databricks, and translating results into Power BI dashboards for marketing teams (including K-means customer segmentation).

View profile
HW

Huihai Wang

Screened

Mid-level Applied AI Engineer specializing in knowledge graphs, GraphRAG, and urban mobility

Austin, TX5y exp
Urban Information Lab, The University of Texas at AustinUniversity of Texas at Austin

ML/NLP practitioner focused on knowledge-graph-based retrieval for LLM question answering, including an urban/autonomous-vehicle decision-making use case. Built a hierarchical GraphRAG + vector database system and an entity-resolution pipeline that blends spatial and semantic similarity, validated using LLM-generated synthetic datasets; uses Python tooling like RDFLib, GraphDB, OpenAI APIs, and LangChain.

View profile
BS

Senior Software Engineer specializing in distributed systems and FinTech

Washington, USA6y exp
Principal Financial GroupTrine University

Data/analytics-focused engineer who builds end-to-end KPI reporting and validation products used daily by plant leads and leadership to track yield, downtime, and defects. Combines Python/SQL + Power BI data pipelines with strong data-quality practices (automated validation, monitoring/alerts) and has experience designing scalable frontend architecture in TypeScript/React and working in distributed/microservices-style data systems.

View profile
SP

Soham Patil

Screened

Junior Cloud & AI/ML Engineer specializing in AWS GovCloud and MLOps

Washington, DC2y exp
IBMTexas Tech University

Robotics software engineer with hands-on ROS 2 autonomy experience on an obstacle-avoiding quadrotor (ROS 2 + Gazebo + PX4 + Nav2/SLAM), including custom work to extend Nav2 into a 3D aerial domain and output PX4 trajectory setpoints. Also built cost-saving ML infrastructure (PostgreSQL + AWS data-cleaning pipeline) and improved object detection accuracy by 40% using CUDA/PyTorch, with strong containerization and CI/CD practices (Docker + Kubernetes, aggressive version pinning) to prevent environment drift.

View profile
MS

Executive Operations & GTM Leader specializing in startups across logistics, esports, and civic tech

Redwood City, CA15y exp
Erebor ManagementUCLA

Founder/CEO who built Hometown Heart from the ground up—creating SOPs and standing up hiring, GTM, finance, and investor/government relations—scaling from 6 employees and $100K debt to 350 employees and $40M in annual revenue in 3 years. Led expansion into San Francisco County, proactively managing compliance/licensing and municipal stakeholders to secure early approvals and drive major revenue growth.

View profile
AR

Mid-level Full-Stack Software Engineer specializing in Java/Spring microservices and AWS

Ohio, United States3y exp
Fifth Third BankUniversity of Houston

Backend/platform engineer who has owned a real-time business analytics dashboard backend (Python/Flask/MongoDB) and built Kafka event-streaming pipelines with idempotent processing and DLQs. Strong DevOps/GitOps experience deploying containerized microservices to AWS EKS with CI/CD (Jenkins/GitHub Actions/CodePipeline) and ArgoCD auto-sync/drift detection, plus hands-on support for phased hybrid cloud/on-prem migrations using feature flags and replication.

View profile
AM

Mid-level Data Analyst specializing in healthcare and financial analytics

USA3y exp
Cardinal HealthIndiana Tech

Built and productionized an LLM-powered clinical documentation and insights pipeline at Cardinal Health using LangChain + GPT-4 with RAG to summarize long clinical notes, extract medication/dosage entities, and generate structured SQL-ready outputs for downstream analytics. Emphasizes clinical reliability via labeled benchmarking (precision/recall/F1), shadow deployments, clinician human-in-the-loop review, and ongoing monitoring/orchestration with Airflow, Lambda, S3, Postgres, and Power BI.

View profile
AS

Anuj Shah

Screened

Senior Data Analyst specializing in cloud data platforms, experimentation, and predictive analytics

GA, USA9y exp
UnitedHealth GroupNorthwestern Polytechnic University

Healthcare data/ML practitioner with experience at UnitedHealth Group building production ETL and streaming pipelines (Python, BigQuery, Kafka) that unify EHR, IoT device, and lab data for patient risk prediction. Also implemented embedding-based semantic search/linking for noisy clinical notes via domain adaptation and rigorous validation with clinical stakeholders; previously built churn prediction at DirecTV using XGBoost.

View profile
PK

Phani K

Screened

Mid-level AI/ML Engineer specializing in NLP, computer vision, and Generative AI

Indiana, USA4y exp
UnitedHealth GroupIndiana State University

Built and deployed a production LLM-powered clinical insights/summarization assistant for healthcare teams, including a Spark+Airflow pipeline, fine-tuned transformer models, and a FastAPI Docker service on AWS. Demonstrates strong MLOps/LLMOps depth (Airflow on Kubernetes, custom AWS operators/IAM, MLflow, CloudWatch) and practical reliability work like hallucination mitigation, confidence scoring, and retrieval-backed evaluation with shadow deployments.

View profile
KE

Kamal Ede

Screened

Mid-level Data Engineer specializing in cloud data platforms, Spark, and streaming pipelines

MO, USA4y exp
S&P GlobalUniversity of Central Missouri

Data/MLOps engineer (Cognizant background) who owned an AWS/Airflow/Snowflake healthcare transactions pipeline processing ~8–10M records/day and cut pipeline/data-quality incidents by ~33%. Also built and deployed a production FastAPI model-inference service on Kubernetes (Docker, HPA) with strong observability (Prometheus/Grafana), versioned endpoints, and resilient backfill/idempotent external data ingestion patterns.

View profile
KP

Mid-level Data Analytics & ML Engineer specializing in NLP, LLMs, and cloud data platforms

Dallas, TX5y exp
MattelKennesaw State University

At KPMG, built and productionized a secure RAG-based LLM assistant that lets business and risk stakeholders query data warehouses in natural language, reducing dependence on data engineers for ad-hoc analysis. Demonstrates strong production rigor (Airflow orchestration, CI/CD, containerization), retrieval/embedding tuning (rechunking, semantic abstraction for structured data), and reliability controls (confidence thresholds, refusal behavior, monitoring and canary evals).

View profile
RR

Mid-level Data Scientist & Machine Learning Engineer specializing in fraud and forecasting

USA5y exp
JPMorgan ChaseUniversity of Texas at Dallas

ML/LLM practitioner who has shipped production RAG systems (summarization + Q&A) and end-to-end Airflow-orchestrated demand forecasting pipelines at NEON IT. Strong focus on reliability—uses evaluation scripts, retrieval/chunking tuning, validation/retries/alerts, and stakeholder-driven iteration to make AI workflows consistent and usable.

View profile
FX

Junior Machine Learning Engineer specializing in NLP and biomedical entity extraction

Boston, MA2y exp
Northeastern UniversityNortheastern University

Built and deployed a production LLM-powered biomedical knowledge extraction pipeline that processed millions of papers to identify tools/techniques and produce a unified knowledge graph via active learning NER (Prodigy + spaCy transformers) and entity linking (Bio-tools/Wikidata). Addressed hard NLP engineering challenges like WordPiece span-offset alignment and scaled inference over ~1.5M documents using batching/caching, containerized services, async workers, and orchestration with Prefect/Airflow.

View profile
IK

Junior ML Engineer specializing in energy forecasting and battery optimization

San Carlos, CA3y exp
ElecricFishUniversity of Michigan

Backend/ML engineer working on a battery energy storage system operations dashboard: built a Flask backend integrated with OAuth and a separate FastAPI optimization/simulation service, deployed via Docker CI/CD to Azure Container Apps. Strong in productionizing ML (AzureML to batch endpoints) and in performance/scalability patterns (Postgres indexing/JSONB, per-unit data isolation, async throttling + caching for year-long CPU-intensive simulations across 40+ scenarios).

View profile
NP

Neha P

Screened

Mid-level Full-Stack Java Developer specializing in cloud-native microservices

Texas, State4y exp
Bank of AmericaUniversity of Central Missouri

Full-stack engineer with Bank of America experience modernizing a large-scale financial reporting platform. Built React frontends and Java/Spring Boot microservice APIs end-to-end, optimized data-heavy SQL performance (indexing/caching/pagination), and implemented an AI feature for forecasting and anomaly detection using Python/scikit-learn, with deployments supported on AWS.

View profile
HK

Mid-level Data Scientist specializing in Generative AI and NLP

USA6y exp
CVS HealthUniversity of Central Missouri

ML/GenAI engineer with recent CVS Health experience building a production RAG system over unstructured financial/research documents using LangChain, FAISS, and Pinecone, plus LoRA/PEFT fine-tuning of GPT/LLaMA for domain-aware summarization. Demonstrates strong applied MLOps and data engineering skills (Airflow/Prefect, Docker/Kubernetes, CI/CD, MLflow) and measurable impact (sub-second retrieval, ~40% better context retrieval, ~25% entity matching improvement).

View profile
SL

Mid-level Data Engineer specializing in cloud ETL/ELT and healthcare analytics

Dallas, TX5y exp
Lightbeam Health SolutionsSyracuse University

Healthcare-focused data engineer/ML practitioner with experience at Lightbeam Health Solutions and Humana building production entity-resolution and semantic similarity pipelines across EMR, lab, and claims data. Uses NLP/ML (spaCy, scikit-learn, BioBERT/LightGBM) plus Snowflake/Airflow and vector search (Pinecone) to improve linkage accuracy (reported 90%) and semantic match quality (reported +12–15%), while reducing manual cleanup by 40%+.

View profile
VA

Senior AI/ML Engineer specializing in Generative AI, RAG, and agentic systems

6y exp
Wellmark Blue Cross and Blue ShieldIndiana Wesleyan University

GenAI/LLM ML engineer (currently at Webprobo) building an enterprise GenAI platform with document intelligence and automation on AWS and blockchain. Has hands-on experience with RAG, LLM evaluation tooling, and orchestrating production LLM workflows with Apache Airflow, plus deep exposure to reliability challenges in globally distributed/edge deployments. Also partnered with business/marketing stakeholders at a banking client to deliver an AI-driven customer retention insights solution.

View profile
KP

Senior AI Engineer specializing in Generative AI and RAG applications

8y exp
Keurig Dr PepperGeorge Mason University

AI engineer who has shipped production LLM systems across customer service and marketing use cases—building a RAG app on Azure OpenAI and speeding retrieval with Redis caching tied to Okta sessions. Also implemented a LangGraph multi-agent workflow that pulls image context from Figma to generate structured HTML marketing emails, adding a verification agent to improve image-selection accuracy while optimizing solution cost for business stakeholders.

View profile
SS

Saniya Shinde

Screened

Mid-level Data Scientist specializing in NLP, LLMs, and RAG systems

Washington, DC4y exp
World BankGeorge Washington University

Built and deployed a production-style vision-language pipeline that generates structured medical reports from chest X-rays using BioViLT embeddings, an image-text alignment module, and BiGPT fine-tuned with LoRA, delivered via Streamlit and hosted on AWS EC2. Also collaborating experience presenting EDA findings, feature importance, and model performance to Ford managers while working with vehicle parts data at Bimcon.

View profile
JL

Junior Machine Learning Engineer specializing in LLMs, NLP, and computer vision

Bengaluru, Karnataka2y exp
PwCArizona State University

Built a production, agentic multi-agent pharmaceutical intelligence system for US oncology (breast cancer) conference/news intelligence, automating MSL-style information gathering and summarization for pharma and healthcare stakeholders. Uses CrewAI + LangChain orchestration, custom scraping across ~15 pharma newsrooms, and a grounding-score evaluation approach (sentence transformers/cosine similarity) to mitigate hallucinations.

View profile
TA

Mid-level Solutions & Pre-Sales Manager specializing in HRMS, analytics, and multi-cloud AI

CA, USA3y exp
University of California, RiversideUC Riverside

Enterprise implementation/deployment specialist focused on HRMS and payroll systems across APAC customers, combining cloud/hybrid (AWS/Azure/GCP) integration work with strong client-facing delivery. Demonstrated ability to debug complex production issues across application, database, and network layers (e.g., isolating VPN/router congestion) and to tailor Python-based data cleaning/scoring/utilities to customer-specific workflows.

View profile
NM

Mid-level Data Scientist/ML Engineer specializing in healthcare AI and MLOps

USA4y exp
CVS HealthUniversity at Buffalo

Designed and deployed an enterprise LLM-powered clinical/pharmacy policy knowledge assistant at CVS Health, replacing manual searches across PDFs/Word/SharePoint with a HIPAA-compliant RAG system. Built end-to-end ingestion and orchestration (Airflow + Azure ML/Data Lake + vector index) with PHI masking, versioned re-embedding, and production monitoring (Prometheus/Grafana), and partnered closely with clinicians/compliance to ensure policy-grounded, auditable answers.

View profile

Need someone specific?

AI Search