Reval Logo

Vetted Data Cleaning Professionals

Pre-screened and vetted.

AZ

Mid-Level Software Engineer specializing in Generative AI and LLM applications

Johnston, Iowa4y exp
CortevaNortheastern University

Built and deployed a production RAG-based AI assistant for sales reps to unify access to product info, pricing, and internal documents across multiple systems. Implemented ETL pipelines for normalization/chunking/embeddings, integrated the assistant into internal React/TypeScript UIs with user-specific context, and enforced security with private vector storage and permission-filtered retrieval.

View profile
TP

Thilak P

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and big data pipelines

5y exp
W. R. BerkleySacred Heart University

Backend/data engineer who builds Python (FastAPI) data-processing API services for internal analytics/reporting, emphasizing modular architecture, async performance tuning, and reliability patterns (health checks, retries, observability). Also migrated legacy on-prem ETL pipelines to Azure using ADF/Data Lake/Functions and implemented a near-real-time ingestion flow with Event Hubs plus watermarking to handle late events and deduplication.

View profile
YP

Mid-level AI Engineer specializing in LLMs, RAG, and data engineering

Boston, MA5y exp
Humanitarians.AINortheastern University

AI Engineer Co-Op at Northeastern University who built a production Patient Persona Chat Bot to help nursing students practice clinical interactions, fine-tuning Llama 3 and integrating a LangChain + Pinecone RAG pipeline deployed on Amazon Bedrock. Emphasizes clinical accuracy and reliability with guardrails, retrieval filtering, and continuous evaluation, and also brings strong data engineering/orchestration experience (Airflow, EMR/PySpark, ADF, dbt, Databricks, Snowflake).

View profile
KR

Mid-level AI Engineer & Data Scientist specializing in LLMs, RAG, and multimodal systems

Tempe, AZ5y exp
HCLTechArizona State University

LLM/GenAI engineer who built a production AI-powered credit risk policy summarization and compliance alerting platform at HCL Tech, focused on factual accuracy and auditability for a financial client. Implemented a multi-retriever LangChain RAG architecture with citations-only prompting, fallback agents, and human-in-the-loop legal review—cutting manual review time by 35% and scaling to 12 teams.

View profile
NB

Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions

Tampa, FL4y exp
LumenUniversity of South Florida

AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail Maven dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.

View profile
MM

Mid-level Machine Learning Engineer specializing in NLP, recommender systems, and MLOps

OH, USA5y exp
General MotorsUniversity of Dayton

ML/LLM engineer with production experience at General Motors building Transformer-based search and recommendation personalization for a high-traffic vehicle platform. Delivered significant KPI gains (17% conversion lift, 14% bounce-rate reduction) and optimized real-time inference via ONNX Runtime and INT8 quantization while implementing robust MLOps (Airflow/MLflow, monitoring, drift-triggered retraining) and stakeholder-facing explainability/dashboards.

View profile
SP

Mid-level Data Scientist specializing in NLP and predictive modeling

Massachusetts, USA5y exp
Blue Cross Blue Shield of MassachusettsUniversity of Massachusetts Dartmouth

AI/ML practitioner in healthcare/insurance (Blue Cross Blue Shield) who built and deployed a production NLP system to classify patient risk from unstructured clinical notes. Experienced in end-to-end pipeline orchestration (Airflow, AWS Step Functions/Lambda/SageMaker) and real-time optimization (BERT to DistilBERT on AWS GPUs), with strong clinician collaboration to drive adoption.

View profile
SP

saran palle

Screened

Mid-level Applied AI Engineer specializing in agentic LLM workflows

North Carolina4y exp
Acentrik Technology SolutionsUniversity at Buffalo

AI engineer with production experience building a LangGraph-based, stateful multi-agent system at MetLife to automate complex insurance claims adjudication, integrating document discovery, Azure Document Intelligence OCR/extraction, and health data analysis. Strong in agent orchestration and production deployment (Docker + FastAPI REST APIs), with a structured approach to reliability, evaluation, and stakeholder-driven requirements.

View profile
SA

sahithi A

Screened

Mid-level AI Engineer specializing in LLM agents and RAG for health-tech

Remote6y exp
Milton AITexas Tech University

Backend engineer with health-tech AI platform experience who designed a modular FastAPI/PostgreSQL architecture supporting real-time user data and swap-in AI workflows. Has hands-on production experience with observability (CloudWatch, structured logging, LangSmith/LangGraph/LangChain tracing), secure auth (OAuth2/JWT, RBAC, RLS), and careful data-pipeline migrations using parallel runs and rollback planning.

View profile
BY

Billy Y

Screened

Junior Software Engineer specializing in Full-Stack and GenAI/LLM applications

San Jose, CA2y exp
ZymebalanzBoston University

LLM/RAG practitioner building clinician-facing AI search and Q&A inside EHR workflows, focused on trust, latency, and safety (grounded answers with citations, PHI controls, encryption/audit logs). Demonstrated real-time incident response for production LLM systems (e.g., fixing a metadata-filter deployment regression to prevent irrelevant results/cross-patient leakage) and strong demo/enablement skills for mixed technical and clinical stakeholders; also shipped a multi-model RAG tool at OrbeX Labs with upload/search/audit features for day-to-day adoption.

View profile
JC

Mid-level Machine Learning Engineer specializing in LLMs, NLP, and MLOps

USA5y exp
McKessonSUNY

Built a production LLM-RAG system at McKesson to let internal healthcare operations teams query large volumes of unstructured operational documents via natural language with source-backed answers, designed with HIPAA/FHIR compliance in mind. Demonstrated strong production engineering across hallucination mitigation, retrieval quality tuning, and latency/scalability optimization, using LangChain/LangGraph and Airflow plus rigorous evaluation/monitoring practices.

View profile
MV

Manish Vemula

Screened

Mid-level Machine Learning Engineer specializing in real-time pipelines and NLP/GenAI

TX, USA4y exp
DiscoverCentral Michigan University

ML/MLOps practitioner from Discover Financial who built and deployed a real-time AI fraud detection platform (LSTM + VAE) on AWS SageMaker with Docker/FastAPI and Jenkins-driven CI/CD. Demonstrated measurable impact (30% accuracy lift, 25% fewer false alerts) and deep expertise in class-imbalance mitigation, drift monitoring, and orchestration (Airflow/Kubeflow), plus strong stakeholder adoption via Power BI dashboards for fraud/compliance teams.

View profile
DD

Dhairya Desai

Screened

Senior AI/ML Engineer specializing in healthcare NLP and predictive analytics

Chicago, IL13y exp
OptumUniversity of Texas at Dallas

ML/NLP engineer with healthcare and industrial IoT experience: built an Optum pipeline that converted 2M+ physician notes into structured entities and linked them with claims/pharmacy data to create an actionable patient timeline. Deep hands-on expertise in production NER, entity resolution, and hybrid search (Elasticsearch + embeddings/FAISS), plus robust data engineering practices (Airflow, Spark, data contracts, auditability) and experimentation-to-production rollout via shadow mode and feature flags.

View profile
SS

Shimil Shijo

Screened

Senior AI Software Engineer specializing in Generative AI and NLP

Dearborn, MI6y exp
University of Michigan-DearbornUniversity of Michigan-Dearborn

Built and deployed a production multimodal language translation platform (text-to-text, speech-to-text, text-to-speech) using fine-tuned pretrained models (NLLB, XLSR), MLflow-orchestrated pipelines, and Docker/Kubernetes on AWS. Worked closely with non-technical linguists to tackle data cleaning and dialect variation in minority languages, improving accuracy through consistent evaluation and monitoring.

View profile
MN

Junior Machine Learning Engineer specializing in production ML systems and MLOps

2y exp
TCSStevens Institute of Technology

ML/AI engineer (TCS) who built and productionized a customer segmentation and personalized-offer recommendation pipeline end-to-end (data cleaning/feature engineering/clustering through Flask API deployment in Docker with monitoring). Emphasizes reliability and operational rigor via validation checks, periodic retraining, model/API versioning, and latency optimization, and has experience translating marketing KPIs into usable dashboards for non-technical teams.

View profile
PS

Mid-level Machine Learning Engineer specializing in IoT, edge AI, and enterprise ML

Texas, USA5y exp
AllstateTexas A&M University-Corpus Christi

Built and productionized an LLM/RAG question-answering service over technical documentation, focusing on retrieval quality (reranking + IR metrics), latency, and scaling. Experienced orchestrating end-to-end ETL/ML workflows with Airflow/Prefect/AWS Step Functions and improving reliability via parallelism, retries, and shadow testing. Also delivered an explainable healthcare risk-flagging classifier with a stakeholder-friendly dashboard for a non-technical program manager.

View profile
AC

Senior Data Scientist specializing in ML, NLP, and production AI systems

Remote8y exp
AppstemUniversity of Southampton

Machine learning/NLP engineer with deep Azure stack experience (Data Factory, Databricks/Spark, Delta Lake, Azure OpenAI, Azure AI Search) who built end-to-end production systems for semantic clustering, entity resolution, and hybrid search. Demonstrated measurable gains from embedding fine-tuning (~15% retrieval precision, ~10–12% nDCG@10) and designed scalable, quality-checked pipelines with MLOps best practices.

View profile
TW

Senior Data Analytics & Data Science professional specializing in Financial Services

4y exp
InfosysGeorgia State University

Worked on large financial analytics datasets combining complaint text, transaction logs, and demographics; built end-to-end NLP/ML pipelines (TF-IDF + Random Forest) and data integration in BigQuery with Tableau reporting, citing ~95–98% accuracy. Also implemented entity resolution with fuzzy matching and semantic linking using BERT sentence-transformer embeddings stored in FAISS, including fine-tuning on labeled pairs to improve search/linking relevance.

View profile
AB

Mid-level Software Engineer specializing in Data Science and Machine Learning

Arlington, Virginia4y exp
ElevateMeGeorge Washington University

Robotics/AV perception engineer who built a semantic-segmentation road detection system and integrated it into a ROS-based real-time pipeline (ROS bag camera feed to live monitor) achieving ~12 FPS. Strong in practical deployment work: solved multi-library versioning issues (ROS/OpenCV/TensorFlow), containerized the stack with Docker, and optimized inference by shifting runtime to C++ for large latency gains on NVIDIA hardware.

View profile
AA

Junior Software Engineer specializing in cloud, DevOps, and applied AI security

West Lafayette, Indiana3y exp
Freight PinsPurdue University

Founding engineer who built a multi-tenant AWS backend from scratch focused on ultra-fast, configuration-driven client onboarding and low operational cost. Automated tenant provisioning/deployments with Terraform + GitHub Actions (new client infra in ~13 minutes) and scaled to 62 production clients handling ~75k requests/day without a major rewrite. Hands-on with migrations (DynamoDB->MongoDB), reliability/observability, and performance tuning (indexes, Redis, queueing, connection management).

View profile
DP

Junior Software Engineer specializing in cloud-native microservices and applied NLP

Baltimore, MD3y exp
CVS HealthUniversity of Maryland, Baltimore County

Backend engineer who built an AI-driven "Smart Feedback Analyzer" API (Flask → FastAPI) that processes user feedback with NLP (Hugging Face + OpenAI) and returns structured insights. Demonstrates strong production-minded architecture: stateless services, Cloud Run + Docker deployment, Redis/Celery background processing, and Postgres/SQLAlchemy performance tuning (EXPLAIN ANALYZE, indexing, N+1 fixes), plus multi-tenant data isolation via JWT/API-key derived tenant IDs.

View profile
SD

Sachin Dulla

Screened

Mid-level AI/ML Engineer specializing in NLP, fraud detection, and MLOps

Kentwood, MI3y exp
Fifth Third BankCalifornia State University, San Bernardino

Built and deployed a domain-specific LLM chatbot for research/support, cutting manual effort by ~50%. Demonstrates strong applied LLM engineering: RAG, prompt grounding with citations and fallbacks, embedding/top-k tuning, and production monitoring (confidence, latency, feedback loops). Experienced orchestrating agent workflows with LangChain-style pipelines and continuous evaluation to maintain reliability.

View profile
PJ

Mid-level Data Scientist/Data Analyst specializing in ML, BI dashboards, and ETL pipelines

Dallas, TX4y exp
HumanaArizona State University

Data/ML practitioner with experience at Humana and Hexaware, focused on turning messy, semi-structured datasets into production-ready pipelines. Built an age-prediction model from book ratings using heavy feature engineering and multiple regression models, and has hands-on entity resolution (deterministic + fuzzy matching) plus embeddings/vector DB approaches for linking and search relevance.

View profile

Need someone specific?

AI Search