Reval Logo

Vetted Data Engineering Professionals

Pre-screened and vetted.

MG

Marc Greenberg

Screened ReferencesStrong rec.

Executive Technology Leader (CTO) specializing in cloud architecture, distributed systems, and AI

Delray Beach, FL25y exp
Beyond Limited Thinking, LLCSyracuse University

Bootstrapped founder/fractional-CTO type building a community-first tooling platform and niche product lab, emphasizing trust and "family" culture. Operates extremely lean by self-hosting open-source LLM tooling locally and leverages a network of senior generalists willing to trade short-term compensation for learning and equity. Has worked across real estate tech, logistics, and hospitality POS and is applying a multi-modal "brain trails" concept to product opportunities.

View profile
CR

Mid-level Machine Learning Engineer specializing in MLOps and production ML systems

TX, USA5y exp
CignaUniversity of North Texas
View profile
SL

Mid-level AI/ML Engineer specializing in generative AI and MLOps

Remote, USA5y exp
MizuhoAuburn University at Montgomery
View profile
SR

Mid-level Data Scientist specializing in GenAI, NLP, and MLOps

USA5y exp
State StreetUniversity of Texas at Dallas
View profile
SB

Sai Bandaru

Screened

Mid-level Machine Learning Engineer specializing in fraud detection and LLM systems

Boston, MA6y exp
FiVerityNortheastern University

At FiVerity, built and deployed a production LLM/RAG-based Information Gathering Tool for credit union fraud analysts that generates auditable investigation summaries from verified evidence. Focused on high-stakes constraints—hallucination prevention, cross-entity leakage controls, compliance/PII-safe monitoring, and latency—while also shipping customer-facing agentic workflows using CrewAI and LangGraph in close partnership with fraud and compliance stakeholders.

View profile
AM

Junior AI/ML Engineer specializing in LLMs, RAG, and information retrieval

Boston, MA2y exp
University of Massachusetts AmherstUniversity of Massachusetts Amherst

Internship experience shipping production AI systems: built an end-to-end RAG platform (Python/FastAPI + LangChain/LangGraph + vector search) to answer support questions from unstructured internal docs, with a strong focus on hallucination prevention through confidence gating and rigorous offline/online evaluation. Also delivered an AI-driven personalization/analytics feature using an unsupervised clustering pipeline, iterating with PMs to align statistically strong clusters with actionable business segmentation.

View profile
AS

Senior DevOps/Solutions Engineer specializing in CI/CD, cloud platforms, and API integrations

San Francisco, California11y exp
SpiderOakSan Francisco State University

Solutions Architect with 5+ years leading pre- and post-sales engagements, focused on taking complex tooling from test/prototype to secure production through a structured discovery-to-deployment approach. Experienced in LLM workflow troubleshooting using tools like Langfuse/Gopher and in developer enablement via concise, hands-on workshops (e.g., Jenkins on Kubernetes at scale). Has navigated internal and external blockers to drive adoption and keep enterprise deals moving (including a Jenkins sale to Love's).

View profile
YA

Yashi Agarwal

Screened

Mid-level Machine Learning Engineer specializing in NLP, Generative AI, and RAG systems

Los Angeles, CA4y exp
KaiyrosCalifornia State University, East Bay

Built and deployed a production LLM-powered phone assistant for a healthcare clinic, combining streaming STT/TTS with RAG over approved clinic documents and strict safety guardrails to prevent unverified medical advice, plus seamless human handoff. Also has hands-on Apache Airflow experience building robust daily ML/data pipelines with data validation, retries/timeouts, monitoring, and metric-gated model deployment, and iterates closely with clinic staff using real call reviews.

View profile
MY

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps

USA4y exp
State StreetWebster University

Built and deployed a production RAG system for financial/compliance teams using GPT-4, Claude, and local models to retrieve and summarize thousands of internal documents with strong security controls (role-based retrieval, PII masking). Drove significant operational gains (30+ hours/week saved, ~35% productivity lift, ~45% faster responses) and orchestrated end-to-end ingestion/embedding/index refresh pipelines with Airflow, S3, and SageMaker while partnering closely with compliance stakeholders on auditability and traceability.

View profile
SP

sai Pavan

Screened

Mid-level AI/ML Engineer specializing in MLOps, NLP, and real-time ML pipelines

5y exp
American Family InsuranceGeorge Mason University

Built a production, real-time insurance claims document-understanding and fraud-detection pipeline using TensorFlow + fine-tuned BERT, deployed on AWS (SageMaker/Lambda/API Gateway) with automated retraining via MLflow and Jenkins. Addressed noisy documents and latency using augmentation and model distillation (3x faster), cutting claims ops manual review by ~50% and reducing fraudulent payouts.

View profile
RR

Rajeev Reddy

Screened

Mid-level AI/ML Engineer specializing in NLP and production ML on cloud

4y exp
The HartfordFlorida Atlantic University

ML engineer/data scientist who deployed a production credit risk + insurance claims triage platform at Hartford Financial, combining XGBoost default prediction with BERT-based document classification. Demonstrated strong MLOps by cutting inference latency to sub-500ms and building drift monitoring plus automated retraining/deployment pipelines (MLflow, CloudWatch, GitHub Actions, SageMaker) with human-in-the-loop review and SHAP-based explainability for underwriting adoption.

View profile
DG

Mid-level Data Scientist specializing in cloud ML, MLOps, and predictive analytics

Dallas, TX4y exp
UnitedHealth GroupJawaharlal Nehru Technological University, Hyderabad

NLP/ML engineer with hands-on healthcare and support-ticket text experience, building clinical-note structuring and semantic linking systems using spaCy, BERT clinical embeddings, and FAISS. Emphasizes production-grade delivery (Airflow/Databricks, PySpark, Docker, AWS/FastAPI/Lambda) and rigorous validation via clinician-labeled datasets, retrieval metrics, and user feedback.

View profile
DR

Junior AI/ML Engineer specializing in LLM agents and RAG systems

Boston, MA2y exp
Humanitarians.AINortheastern University

Backend/data engineer who built a production-ready multi-agent financial intelligence system (Mycroft) that orchestrates specialized AI agents to analyze real-time market data using FastAPI and Pinecone vector search. Brings strong security/reliability instincts (rate limiting, JWT/OAuth2, retries/backoff, health checks) and has caught high-impact data integrity issues in financial migrations (timezone normalization across global legacy systems).

View profile
SC

Sai Charan C

Screened

Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal AI on AWS

CT, USA3y exp
HCLTechUniversity of New Haven

Built and deployed a production RAG-based enterprise document intelligence platform for financial/compliance/operational documents on AWS (Spark/Glue ingestion, embeddings + vector DB, LangChain orchestration, REST APIs on Docker/Kubernetes). Deep hands-on experience orchestrating multi-step and multi-agent LLM workflows (LangChain, LangGraph, CrewAI) with strong focus on grounding, evaluation, observability, and cost/latency optimization, and has partnered closely with non-technical finance/compliance teams to drive adoption.

View profile
AP

Mid-level AI/ML Software Engineer specializing in data pipelines, BI dashboards, and computer vision

Wichita, Kansas3y exp
Friends UniversityFriends University

Graduate Assistant Intern at Friends University who built and deployed a GenAI-driven requirement understanding system that automates extraction and semantic grouping of technical requirements from large unstructured documents. Demonstrates strong LLM engineering rigor (golden datasets, regression testing, post-processing validation) and production-minded delivery using LangChain/LlamaIndex orchestration, FastAPI microservices, Docker, and cloud deployment.

View profile
AD

Junior Full-Stack Software Engineer specializing in React, Kubernetes, and AI-powered apps

Scottsdale, Arizona2y exp
onsemiArizona State University

Backend/DevOps-leaning engineer managing multiple customer service platforms end-to-end (requirements through deployment). Built an in-house Python monitoring/alerting solution for Salesforce-to-Java contact sync jobs (Snowflake dependencies) that increased uptime ~60%, and helped modernize delivery by moving the team from manual releases to automated Jenkins-based deployments while coordinating an Oracle EBS→Fusion transition with business/data/IT stakeholders.

View profile
SS

Swati Swati

Screened

Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps

Florida, United States5y exp
Voltihost LLCStony Brook University

AI software engineer with experience spanning LLM/RAG production systems and regulated fintech infrastructure. Built an end-to-end natural-language-to-SQL analytics assistant (Weaviate + GPT-4 + Supabase) shipped as an API with 92% accuracy and major time savings for non-technical users, and also owned demand-forecasting and CI/CD/containerization improvements for a Bank of America core banking deployment at Infosys.

View profile
KK

Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and cloud MLOps

California, USA6y exp
CVS HealthCleveland State University

Built and deployed a production LLM/RAG system at CVS to automate clinical documents, addressing PHI compliance, retrieval accuracy, and latency; achieved a 35–40% reduction in review effort through chunking and FP16/INT8 optimization. Also has experience translating AI outputs into actionable insights for non-technical stakeholders (sports analysts).

View profile
NS

Senior Data Scientist specializing in healthcare ML, LLMs, and responsible AI

Morris Plains, NJ4y exp
CignaUniversity at Buffalo

Clinical data scientist who has built an agentic LLM-powered literature review assistant (with RAG-style storage/retrieval) to identify predictors for downstream predictive modeling. Also delivered a patient-focused progression analysis model using Databricks + Airflow orchestration, partnering closely with clinicians to define targets and validate that model insights aligned with clinical expectations.

View profile
AS

Mid-level GenAI & Data Engineer specializing in agentic AI systems and AWS Bedrock

Fort Mill, SC4y exp
OneData Software SolutionsNortheastern University

At onedata, built and deployed an LLM-powered, multi-agent analytics platform on AWS Bedrock that lets users create Amazon QuickSight dashboards through natural-language conversation, cutting dashboard build time from ~30 minutes to ~5 minutes. Strong in production concerns (observability, token/cost tracking, model tradeoffs) and in bridging business + technical work, owning pre-sales pitching through delivery with an engineering management background focused on AI product management.

View profile
HP

Mid-level AI/ML Engineer specializing in fraud detection and healthcare predictive analytics

Reston, VA4y exp
TruistUniversity of Central Missouri

ML/AI engineer with production experience in high-scale banking fraud detection at Truist, building an end-to-end pipeline (Airflow/AWS Glue/Snowflake, PyTorch/sklearn) with automated retraining and Kubernetes-based deployment; delivered measurable gains (22% fewer false positives, 15% higher recall) and reduced manual ops ~40%. Also partnered with clinicians at Kellton to deploy an LLM system for summarizing/classifying clinical notes, improving review time and decision speed.

View profile
AC

Principal Data Scientist specializing in cybersecurity ML and MLOps

New York, NY15y exp
Beyond IdentityIowa State University

ML/NLP engineer (Beyond Identity) who built production semantic search and entity-resolution systems over internal security documentation, using LDA + BERT embeddings with FAISS/Pinecone to cut search time by 30%. Also scaled a real-time anomaly detection pipeline to millions of events/day with Spark and AWS Lambda, with strong emphasis on measurable validation (Precision@k, MRR, F1, ARI).

View profile
NC

Nikhil Chagi

Screened

Intern Data Analyst specializing in data pipelines and LLM/RAG applications

San Francisco, CA1y exp
CignaUniversity of North Texas

Built and deployed LLM-powered analytics and reporting systems, including a RAG-based assistant over Snowflake that let business users ask questions in plain English instead of writing SQL. Experienced orchestrating LLM agents (LangChain) and serverless reporting pipelines (AWS Lambda/S3/RDS), with a strong focus on grounded outputs, monitoring/evaluation, and data quality—used daily by non-technical finance and operations teams at Cigna.

View profile
DK

Dhruv Kumar

Screened

Senior Backend Developer specializing in Python and AWS cloud-native systems

New York, NY9y exp
SuperblocksUniversity of California

Backend/data engineer with production experience building Python FastAPI services and AWS-native data pipelines. Has delivered containerized and serverless workloads (ECS/EKS/Lambda) with Terraform-based IaC, strong reliability patterns (JWT/RBAC, retries/circuit breakers, observability), and AWS Glue ETL into S3/Redshift. Demonstrated measurable SQL performance wins (40–50s to <4s) and owned real pipeline incidents through detection, mitigation, and prevention.

View profile

Need someone specific?

AI Search