Reval Logo
Home Browse Talent Skilled in Data Preprocessing

Vetted Data Preprocessing Professionals

Pre-screened and vetted.

Data PreprocessingPythonDockerSQLAWSTensorFlow
AG

Akshit Gaur

Mid-level AI Engineer specializing in LLM agents, evaluation pipelines, and microservices

Mountain View, CA4y exp
Carnegie Mellon UniversityCarnegie Mellon University
AgileBashCI/CDC++Data GovernanceData Preprocessing+57
View profile
JC

Jacob Colombo

Intern AI Engineer specializing in agentic RAG systems and computer vision

Orlando, FL2y exp
Hibiscus HealthCornell University
ChromaDBComputer VisionData PreprocessingData StructuresDashboard DevelopmentJava+43
View profile
AN

Anuj Naik

Mid-level AI/ML Engineer specializing in production ML, LLMs, and MLOps

Remote, USA4y exp
StripeCalifornia State University
Amazon EC2Amazon S3AWSAnomaly DetectionAPI DevelopmentCI/CD+67
View profile
HL

Haochen LI

Intern Machine Learning Engineer specializing in Generative AI and LLM systems

Hong Kong, China1y exp
Blue InsuranceDuke University
Amazon SageMakerApache HadoopApache SparkAPI IntegrationArtificial IntelligenceAWS+61
View profile
PM

Prathamesh matte

Mid-level Full-Stack Software Engineer specializing in React, Node.js, and cloud deployments

USA4y exp
JPMorgan ChaseUniversity of Massachusetts Dartmouth
PythonJavaC++CJavaScriptTypeScript+83
View profile
AL

Andrew Long

Senior Full-Stack & AI Engineer specializing in scalable ML systems

Seattle, WA11y exp
Evertune AIUniversity of Illinois Chicago
AWSAWS LambdaAzure FunctionsCI/CDC#D3.js+55
View profile
HO

Hiroaki Oshima

Mid-level Machine Learning & Data Engineer specializing in MLOps and cloud data platforms

San Francisco, CA4y exp
Blue River TechnologyUC Berkeley
Apache SparkAWS GlueCI/CDContainerizationData EngineeringData Preprocessing+64
View profile
YW

Yepu Wang

Mid-level Software Engineer specializing in Python, cloud tooling, and NLP/RAG systems

Seattle, WA3y exp
Amazon Web ServicesGeorgia Tech
AngularJSAPI DesignAuthenticationC++CI/CDCSS+35
View profile
TZ

Tianhao Zang

Intern Machine Learning Engineer specializing in NLP and LLM/RAG systems

Los Angeles, CA1y exp
MeituanUCLA
A/B TestingAWSBERTCachingCC#+79
View profile
TS

Tyler Swanson

Senior Data Engineer specializing in healthcare ETL/ELT and ML

Pasadena, CA12y exp
Doheny Eye InstituteUniversity of Texas at Austin
Amazon EC2Amazon KinesisAmazon RedshiftAmazon S3Apache AirflowApache Kafka+128
View profile
AD

Andrew Dyer

Senior AI/Machine Learning Engineer specializing in RAG and MLOps

Odessa, TX8y exp
DataRobotJohns Hopkins University
A/B TestingAgileApache KafkaApache SparkAWSCI/CD+36
View profile
RS

Rohith Sadanala

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and MLOps

Missouri, USA3y exp
AirbnbUniversity of South Florida

“LLM/agent engineer who has shipped production RAG chatbots in sustainability-focused domains, including a packaging recommendation assistant that standardized messy user inputs and used Pinecone-backed retrieval over product/regulatory data. Experienced orchestrating end-to-end ML workflows with Airflow and AWS Step Functions/Lambda, emphasizing reliability (property-based testing, circuit breakers, OpenTelemetry) and measurable performance (latency/cost). Partnered closely with non-technical leadership to ship 3 weeks early, driving adoption by 150+ businesses and ~20% reported waste reduction.”

A/B TestingAmazon BedrockAmazon EC2Amazon EKSAmazon RDSAmazon S3+154
View profile
BP

Byron Pineda

Screened

Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps

Pascagoula, MS10y exp
TuringMississippi State University

“Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.”

PythonRSQLPandasNumPyScikit-learn+132
View profile
RR

Rushi Reddy Lambu

Screened

Mid-level AI/ML Engineer specializing in Generative AI and MLOps

Remote, USA5y exp
McKinsey & CompanyUniversity of North Texas

“GenAI/LLM engineer and architect who built and deployed a production generative AI financial forecasting and scenario analysis platform at McKinsey, leveraging Claude (Anthropic), LangChain, Airflow, MLflow, and AWS SageMaker. Demonstrates strong LLMOps/MLOps rigor (monitoring, drift detection, automated retraining) and deep experience implementing global privacy controls (GDPR, differential privacy, audit trails) while partnering closely with finance executives and legal/IT stakeholders.”

PythonSQLRJavaC++Bash+192
View profile
SD

Sarath Dunga

Screened

Mid-level Full-Stack Developer specializing in cloud microservices and AI/ML integration

Remote, USA4y exp
eBayArizona State University

“Full-stack engineer (~3 years) with eBay production experience building and operating high-scale, event-driven Python microservices for order processing and AI-powered recommendations (Kafka/Redis/FastAPI on AWS with Prometheus/Grafana). Also delivered polished React+TypeScript analytics dashboards and designed high-concurrency PostgreSQL schemas with significant latency reductions. Recently built AI-agent orchestration and an interactive node-based requirements dashboard for Siemens Polarion via MCP servers, improving user interaction by ~17.8%+.”

Anomaly detectionAuthenticationAuthorizationAWSAWS CodePipelineAWS Lambda+183
View profile
RV

Rucha Visal

Screened

Mid-Level Software Development Engineer specializing in distributed systems and full-stack web apps

Seattle, USA4y exp
AmazonUniversity of North Carolina at Charlotte

“Software engineer who owned customer-facing, high-traffic TypeScript/React + TypeScript backend systems end-to-end, emphasizing safe velocity through feature flags, staged rollouts, observability, and rollback-ready incremental delivery. Reports shipping more frequently with fewer production incidents and faster recovery due to these guardrails.”

JavaPythonJavaScriptTypeScriptGoC+79
View profile
SS

Sumanth Salluri

Screened

Mid-level Business Data Analyst specializing in Financial Services and Healthcare analytics

USA4y exp
VisaGeorge Mason University

“Full-stack engineer (~4 years) who has owned and shipped customer-facing SaaS onboarding and a role-based real-time analytics dashboard using TypeScript/React with a modular backend. Experienced in microservices with RabbitMQ and strong observability practices (correlation IDs, structured logging, queue metrics), and built an internal deployment tracker integrated with CI/CD that replaced manual spreadsheet/Slack processes.”

PythonSQLRHTMLCSSJavaScript+118
View profile
NV

Nikita Vivek Kolhe

Screened

Junior Data & Machine Learning Engineer specializing in MLOps and NLP

Los Angeles, United States1y exp
WorkUpUSC

“ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).”

PythonSQLCC++RMATLAB+90
View profile
ZI

Zufeshan Imran

Screened

Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision

San Diego, CA10y exp
SOTER AIUC San Diego

“Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.”

Machine LearningDeep LearningGenerative AITransformersLarge Language Models (LLMs)LLM fine-tuning+120
View profile
PK

Piyush Kautkar

Screened

Junior Software Engineer specializing in full-stack systems and distributed log analytics

Miami, FL1y exp
NeocisCarnegie Mellon University

“CMU candidate with hands-on experience taking LLM concepts from research prototypes toward production-ready designs (structured outputs, guardrails, failure-scenario evaluation). Also partnered with sales/customer teams at Mazecare to drive adoption with Dontia Alliance (largest dental clinic chain in Singapore) and engaged Singapore government stakeholders, bridging clinical workflow needs with IT security/integration concerns.”

AgileAnalyticsAnomaly DetectionAuthenticationAWSC+++190
View profile
HC

Harsh Chaudhari

Screened

Intern Software Engineer specializing in ML/NLP and LLM applications

Boulder, CO0y exp
SplunkUniversity of Colorado Boulder

“Full-stack AI/LLM engineer who has deployed a production LLM backend (Mistral 14B) on GKE to auto-transform datasets and generate runnable ML training pipelines, addressing hallucinations, schema mismatch, latency, and burst scaling with caching/prompt compression and HPA. Also has internship experience (Splunk, BlackOffer) delivering data automation and 10+ Power BI dashboards for non-technical stakeholders with measurable efficiency gains.”

C++Data PipelinesData PreprocessingDockerEmbeddingsFAISS+70
View profile
RM

Rakesh Munaga

Screened

Mid-level Full-Stack Engineer specializing in AI and FinTech platforms

TX, USA4y exp
JPMorgan ChaseUniversity of Texas at Arlington

“Full-stack engineer building real-time internal banking operations dashboards (Java/Spring Boot microservices + React/TypeScript) with Kafka-based streaming and post-launch performance optimizations. Also shipped a production internal AI support assistant using RAG (Confluence/PDF/support docs ingestion, embeddings + vector DB retrieval) with guardrails, evaluation loops, and observability to reduce hallucinations and prevent regressions.”

Amazon API GatewayAmazon CloudWatchAmazon EC2Amazon RDSAmazon S3Amazon SNS+132
View profile
BD

Bhargav Diyora

Screened

Mid-level Full-Stack Software Engineer specializing in FinTech microservices

California, USA4y exp
PayPalCalifornia State University, Long Beach

“Robotics software engineer who has built end-to-end pipelines spanning backend/data processing through model interfaces and hardware integration. Has hands-on ROS2 experience building Python nodes and debugging real-time behavior via profiling, publish-rate tuning, and latency fixes, plus experience standardizing multi-robot communication with QoS adjustments. Uses Gazebo simulation and Docker/CI/CD to catch integration issues early and speed iteration.”

JavaJavaScriptTypeScriptPythonC#SQL+161
View profile
NN

Niyaz Nurbhasha

Screened

Mid-level Machine Learning Engineer specializing in computer vision and LLM pipelines

4y exp
BlueHaloDuke University

“ML/LLM engineer who built production systems to speed up artist content-creation workflows, including a fine-tuned image captioning model paired with a RAG layer over image embeddings/captions to improve consistency across changing domains. Experienced orchestrating multi-tool agents with LangChain/LangGraph (planning + critic/reflection) and setting up practical monitoring (caption rejection rate) plus evaluation sets for tool-calling accuracy, output quality, and latency.”

PythonC++SQLJavaScriptTypeScriptPyTorch+75
View profile
1...345...37

Related

Machine Learning EngineersSoftware EngineersData ScientistsResearch AssistantsAI EngineersSoftware DevelopersAI & Machine LearningEngineeringData & AnalyticsEducation

Need someone specific?

AI Search