
Vetted Reinforcement Learning Professionals

Pre-screened and vetted.

Reinforcement Learning, Python, PyTorch, Docker, TensorFlow, SQL

Bhavagyna Vegunta

Junior Mechanical Engineer specializing in robotics, mechatronics, and test automation

2y exp
Mito Robotics | Carnegie Mellon University
Go, MATLAB, Robotics, Python, C++, ROS 2, +129 more

Varun Narra

Junior ML Engineer specializing in MLOps and real-time inference

TX, USA | 2y exp
Tesla | University of Texas at Dallas
Python, SQL, R, C++, Java, Machine Learning, +70 more

Sony Arravena

Mid-level AI/ML Engineer specializing in LLMs, RAG, and production NLP

CA, USA | 6y exp
Meta | University of Central Missouri
A/B Testing, Amazon EMR, Amazon EKS, Amazon Redshift, Amazon S3, Amazon SageMaker, +144 more

Satish Mattam

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and scalable GPU inference

Bay Area, CA | 5y exp
Perplexity | Saint Louis University
A/B Testing, Agile, Anomaly Detection, Apache Hive, Apache Kafka, Apache Spark, +165 more

Chad Kapadia

Executive Technology & Product Leader specializing in AI/ML, Cloud Platforms, and SaaS

Pleasanton, CA | 31y exp
Interior Logic Group | UC Davis
Machine Learning, Reinforcement Learning, SaaS, Cloud Computing, AWS, Microsoft Azure, +62 more

Saurabh Paul

Staff Data Scientist / AI-ML Engineer specializing in fraud detection, NLP, and recommendations

Sunnyvale, CA | 11y exp
Walmart | IIEST Shibpur
Machine Learning, Artificial Intelligence, BERT, GPT, Deep Learning, LSTM, +78 more

Richi Dubey

Mid-level Software Engineer specializing in systems, CUDA, and robotics/AI

Santa Clara, CA | 2y exp
NVIDIA | Georgia Tech
Python, C, Java, SQL, JavaScript, TypeScript, +43 more

Niteesh Singh

Mid-level AI/ML Engineer specializing in LLM training, RAG, and low-latency inference

New York City, NY | 4y exp
Perplexity | Cleveland State University
A/B Testing, Amazon EC2, Amazon EKS, Amazon S3, Apache Spark, Argo CD, +145 more

Daksh Adhar

Screened

Junior Robotics & Reinforcement Learning Engineer specializing in dexterous manipulation

Palo Alto, CA | 2y exp
1X Technologies | Carnegie Mellon University

“Robotics software engineer (master’s student) who placed 3rd in the CMU VLA challenge and presented at IROS, building an LLM-powered language system (Gemini 2.5) for mobile-robot scene Q&A and language-based navigation. Hands-on ROS1/ROS2 experience including ros2_control + PILZ planning for a KUKA arm, plus simulation (Gazebo) and containerized submissions with Docker.”

Python, C, C++, MATLAB, PyTorch, TensorFlow, +98 more

Prahlad Vivek

Screened

Intern Robotics Engineer specializing in robot learning, SLAM, and control

Wilton, CT | 3y exp
ASML | Columbia University

“Robotics architect intern/new-grad focused on warehouse AMRs, building ROS2 sensor-fusion and SLAM stacks (FastSLAM-style particle filter) and validating in Gazebo with ground-truth metrics. Also interned at ASML debugging real-time in-vacuum robot behavior via Python state-machine telemetry scripts, identifying a firmware driver issue impacting throughput.”

Python, C++, Gazebo, MATLAB, Bash, Git, +103 more

Kowshika M

Screened

Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety

Santa Clara, CA | 5y exp
NVIDIA | Oregon State University

“AI/LLM engineer with production experience at NVIDIA, where they fine-tuned and deployed a financial-services chatbot and cut latency ~50% using TensorRT + NVIDIA Triton, scaling via Docker/Kubernetes. Also has consulting experience at Accenture delivering a predictive maintenance solution for a logistics network, bridging non-technical stakeholders with actionable dashboards.”

A/B Testing, Ansible, Apache Kafka, Apache Spark, Automated Testing, AWS, +113 more

Kenil Tanna

Screened

Staff-level Machine Learning Engineer specializing in LLMs and MLOps for Financial Services

New York, NY | 7y exp
JPMorgan Chase | IIT Guwahati

“Machine learning/NLP practitioner at J.P. Morgan who led development of a production RAG system and an entity resolution pipeline for complex financial data. Deep hands-on experience with embeddings (Sentence-BERT), vector search (FAISS/pgvector), LLM fine-tuning (LoRA/PEFT), and rigorous evaluation (human-in-the-loop + A/B testing) backed by strong MLOps on AWS (Docker/Kubernetes, MLflow, Prometheus/Datadog).”

Python, R, SQL, JavaScript, REST APIs, gRPC, +124 more

Het Maheshkumar Sekhalia

Screened

Entry-level Robotics Researcher specializing in autonomy, motion planning, and control

Pittsburgh, PA | 1y exp
Komatsu | Carnegie Mellon University

“Robotics software engineer focused on simulation-first autonomy and learning: implemented TD3 and CLIP-guided pretraining for physics-based humanoid skill learning in Isaac Gym/DeepMimic. Also built a ROS2 + dual-Docker closed-loop stack for an autonomous wheel loader in Isaac Sim, combining global planning, B-spline smoothing, and real-time NMPC control.”

C, C++, Computer Vision, Deep Learning, Docker, Git, +77 more

Ziwen Shen

Screened

Junior Machine Learning Engineer specializing in computer vision, reinforcement learning, and PINNs

Remote, USA | 1y exp
Okapi Sports Intelligence | Brown University

“ML/Simulation engineer who productionized a Multi-Agent Reinforcement Learning system for 30+ firms at Belt and Road Big Data Company, integrating research code into an enterprise backend via Dockerized deployment and scalable data pipelines on GCP/Vertex AI. Demonstrated strong production debugging by tracing apparent network timeouts to hardware memory exhaustion caused by software state-history garbage collection issues, and built custom reward functions to model complex market dynamics (entry/exit, pricing).”

Python, C, C++, SQL, MATLAB, R, +71 more

Krishna Reddy

Screened

Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants

New York, NY | 6y exp
Stripe | Indiana Wesleyan University

“Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.”

Agile, Amazon Bedrock, Apache Hadoop, Apache Hive, Apache Kafka, Apache Spark, +143 more

Yuxin Xiong

Screened

Intern Machine Learning Engineer specializing in LLM reasoning, agents, and deployment

0y exp
Nexa AI | UC San Diego

“AWS AI Lab engineer who deployed a production Chain-of-Thought analytical agent for tabular reasoning, emphasizing grounded tool-constrained workflows with schema-validated intermediate outputs. Built robust evaluation/logging with step-level observability to catch regressions across model versions, and has experience scaling distributed LLM training via Slurm + DeepSpeed/FSDP with checkpointing and failure recovery.”

Large Language Models (LLMs), Model deployment, PyTorch, Reinforcement learning, Feature engineering, XGBoost, +91 more

Anagha Ram

Screened

Intern AI/ML Engineer specializing in NLP, LLMs, and semantic search

Los Altos, CA | 2y exp
Columbia University | Cornell University

“Built and deployed a production RAG-based semantic search and summarization system for large legal/technical document sets, owning the full backend (embeddings, vector store, chunking, prompting) and driving a reported 40–60% reduction in manual review time. Experienced with LangChain/LlamaIndex plus Airflow/Temporal-style orchestration, and applies rigorous evaluation/monitoring (A/B tests, drift detection, staged rollouts) to keep agentic systems reliable. Also partnered with a supply-chain manager at TE Connectivity to deliver an AI inventory recommendation tool projected to drive millions in value.”

Anomaly Detection, AWS, C, Data Structures, Django, Generative AI, +123 more

Dexin Huang

Screened

Junior AI Engineer specializing in LLM systems, RAG, and full-stack automation

Guilford, CT | 1y exp
Slothful LLC (Iris) | Columbia University

“Built and deployed an AI receptionist product for field-service businesses (HVAC/electrician), including real-time Jobber scheduling integrations and Twilio-based calling. Combines hands-on customer/operator shadowing with strong production engineering (queueing to handle API limits, rigorous testing/mocking, mirrored prod environment) and cross-layer troubleshooting, driving user adoption through review/override workflows.”

A/B Testing, Analytics, API Design, Authentication, AWS, AWS Lambda, +99 more

Rohan Punamiya

Screened

Junior Robotics Engineer specializing in robot learning, controls, and tactile sensing

Stanford, CA | 4y exp
Flexiv | Stanford University

“Robotics software engineer with Stanford coursework and Georgia Tech research experience, focused on end-to-end autonomy for mobile manipulation and real-time planning under uncertainty. Built a ROS 2 LoCoBot system combining Gemini speech-to-text, YOLO-based RGB-D perception, navigation, and grasping with robust synchronization/TF fixes, and developed an information-theoretic UGV planner for radiological source localization validated via Monte Carlo simulation.”

Gazebo, MATLAB, C, C++, Python, PyTorch, +124 more

Kyutae Sim

Junior Robotics Researcher specializing in robot learning and manipulation

Pittsburgh, PA | 2y exp
CMU Robots Perceiving and Doing Lab | Carnegie Mellon University
Artificial Intelligence, C++, Deep Learning, Machine Learning, Neural Networks, Operating Systems, +36 more

Vinnie Yerramadha

Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps

San Francisco, CA | 6y exp
Shopify | University of North Texas
Python, SQL, Bash, C, JavaScript, PHP, +173 more

Yutong Guo

Intern Machine Learning Engineer specializing in AI security and anomaly detection

Remote, CA | 2y exp
Ford | Carnegie Mellon University
Python, Java, MATLAB, SQL, NoSQL, R, +124 more