Reval Logo
Home Browse Talent Skilled in Reinforcement Learning

Vetted Reinforcement Learning Professionals

Pre-screened and vetted.

Reinforcement LearningPythonPyTorchDockerTensorFlowSQL
NA

Navyasri Arekatla

Mid-level AI/ML Engineer specializing in GenAI agents and production ML systems

Dallas, TX5y exp
PerplexityUniversity of North Texas
PythonJavaCC++MATLABBash+159
View profile
PV

Pravarsha Vantipalli

Mid-level Machine Learning Engineer specializing in MLOps and Generative AI

CA, USA5y exp
NetflixUniversity of Missouri
A/B TestingAmazon EC2Amazon EKSAmazon EMRAmazon RedshiftAmazon S3+86
View profile
DG

Devdatt Golwala

Mid-level Data Scientist/ML Engineer specializing in LLMs, NLP, and recommender systems

New York, NY3y exp
AdobeColumbia University
A/B TestingAlgorithmsAWSBashChromaDBC+81
View profile
TG

Thristha Gurajala

Mid-level AI/ML Engineer specializing in LLM, RAG, and multimodal systems

San Francisco, CA6y exp
PerplexityUniversity of Tampa
A/B TestingAmazon DynamoDBAmazon EC2Amazon EKSAmazon S3Amazon SageMaker+122
View profile
VS

Vedant Saraswat

Staff Platform/ML Engineer specializing in agentic AI, RAG, and cloud infrastructure

5y exp
Articul8 AIUC Irvine
A/B TestingAmazon SageMakerAWSBashCI/CDC+82
View profile
JK

Jasmeet Kaur

Mid-level Machine Learning Engineer specializing in Bayesian inference and reinforcement learning

Princeton, NJ5y exp
Ricovr HealthcareUniversity of Texas at Austin
C++ClassificationDeep LearningDockerGitJava+28
View profile
RG

Ramya Gurrala

Mid-level Machine Learning Engineer specializing in fraud detection and recommendations

Bay Area, CA6y exp
StripeBinghamton University
A/B TestingAgileAmazon RedshiftAmazon SageMakerAmazon S3Anomaly Detection+179
View profile
AV

Aaditya Voruganti

Screened ReferencesStrong rec.

Junior AI & Software Engineer specializing in robotics and ML infrastructure

2y exp
SamsaraUniversity of Illinois Urbana-Champaign

“Robotics engineer from UIUC’s Intelligent Motion Lab who led the perception stack for a humanoid robotic nurse, fusing camera/LiDAR/IMU on NVIDIA Jetson Orin for real-time localization and scene understanding across six robots. Deep expertise in ROS 2 and edge ML optimization (TensorRT, CUDA, zero-copy), delivering major latency/throughput gains (10 FPS to 22+ FPS) and building fault-tolerant pipelines with gRPC offloading and real-time reliability practices.”

C++PythonGoCJavaScriptPyTorch+128
View profile
JZ

Jacqueline Zhang

Screened

Mid-level Machine Learning Engineer specializing in LLMs, fairness, and healthcare ML

Illinois, USA4y exp
iSchool Statistical ML & AI LabUniversity of Illinois Urbana-Champaign

“ML/NLP practitioner with a master’s thesis focused on domain-adaptive knowledge distillation for LLMs (LLaMA2/sheared LLaMA), showing improved perplexity and ROUGE-L on biomedical data. Also built real-world data linking and search systems: integrated ClinicalTrials.gov with FAERS using fuzzy matching + embeddings, and delivered an LLM-powered FAQ recommender at Hyperledger using sentence-transformers, FAISS, and fine-tuning to mitigate embedding drift.”

A/B TestingAPI DevelopmentCI/CDComputer VisionCData Engineering+93
View profile
MZ

Muhan Zhang

Screened

Junior AI Software Engineer specializing in LLM pipelines, OCR, and RAG

Palo Alto, USA2y exp
Platflow.AICornell University

“Built and shipped a production LLM pipeline for nursing home Medicare reimbursement (PDF OCR + fact extraction + keyword RAG + QA) that reportedly increased payouts by ~$1K/month per patient. Strong in LLM ops/benchmarking (ground truth, LLM-as-judge, cost/I-O tracking) and pragmatic optimization—swapped retrieval approaches, fine-tuned a small model to cut OCR cost 90%, and migrated workloads to Azure/Temporal to scale nightly processing 10x.”

PythonJavaScriptReactRC++Java+89
View profile
SC

Shweta Chavan

Screened

Junior Computer Vision & ML Engineer specializing in autonomous perception systems

Pittsburgh, PA2y exp
Magna InternationalCarnegie Mellon University

“LLM/RAG engineer who built a production-style multi-agent orchestrator for resume-to-recommendation workflows (PDF ingestion through screening and recommendations), emphasizing prompt tuning and strict JSON output contracts. Currently building a RAG application for an NGO using Airflow (DAGs + embeddings) and tackling messy, missing/imbalanced data; has hands-on retrieval stack experience (FAISS/HNSW, bge embeddings) and uses rigorous evaluation metrics for groundedness and hallucination control.”

PythonC++OpenCVMATLABPyTorchTensorFlow+126
View profile
SR

Siddhik Reddy Kurapati

Screened

Junior Controls & Motion Planning Engineer specializing in MPC, RL, and autonomous systems

Boston, Massachusetts2y exp
Mitsubishi Electric Research LaboratoriesUniversity of Michigan

“Robotics researcher focused on learning-based navigation: builds sub-goal generation and cost-to-go models (Bayesian network-based) integrated with motion planning and MPC/NMPC control. Has hands-on ROS 2 package development across vehicles, drones, and manipulators, and uses a broad simulation stack (Isaac Sim, Gazebo, MuJoCo, PyBullet, PX4) to test and integrate systems.”

PythonC++CMATLABBashKeras+112
View profile
CW

Chinmayee Wamorkar

Screened

Mid-level Robotics & Autonomy Engineer specializing in MPC, RL, and GPU-accelerated optimization

4y exp
Georgia Institute of TechnologyUC Berkeley

“Robotics software engineer from Ati Motors who brought a Linear MPC approach (based on Kuhne et al.) into production, rebuilding parts of the planning stack to eliminate oscillations and safely double AMR speed from 0.8 m/s to 1.6 m/s. Also delivered an end-to-end point-cloud detection pipeline (PointPillars) including synthetic data generation in Isaac Sim and TensorRT deployment for real-time human/trolley detection, with a strong focus on production reliability via iterative hardening and nightly SIL.”

Artificial IntelligenceC#C++CI/CDCUDAData Analysis+106
View profile
GK

Gurnoor Kaur

Screened

Intern Robotics Software Engineer specializing in motion planning and robot perception

1y exp
AmazonUniversity of Michigan

“Robotics software engineer with Amazon Robotics internship experience who built a visual-servoing architecture from scratch, navigating multiple simulator pivots to achieve a closed-loop motion-planning and execution prototype. Currently working with ROS 2 on a medical assistive feeding robot using the Kinova Kortex platform (MoveIt2, ros2_control, Gazebo/RViz), and has demonstrated strong real-time debugging and distributed-system synchronization using Carbon and Docker.”

Artificial IntelligenceBackend DevelopmentC#C++CUDAData Analysis+85
View profile
AA

Ankit Aggarwal

Screened

Intern Robotics Engineer specializing in ROS, motion planning, and embedded systems

Pittsburgh, PA1y exp
Carnegie Mellon UniversityCarnegie Mellon University

“Robotics software engineer who delivered the Lunar ROADSTER—an autonomous bulldozing rover for lunar terrain manipulation—building the control system, path planning, and perception in ROS 2. Implemented crater detection using a YOLO model fused with ZED stereo depth to recover crater geometry, and structured autonomy around ROS 2 actions integrated into an FSM with CI/CD-backed system testing. Also has industrial robotics experience controlling a Fanuc arm for additive manufacturing and building ROS interfaces for PLC I/O.”

C++DockerGazeboGitLinuxMATLAB+121
View profile
CD

Chris Du

Screened

Intern Full-Stack Software Engineer specializing in web apps and AI systems

Mountain View, CA0y exp
BoschCarnegie Mellon University

“Product/UX designer who builds end-to-end systems across both consumer wellness and industrial/technical domains. Designed BloomPath (mental-wellness platform for therapists and young professionals) using research-driven, emotionally safe interaction patterns, and also simplified a Bosch autonomous parking vision-language mapping pipeline into a developer-facing real-time UI with layered debug tooling. Comfortable collaborating deeply with engineers and contributing in React/JS.”

ReactReact NativeNext.jsNode.jsExpressRedux+98
View profile
VM

Vishal Mittal

Screened

Director-level Engineering Manager specializing in cloud security platforms and AI-driven automation

Fremont, CA18y exp
Palo Alto NetworksStanford University

“Senior engineering leader in the Bay Area with experience spanning VMware, Hortonworks/Cloudera, Barracuda, and Palo Alto Networks, including leading open-source work (Apache Knox) and architecting large-scale security platforms. Has driven disaster recovery and cloud security products, designed Python microservices for Microsoft 365 security, and scaled teams (3x) while formalizing enterprise readiness practices with automated documentation using Notebook LLM.”

Team leadershipAgileRisk managementCross-functional collaborationStakeholder managementQuality assurance+189
View profile
MM

Mason McBride

Screened

Junior Software Engineer specializing in AI, game theory, and blockchain protocols

Los Angeles, CA2y exp
All In BitsUC Berkeley

“Backend engineer who built gnocal, a ~150-line stateless Go service that turns on-chain event data into standards-compliant .ics calendar feeds consumable by Apple/Google Calendar, deployed on Fly.io. Also refactored MCTS into Monte Carlo Graph Search (Python-to-Rust) using deterministic tests and state canonicalization to handle transpositions, and implemented decentralized role-based ACLs in Gno for a smart-contract web hosting network (gno.land / All in Bits).”

PythonGoCCUDAMachine LearningLarge Language Models (LLMs)+111
View profile
DL

Daniel Luzzatto

Screened

Junior Machine Learning Engineer specializing in LLMs, computer vision, and robotics

Tirat Carmel, Israel1y exp
FusmobileUCLA

“Built and deployed an agentic, multimodal LLM system that automates privacy redaction pipelines (audio/video/tabular) using LangChain orchestration and a closed-loop self-correction design. Personally implemented and performance-optimized core CV tooling (face blurring with tracking/Kalman filter) achieving >100 FPS on CPU, and validated reliability with golden-dataset benchmarking across 100+ privacy intents and measurable redaction metrics.”

Machine LearningDeep LearningReinforcement LearningTransformersLarge Language Models (LLMs)Computer Vision+102
View profile
AK

Anirudh Kunduru

Mid-level Machine Learning Engineer specializing in deep learning, MLOps, and real-time inference

CA, USA5y exp
NetflixUniversity of Central Missouri
A/B TestingAmazon EC2Amazon EKSAmazon EMRAmazon RedshiftAmazon S3+86
View profile
TY

Taehoon Yang

Junior Robotics & AI Engineer specializing in autonomous navigation and embodied learning

Stanford, CA1y exp
Stanford UniversityStanford University
PythonCC++C#MATLABPyTorch+45
View profile
AS

Aryamaan Saha

Intern AI/ML Engineer specializing in LLM systems and cloud-native microservices

New York, NY1y exp
Solstice HealthColumbia University
PythonCC++GoJavaScriptReact+62
View profile
SP

Sreeharsha Paruchuri

Intern Perception/Robotics Engineer specializing in computer vision and embodied AI

San Francisco, CA5y exp
Mach9Carnegie Mellon University
PythonC++CUDAROS 2DockerCI/CD+87
View profile
1...678...44

Related

Machine Learning EngineersResearch AssistantsSoftware EngineersData ScientistsTeaching AssistantsAI EngineersAI & Machine LearningEngineeringEducationData & Analytics

Need someone specific?

AI Search