
Vetted Reinforcement Learning Professionals

Pre-screened and vetted.


Sreehari Premkumar

Junior Robotics & Controls Engineer specializing in state estimation, simulation, and ROS2

Tucson, AZ · 1y exp
PTC · Northeastern University
Robotics, Testing, Computer Vision, Deep Learning, Reinforcement Learning, C++, +65 more

Jangama Reddy

Mid-level AI/ML Engineer specializing in Generative AI, LLMs, and RAG for financial services

Hyattsville, MD · 4y exp
Morgan Stanley · University of Maryland, College Park
A/B Testing, Agile, Amazon S3, Amazon SageMaker, Apache Airflow, AWS, +124 more

Supanat Khaodhiar

Intern-level Software Engineer specializing in Machine Learning and Full-Stack Web Development

Houston, TX · 1y exp
SCB X · Rice University
Python, Java, JavaScript, TypeScript, C, C++, +82 more

Baban Hamesalh

Mid-level Full-Stack Engineer specializing in Python, FastAPI, and cloud-native systems

San Diego, CA · 4y exp
Dynata · Georgia Tech
Python, FastAPI, API Development, Backend Development, Microservices, SQL, +46 more

Miguel Saldana

Senior AI/ML Engineer specializing in GenAI, MLOps, and healthcare analytics

Chicago, IL · 13y exp
Wezom · Rice University
A/B Testing, Agile, Amazon ECS, Amazon EKS, Amazon Redshift, Anomaly Detection, +359 more

Jarred Bultema

Principal Data Scientist specializing in AI/ML forecasting and MLOps

Fort Collins, CO · 14y exp
Hasbro · Galvanize
Machine Learning, Artificial Intelligence, Forecasting, Time Series Forecasting, Predictive Modeling, Deep Learning, +108 more

Saiteja Reddy

Mid-level AI/ML Engineer specializing in forecasting, MLOps, and generative AI

Remote, USA · 3y exp
Fisher Investments · University of Missouri-Kansas City
A/B Testing, Amazon Bedrock, Amazon EKS, Amazon Kinesis, Amazon S3, AWS, +107 more

Jimmy Smith

Principal Data Scientist specializing in LLMs, RAG, and enterprise AI products

Winchester, TN · 9y exp
SambaNova · Sewanee: The University of the South
Agile, Apache Hadoop, Apache Kafka, Apache Spark, AWS, BERT, +125 more

Jainum Sanghavi

Screened

Mid-level DevOps Engineer specializing in cloud automation and Kubernetes platforms

Boston, MA · 2y exp
Northeastern University · Northeastern University

“Robotics/ML engineer who has built SO(3)-equivariant models for robotic manipulation, including custom equivariant layers and differentiable point-cloud rasterization/derasterization workflows. Also brings 2 years of DevOps experience in banking systems, automating CI/CD and infrastructure at scale (managed 180 OCI servers; reduced rebuild downtime by 80%).”

Python, C++, Java, C, TypeScript, Go, +88 more

Saffinah Shi

Screened

Junior Software Engineer specializing in full-stack, cloud serverless, and AI systems

Los Angeles, US · 2y exp
CoreSpeed · Northwestern University

“SDE who worked on an MGICS Lab robotics project building a multi-agent model to help agents understand tasks and generate robot instructions, emphasizing task-splitting, checking, and a reflection agent to improve accuracy. Also has experience using GitHub with automated CI/CD pipelines.”

API Gateway, AWS, AWS IAM, AWS Lambda, BERT, CI/CD, +133 more

Ibrahim Kurban Ozaslan

Screened

Junior Robotics & Controls Researcher specializing in optimization, MPC, and reinforcement learning

Los Angeles, CA
University of Southern California · USC

“Robotics software candidate who designed and implemented a hierarchical motion-planning and whole-body control pipeline for a 37-state Spot robot to traverse complex terrain, using Graph of Convex Sets for safe footstep selection plus optimization-based IK and nonlinear trajectory optimization for joint trajectories/contact forces. Strong in optimization-heavy robotics workflows (PyDrake, MATLAB/Simulink) and methodical debugging down to signal-level and numerical stability; has not used ROS/ROS2 yet.”

Python, MATLAB, C++, PyTorch, Reinforcement learning, LSTM, +68 more

Chia-En Lu

Screened

Junior AI/ML Systems Engineer specializing in LLM infrastructure and distributed training

1y exp
GenseeAI · UC San Diego

“Built and shipped a production NMT system translating medical documentation for a rare/low-resource language, tackling data scarcity with retrieval-driven pattern matching plus dictionary/grammar- and LLM-based augmentation and validating quality with a linguistic expert. Also develops agentic LLM workflows with LangChain/LangGraph (including a deep-research style system) and has experience aligning medical AI deployments with clinician-defined risk metrics and human-in-the-loop decision making.”

API Development, AWS, C, C++, CI/CD, CUDA, +86 more

Maggie vonEbers

Screened

Mid-level Research Engineer specializing in machine learning and computational neuroscience

3y exp
Dell Technologies · University of Texas at Austin

“Master’s-level ML researcher with hands-on embodied/edge deployment experience: built a Google Glass motion-tracking system at Sandia using MobileNetV1 + LSTM trained in TensorFlow and deployed via TensorFlow Lite. Has reimplemented transformer-based research for a thesis and demonstrated strong judgment adapting quickly when upstream assumptions changed, and stays current through active reading groups and a JEPA collaboration.”

C++, Data Pipelines, Deep Learning, Neural Networks, SQL, LSTM, +39 more

Ali Rahmati

Screened

Senior Machine Learning Engineer specializing in optimization, LLMs, and on-device AI

Santa Clara, CA · 9y exp
Qualcomm · North Carolina State University

“Engineer with hands-on experience debugging and hardening a fixed-point implementation for an internal PoC, quickly diagnosing overflow/underflow issues that caused intermittent failures across thousands of runs and delivering a code fix. Comfortable presenting technical solutions with layered slide depth and doing follow-up deep dives for interested stakeholders, though has limited direct customer/sales partnership experience.”

C, C++, Computer Vision, Deep Learning, Docker, Flask, +67 more

Santhosh Reddy

Screened

Mid-level AI/ML Engineer specializing in deep learning, NLP/LLMs, and MLOps

MA, USA · 6y exp
Flatiron Health · Clark University

“Built and shipped a real-time oncology risk prediction system used by doctors during patient visits, trained on clinical data in AWS SageMaker and deployed via FastAPI with sub-second responses. Emphasizes clinician-trust features (SHAP explainability, validation checks) and HIPAA-compliant controls (encryption, RBAC, audit logging), plus Kubernetes-based production operations with autoscaling, monitoring, and drift/retraining workflows; collaborated closely with oncologists at Flatiron Health.”

Python, R, SQL, Java, C++, Bash, +123 more

Bharath kumar

Screened

Director-level AI & Data Science leader specializing in GenAI, LLMs, and MLOps

Draper, UT · 12y exp
Thorne · Bharathiar University

“ML/NLP engineer currently working in NYC on a system that connects complex unstructured data sources to deliver personalized insights, using embeddings + vector DB retrieval and a RAG architecture (LangChain, Pinecone/OpenSearch). Strong focus on production constraints—especially low-latency retrieval—using FAISS/ANN, PCA, index partitioning, and Redis caching, plus PEFT fine-tuning (LoRA/QLoRA) and KPI/SLA-driven promotion to production.”

A/B Testing, API Development, API Testing, Apache Hadoop, Apache Hive, Apache Kafka, +251 more

Christopher Song

Screened

Junior AI/ML Engineer specializing in real-time computer vision and tracking systems

2y exp
Credence Management Solutions · University of Maryland, College Park

“Full-stack engineer who built and owned a production real-time computer-vision inference platform at Credence, spanning Next.js App Router/TypeScript frontend with SSE/WebSocket streaming, a Flask backend, and Postgres analytics. Demonstrated measurable performance wins (70% fewer re-renders; latency cut to ~40–50ms) and strong production rigor (durable orchestration, idempotency, observability, AWS EC2 + CI/CD) with tight post-launch UX iteration based on analyst feedback.”

Python, Java, C, C++, Kotlin, JavaScript, +87 more

Dharmik Bhingradiya

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps on AWS

TX, USA · 5y exp
BlackRock · Texas A&M University-Kingsville

“AI engineer who built a production RAG-based internal analyst tool at BlackRock, fine-tuning an LLM on proprietary financial data and adding four layers of guardrails (input/retrieval/generation/output) to improve grounding and reduce hallucinations. Implemented a LangChain-based multi-agent orchestration (7 major agents) deployed on AWS ECS, with reliability measured via internal human evaluation, LLM-as-judge, and RLHF/drift monitoring.”

Python, SQL, R, Java, C++, Machine Learning, +90 more

Rowan Ramamurthy

Screened

Mid-level Robotics Software Engineer specializing in multi-robot control and automation

Atlanta, GA · 4y exp
Georgia Institute of Technology · Georgia Tech

“Robotics software engineer with ~7 years of ROS/ROS2 experience spanning dual-arm metal additive manufacturing and prior work on the DARPA Subterranean Challenge. Developed in-house multi-arm collision/trajectory planning and achieved a major calibration improvement (from ~6 cm error to ~0.5 mm) via ICP point-cloud registration, with strong simulation/digital-twin, SLAM, and deployment (Docker/CI/CD) exposure.”

Python, C, C++, C#, MATLAB, Git, +73 more

Aayushi Singh

Screened

Intern AI/ML Engineer specializing in robotics and computer vision

Los Angeles, CA · 0y exp
BoltIOT · USC

“Worked on Sophia the humanoid robot, building production animation pipelines and enhancing human-robot interaction via perception and behavior orchestration. Experienced in stabilizing noisy perception-driven state transitions and designing smooth, user-centered behavioral flows, collaborating closely with artists, animators, and experience designers to translate creative intent into measurable system behavior.”

Agile, AngularJS, Blender, Bootstrap, C++, CI/CD, +144 more

Nikita Prasad

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and scalable data pipelines

Remote, USA · 5y exp
JPMorgan Chase · University of Dayton

“Built and shipped a production LLM-powered personalized client engagement assistant in the financial domain, balancing real-time recommendations with strict privacy/compliance requirements. Demonstrates strong MLOps/LLMOps depth (Airflow + MLflow, containerized microservices, drift monitoring) and a privacy-by-design approach validated in collaboration with risk and compliance teams.”

Python, Pandas, spaCy, R, SQL, PySpark, +199 more