Reval Logo
Home Browse Talent Skilled in Reinforcement Learning

Vetted Reinforcement Learning Professionals

Pre-screened and vetted.

Reinforcement LearningPythonPyTorchDockerTensorFlowSQL
SH

Sri Harsha patallapalli

Screened

Mid-level Machine Learning & Data Infrastructure Engineer specializing in MLOps on AWS

Boston, MA5y exp
Dextr.aiNortheastern University

“Built and deployed a fine-tuned Qwen 2.5 14B model into production at Dextr.ai as the backbone for hotel-operations agentic workflows, running on AWS EKS with Triton and TensorRT-LLM. Demonstrates strong cost-aware LLM engineering (QLoRA, FP8/BF16 on H100) plus rigorous benchmarking/observability (Prometheus, LangSmith) with reported sub-30ms TTNT. Previously handled long-running ETL orchestration with Airflow at GE Healthcare and Lowe's.”

PythonJavaC++SQLJavaScriptBash+113
View profile
YJ

Young Joon Suh

Screened

Senior Research Scientist specializing in AI for autonomous driving and semiconductors

Seoul, Korea5y exp
Korea Institute of Science and TechnologySan José State University

“Robotics perception engineer focused on autonomous driving 3D detection, integrating PETR embeddings into BEVFormer and tackling hard orientation/temporal alignment issues in multi-camera BEV pipelines. Uses Gazebo with custom sensor plugins to validate calibration, timing, and transforms, and blends synthetic labels with real imagery for scalable 3D box generation.”

Artificial IntelligenceDeep LearningMachine LearningReinforcement LearningData EngineeringRecommender Systems+62
View profile
HW

Hans Walker

Screened

Junior Machine Learning Engineer specializing in generative AI and computer vision

Boston, MA2y exp
CuebricUSC

“AI engineer who deployed a production LLM-powered safety system for an education platform, combining rule-based checks, multi-LLM verification, and selective context (prompt+image vs image-only) to prevent explicit prompts/images from getting through. Strong focus on reliability via benchmarking, trace-based failure analysis, and continuous improvement driven by stakeholder feedback and manual review.”

AWSBashBERTCC++Classification+80
View profile
TW

Ti Wu

Screened

Junior Full-Stack Developer specializing in web apps and reinforcement learning

Hsinchu, Taiwan1y exp
Industrial Technology Research InstituteUniversity of Wisconsin–Madison

“Built an AI basketball shooting coach that analyzes player form against NBA players and recruited 30+ beta users via Reddit to drive iterative UI/workflow improvements. Also has internship experience building an administrative server and coordinating API/database compatibility with another client server, emphasizing communication and integration quality.”

PythonJavaCC++JavaScriptTypeScript+95
View profile
KR

Krishnakaanth Reddy Yeduguru

Screened

Mid-level AI/ML Engineer specializing in LLMs, NLP, and MLOps

Texas, USA4y exp
McKessonUniversity of Texas at Arlington

“AI/ML engineer with healthcare domain depth who led a HIPAA-compliant, production LLM system at McKesson to automate clinical document understanding—extracting entities, summarizing provider notes, and supporting authorization decisions. Hands-on across Spark/Python ETL, Hugging Face + LoRA/QLoRA fine-tuning, RAG, and cloud-native MLOps (Airflow/Kubernetes/Step Functions, MLflow, blue-green on EKS/GKE), with explicit work on PHI handling and hallucination reduction.”

PythonC++SQLBashTensorFlowPyTorch+129
View profile
JB

Jaideep bommidi

Screened

Senior ML Engineer & Data Scientist specializing in LLM agents, retrieval/ranking, and MLOps

Denton, TX8y exp
Webster BankUniversity of North Texas

“Machine Learning Engineer currently at Webster Bank building an enterprise-scale LLM agent for Temenos Journey Manager/Maestro, using RAG-style multi-stage retrieval with FAISS/Pinecone, hybrid dense+sparse search, and LoRA fine-tuning optimized via NDCG/MAP and A/B testing. Previously handled messy incident/telemetry data at Deuta Werke GmbH with deterministic + fuzzy entity resolution, and has strong production data engineering experience across Spark/Hadoop and Python ETL systems.”

A/B TestingAgileAmazon EC2Amazon EKSAmazon ECSAmazon Kinesis+181
View profile
RG

Rohan Gore

Screened

Intern AI/ML Engineer specializing in agentic systems and full-stack development

New York City, NY0y exp
MARV CapitalNYU

“Built and scaled a multi-agent LLM automation pipeline during a fintech internship, growing from a rapid 1-week proof-of-concept to a 15+ agent hierarchical system that cut market brief report generation time from ~5 hours to under 30 minutes. Hands-on with agent frameworks (Haystack, CrewAI, LangChain) and experienced in debugging agent communication issues via sandboxed modular testing and context/token management; also regularly gives architecture-first technical demos at multiple hackathons and university events.”

Apache CassandraApache HadoopApache KafkaAWSAWS LambdaC#+93
View profile
SS

Samarth Saxena

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and content automation

Los Angeles, CA3y exp
Cloud9USC

“AI/LLM engineer who built a production autonomous GenAI content ecosystem that generates short-form scripts, extracts viral highlights from long-form video, and dubs content into 33+ languages. Focused on making LLM outputs production-safe via schema enforcement, token-to-time alignment, critic-agent verification, and scalable async orchestration—cutting manual workflows by ~90% and saving $200k+ annually.”

PythonSQLScalaTypeScriptBashJava+162
View profile
TM

Tejal Mane

Screened

Mid-level Machine Learning Engineer specializing in GenAI, LLMs, and real-time ML systems

Moundsville, WV4y exp
CitiusTechUniversity of Michigan

“Built and deployed a production long-form article summarization system using BART/T5/PEGASUS, tackling real-world constraints like token limits, latency/quality tradeoffs, and factual drift via chunking/merge logic and constrained decoding. Uses pragmatic Python-based pipeline orchestration (scheduled jobs, modular scripts, logging/retries) and iterates with stakeholder feedback to make outputs genuinely useful for content workflows.”

AgileApache HadoopApache KafkaAWSCI/CDCUDA+112
View profile
JL

Jiajun Long

Screened

Junior Robotics Researcher specializing in vision-based manipulation and learning-based control

Urbana, IL3y exp
University of Illinois Urbana-ChampaignUniversity of Illinois Urbana-Champaign

“Robotics software candidate with experience spanning simulation (MuJoCo, Gazebo, Webots) and ROS1/ROS2 development, including hardware-oriented work on a hexapod and a Mecademic Meca500 R3 arm. Built a visually guided interactive indoor robot system using a CV pipeline plus POMDP + imitation learning with PPO-based residual RL, and has practical debugging experience improving LiDAR SLAM stability and migrating sensor interfaces from ROS1 to ROS2.”

PythonCC++MATLABGazeboVisual Studio Code+73
View profile
RR

Rishitha reddy katamareddy

Screened

Mid-level Generative AI & Machine Learning Engineer specializing in agentic LLM systems

USA4y exp
OptumUniversity at Buffalo

“Built and deployed a production agentic LLM knowledge assistant that answers complex questions over internal documents, APIs, and databases using a RAG architecture (FAISS/Pinecone) and LangChain/LangGraph orchestration. Emphasizes production-grade reliability and hallucination control through grounding, confidence thresholds, validation, retries/fallbacks, and full observability (logging/metrics/traces) with continuous evaluation and feedback loops.”

Generative AILarge Language Models (LLMs)LangChainLangGraphReActPrompt Engineering+175
View profile
KK

Kunal Kulkarni

Screened

Intern AI/ML Researcher specializing in computer vision and data engineering

Palo Alto, CA1y exp
TieSetUCLA

“Built a production-oriented multimodal RAG "Fix Assistant" with FastAPI, Tavily search, BM25 + cross-encoder reranking, and a local Phi-3.5 model, emphasizing strict grounding and fallback/verification modes to prevent hallucinations. Also has hands-on federated learning experience using STADLE to orchestrate edge-node training and aggregation for EV telemetry data, plus experience communicating AI results to non-technical stakeholders (traffic RL/congestion outcomes).”

AWSBashCC++CI/CDCUDA+128
View profile
IK

Iaroslav Kovalchuk

Screened

Junior ML Engineer specializing in energy forecasting and battery optimization

San Carlos, CA3y exp
ElecricFishUniversity of Michigan

“Backend/ML engineer working on a battery energy storage system operations dashboard: built a Flask backend integrated with OAuth and a separate FastAPI optimization/simulation service, deployed via Docker CI/CD to Azure Container Apps. Strong in productionizing ML (AzureML to batch endpoints) and in performance/scalability patterns (Postgres indexing/JSONB, per-unit data isolation, async throttling + caching for year-long CPU-intensive simulations across 40+ scenarios).”

Azure Machine LearningBashCI/CDCC++Computer Vision+78
View profile
AM

Amaan Mohammed

Screened

Junior AI/ML Engineer specializing in LLM applications and RAG systems

College Park, MD1y exp
CNPCUniversity of Maryland, College Park

“Built and deployed LLM-powered agentic systems including a multi-agent travel planning assistant using LangChain, RAG (FAISS), real-time APIs, and a supervisor agent to manage coordination and reduce hallucinations. Also developed a Text-to-SQL system with schema-aware validation guardrails, and collaborated with drilling domain experts at CNPC USA to build an ML model predicting rate of penetration (ROP).”

PythonRSQLSQLAlchemySQLiteJSON+95
View profile
AR

Ashwini Ramesh Kumar

Screened

Junior AI Software Engineer specializing in LLMs, RAG, and agent workflows

Remote1y exp
UMass Chan Medical SchoolUniversity of Massachusetts Amherst

“Backend/ML-leaning engineer who built a content-based event recommender for FlowMingle using embeddings + HNSW vector search on Google Cloud, with Firebase as the backend and a managed recommendation lifecycle (15 recs/user, daily async generation, weekly deletion) now serving 1500+ users. Also led a cost-driven migration of ConvAI services to Azure AI using parallel request testing from a Unity client, with post-migration monitoring via logs and model evals; contributed to a Massachusetts law-enforcement conversation analysis system by expanding ingestion to PDF/TXT/Excel and multi-file inputs.”

PythonC++SQLPL/SQLGitDocker+112
View profile
PS

Prashanth Sankaranarayanan

Screened

Entry-Level Robotics Researcher specializing in autonomous vehicles, SLAM, and motion planning

West Lafayette, IN1y exp
Purdue UniversityPurdue University

“Robotics/AV engineer with strong ROS2 and autonomy stack integration experience, including bringing Autoware Universe up on a real Lexus autonomous vehicle platform. Also built a hierarchical reinforcement learning proof-of-concept for Boston Dynamics Spot (navigation + manipulation) and tackled sim-to-real challenges by implementing PD torque conversion for Jetson-based hardware; improved localization accuracy via GNSS+EKF fusion with a reported 28% drift reduction.”

C++GazeboGitJavaScriptLinuxMATLAB+118
View profile
SG

Shivam Goel

Screened

Senior Robotics Researcher specializing in neurosymbolic robot learning and manipulation

Medford, MA9y exp
Tufts UniversityTufts University

“Robotics software researcher who led a Boston Dynamics SPOT project on non-prehensile manipulation of heavy boxes, combining MuJoCo-based RL, ViT-based perception, and SPOT SDK control; the work is under review for ICRA 2026. Also built a ROS planning-and-learning stack on a LoCoBot using PDDL task planning, RTAB-Map SLAM, MoveIt motion planning, and RL to recover from execution failures.”

Reinforcement LearningComputer VisionPythonC++PyTorchTensorFlow+69
View profile
PP

Prateek Pravanjan

Screened

Junior Machine Learning Engineer specializing in LLM evaluation and GenAI pipelines

Remote1y exp
MercorStevens Institute of Technology

“LLM/agent engineer who built a production LangGraph multi-agent orchestrator connecting GitHub and APM/observability signals with a chain-of-verification loop for root-cause analysis. Emphasizes pragmatic architecture (start simple with state summaries), performance tuning (async LLM calls, Docker), and rigorous evaluation (LLM-as-judge, adversarial testing, hallucination/instruction adherence metrics, tool-call tracing) while iterating with non-technical stakeholders via A/B testing.”

PyTorchTransformersNumPyScikit-learnModel evaluationPandas+135
View profile
HV

Harsha vardhan reddy Yerranagu

Screened

Junior Machine Learning & Edge AI Engineer specializing in IoT and robotics

3y exp
Amazon Web ServicesUniversity at Buffalo

“Robotics/ROS2-focused early-career engineer who built a stereo visual-odometry SLAM system for autonomous navigation and optimized it to run reliably in real time on Raspberry Pi. Strong in sensor fusion (camera+IMU), ROS2 debugging/profiling, and distributed robotics/IoT pipelines (ROS2 + MQTT + cloud), with added experience extracting WiFi CSI for sensing/localization and shipping via Docker + GitHub Actions CI/CD.”

LinuxGitDockerPythonCC+++106
View profile
TD

Tejaswini Dilip Deore

Screened

Junior Robotics Engineer specializing in computer vision and SLAM

Boston, MA2y exp
Northeastern UniversityNortheastern University

“Robotics software engineer focused on ROS2 autonomy, with hands-on work building a monocular visual odometry system on KITTI (including GPS-based scale correction and RViz trajectory visualization) and an end-to-end Gazebo simulation integrating URDF, slam_toolbox, and Nav2. Demonstrates strong practical debugging skills around TF frames, lifecycle nodes, and Gazebo plugin/version compatibility.”

Artificial IntelligenceC++Computer VisionContainerizationDockerGazebo+87
View profile
NA

Niveditha A

Screened

Mid-level AI/ML Engineer specializing in healthcare ML and LLM/RAG systems

USA4y exp
UnitedHealth GroupBowling Green State University

“AI/LLM engineer with recent production experience at UnitedHealth Group building an end-to-end RAG system over structured EMR data and unstructured clinical notes, including evidence retrieval, GPT/LLaMA-based reasoning, and a validation layer for reliability. Strong in orchestration (Kubeflow/Airflow/MLflow), prompt engineering for noisy healthcare text, and rigorous evaluation/monitoring with gold-standard benchmarking, plus close collaboration with clinical operations stakeholders.”

PythonNumPyPandasJSONSQLPostgreSQL+152
View profile
RB

Ravi Bijinepalli

Screened

Senior ADAS/Autonomous Vehicle Validation Engineering Manager

Columbus, Ohio8y exp
The Ohio State UniversityOhio State University

“Controls and automated-vehicle engineer from the EcoCar EV Challenge who led an 8-person team implementing ACC and lane-centering directly on a vehicle (1st place result). Strong in CAN-based debugging, simulation-to-real deployment (Simulink to C++/dSPACE), and distributed robotics communication using DDS, with additional exposure to multi-agent RL and control barrier functions for coordinated driving.”

Root-Cause AnalysisProgram ManagementCross-Functional LeadershipStakeholder ManagementMentoringMATLAB+108
View profile
HR

Hrishikesh Raghunath

Screened

Mid-level Data Engineer specializing in scalable ETL, streaming analytics, and cloud data platforms

Remote, USA7y exp
Dreamline AICalifornia State University, Fullerton

“At Dreamline AI, built and productionized an AWS-based incentive intelligence platform that uses Llama-2/GPT-4 to extract eligibility rules from unstructured state policy documents into structured JSON, then processes them with Glue/PySpark and serves results via Lambda/SageMaker/API Gateway. Designed state-specific ingestion connectors plus schema validation and automated checks/alerts to handle frequent policy/format changes without breaking the pipeline, and partnered with business/analytics stakeholders to deliver interpretable eligibility decisions via explanations and dashboards.”

A/B TestingAmazon CloudWatchAmazon KinesisAmazon RedshiftAmazon S3Amazon SageMaker+114
View profile
PM

Pooja Murigappa

Screened

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps in Financial Services

Austin, TX5y exp
Charles SchwabUniversity of Central Missouri

“ML/LLM engineer at Charles Schwab who built a production loan-advisor chatbot integrated with internal knowledge and loan-calculator APIs, adding strict numeric validation to prevent rate hallucinations and optimizing context to control costs. Also runs ~40 Airflow DAGs orchestrating retraining/ETL/drift monitoring with an automated Snowflake→SageMaker→auto-deploy pipeline, and uses rigorous testing plus canary rollouts tied to business metrics and compliance constraints.”

Amazon DynamoDBApache AirflowApache KafkaApache SparkAWSAWS Glue+183
View profile
1...212223...44

Related

Machine Learning EngineersResearch AssistantsSoftware EngineersData ScientistsTeaching AssistantsAI EngineersAI & Machine LearningEngineeringEducationData & Analytics

Need someone specific?

AI Search