Vetted Reinforcement Learning Professionals

Pre-screened and vetted.

Reinforcement Learning, Python, PyTorch, Docker, TensorFlow, SQL

Yurong Luo

Screened

Senior Data Scientist/ML Engineer specializing in scalable ML and LLM systems

Remote | 9y exp
dataAnnotation | Virginia Commonwealth University

“Built and deployed an end-to-end product that brings a research-paper approach into production for large-scale time-series clustering, with attention to partitioning, latency, and scalability. Also designed a Python-based backend validation service (comparing outputs to database ground truths) and handled production reliability issues by reproducing dataset-specific crashes and hardening corner-case behavior with client-friendly errors.”

Python, Java, SQL, C, C++, Linux, +109

Manpreet Kour

Screened

Senior Data Scientist specializing in Generative AI and NLP

Seattle, USA | 6y exp
SOTI | Dr. B. R. Ambedkar National Institute of Technology, Jalandhar

“ML/NLP engineer with recent Scotiabank experience building production-grade indexing automation over large-scale emails and customer databases, combining LLM fine-tuning (Mistral, XLM-R) with fuzzy matching to exceed 95% accuracy under strict banking constraints. Also built a RAG-based chat agent using Gecko embeddings, Vertex AI Search, Gemini, and cross-encoder reranking, and delivered a text-to-SQL chatbot at SOTI through iterative fine-tuning and benchmark-driven experimentation.”

Machine Learning, Deep Learning, Generative AI, Computer Vision, PyTorch, PySpark, +92

Nishchal Gante

Screened

Mid-level Data Scientist specializing in MLOps and Generative AI

Illinois, IL | 4y exp
BNY Mellon | Illinois Institute of Technology

“Robotics software/ML engineer who built perception and navigation-related ML systems for autonomous supermarket carts, including object detection, shelf recognition, and obstacle avoidance. Strong ROS/ROS2 practitioner who optimized real-time performance (reported 50% latency reduction) and deployed containerized ROS/ML pipelines at scale using Docker, Kubernetes, and CI/CD.”

A/B Testing, Agile, Amazon API Gateway, Amazon Bedrock, Amazon EC2, Amazon RDS, +133

Srinidhi Pattala

Screened

Mid-level Robotics Engineer specializing in autonomy, perception, and sensor fusion

Boston, MA | 5y exp
Institute for Experiential Robotics | Northeastern University

“Robotics software engineer who contributed to an autonomous bartender robot (mobile base + ReactorX200 arm), owning manipulation/grasping, Gazebo simulation, and a YOLOv6 object-detection pipeline built from a manually collected/labeled dataset. Also handled system-level hardware bring-up integrating Raspberry Pi to ESP32 over micro-ROS on ROS2 Foxy, and has additional ROS package experience in EKF sensor fusion (IMU+GPS) and an autonomous disaster response boat.”

Agile, Bash, Bitbucket, C++, CI/CD, Computer vision, +145

Yusuf Abdikadir

Screened

Senior Software Engineer specializing in 3D simulation, digital twins, and robotics

London, United Kingdom | 6y exp
Alphadroid | UCL

“UK-based Unity developer who built a 3D simulation/digital-twin platform for an autonomous-vehicle startup, integrating Unity environments with external robotics stacks, web APIs, virtual sensing, and dynamic traffic systems. Interested in moving into VR, though has not shipped VR/Meta Quest titles yet.”

Unity, Robotics, Performance Optimization, C#, C++, Python, +73

Harideep Balusa

Screened

Mid-level AI/ML Engineer specializing in FinTech risk, fraud detection, and GenAI/RAG systems

USA | 6y exp
Freddie Mac | University of Wisconsin

“Built and productionized Azure-based LLM/RAG systems for regulatory/compliance use cases, including automating analyst research and compliance report generation across large unstructured document sets. Demonstrates strong practical depth in hallucination mitigation, hybrid retrieval tuning (BM25 + embeddings), and production MLOps (Databricks, Cognitive Search, AKS, Airflow/MLflow), plus proven ability to deliver auditable, explainable solutions with non-technical compliance teams.”

Python, R, SQL, Scala, Machine Learning, Deep Learning, +125

Ramin Mohammadi

Screened

Principal AI/ML Leader specializing in Generative AI, MLOps, and NLP

CA, USA | 11y exp
iBase-t | Northeastern University

“Founding member of Tausight, building AI systems to detect and protect PHI for healthcare organizations; helped take the company through post–Series A funding and exited after ~6 years. Drove a strategic collaboration with Intel’s OpenVINO team—becoming the first to deploy it in a real production system and improving model performance by ~30% on customer Intel-CPU machines.”

A/B Testing, Anomaly Detection, Change Management, CI/CD, Classification, Clustering, +149

Rakesh Medasani

Screened

Mid-level Full-Stack Developer specializing in scalable web apps and AI/ML systems

Houston, TX | 4y exp
Kgate Technologies, Inc. | University at Buffalo

“Built a healthcare app backend and supporting product pieces from scratch for Maverick Health—covering database schema, API structure, Node.js implementation, and UI design in Figma—while targeting 10,000 patients and keeping AWS run costs to ~$20–$30/month. Shipped an Android closed beta on Google Play and handled real-world launch hurdles like privacy policy compliance and push notification infrastructure.”

Python, C, C++, SQL, JavaScript, HTML, +89

Jitesh Sonkusare

Screened

Junior Robotics Software Engineer specializing in SLAM, autonomy, and perception

Danvers, MA | 2y exp
forREAL | Northeastern University

“Lead Robotics Software Engineer (forREAL, inc.) who built a production-ready indoor 3D reconstruction + autonomy stack: factor-graph multi-sensor SLAM (GTSAM), Gaussian-splatting virtual tours, RRT* planning over voxelized reconstructions, and 3D object anchoring using Grounding DINO + SAM2. Has deployed ROS2 systems across drones, AMRs, and ADAS simulation (CARLA), and supported multi-robot surgical platforms at Noah Medical using DDS namespaces/topic remapping.”

C, C++, Data Structures and Algorithms, Deep Learning, Docker, Gazebo, +140

Somil Shah

Screened

Mid-level AI/ML Engineer specializing in generative AI, RAG platforms, and LLM agents

San Francisco, CA | 4y exp
INTERACT Animal Lab | Northeastern University

“AI/LLM engineer who has shipped 10+ production applications, including InvestIQ on GCP—a production-grade RAG due-diligence engine that ethically scrapes web/PDF sources, builds a ChromaDB knowledge base, and delivers analyst-style dashboards plus a citation-backed chat copilot. Deep focus on reliability (evidence-only answers, hard citations, refusal gating), retrieval tuning, and orchestration (Airflow/Cloud Composer), plus multi-agent systems (CrewAI with 7 specialized finance agents).”

API Development, Bash, BigQuery, Business Intelligence, ChromaDB, CI/CD, +136

Brian Cho

Screened

Mid-level Robotics Researcher specializing in robot learning and surgical robotics

Salt Lake City, UT | 7y exp
University of Utah | University of Utah

“Robotics software/ML engineer who led an end-to-end transformer-based real-time 3D shape prediction system for a tendon-driven continuum robot on a KUKA arm, including ROS2 multi-camera RGB-D data collection, multi-view calibration, and optimized ICP point-cloud registration. Also optimized an online sensing + motion planning loop for robot-assisted surgery using Bayesian Hilbert maps and A* search, and has Gazebo + RL experience for a robotic salamander.”

Computer vision, Deep learning, GPT, Machine learning, MATLAB, OpenCV, +85

Vardhan Addakattu

Screened

Mid-level Data Scientist specializing in Generative AI and NLP for financial risk

Glassboro, NJ | 4y exp
S&P Global | Rowan University

“Built and shipped production generative AI/RAG assistants in regulated financial contexts (S&P Global), automating compliance-oriented Q&A over earnings reports/filings with grounded answers and citations. Experienced across the full stack—AWS-based ingestion (PySpark/Glue), vector retrieval + LangChain agents, GPT-4/Claude model selection, and production reliability (monitoring, caching, retries) plus rigorous evaluation and regression testing.”

Python, R, SQL, PySpark, Pandas, Apache Spark, +111

Nandini Kosgi

Screened

Mid-level AI/ML Engineer specializing in NLP, RAG systems, and real-time risk modeling

PA, USA | 4y exp
Capital One | Robert Morris University

“AI/ML Engineer with 4+ years of experience (Capital One, Odin Technologies) and a master’s in Data Analytics (4.0 GPA) who has deployed LLM/RAG systems to production for compliance/risk and document review. Strong in orchestration and MLOps (Airflow, Kubernetes, MLflow, GitHub Actions) and in tackling real-world LLM constraints like latency, context limits, and data privacy, with measurable impact (20%+ manual review reduction; 33% faster release cycles).”

Anomaly Detection, Apache Hadoop, Apache Hive, Apache Kafka, Apache Spark, AWS, +115

Prateek Sharma

Screened

Mid-level ADAS/Autonomy Software Engineer specializing in simulation, maps, and motion planning

Michigan, USA | 4y exp
Aptiv | Clemson University

“Robotics software engineer who owned the navigation/control stack for a commercial autonomous lawnmower at Wavemaker Labs, with hands-on sim-to-real tuning and real-world debugging. Experienced in ROS 1/ROS 2 path tracking (Pure Pursuit), including adapting ROS 2 planner code into a ROS 1 system, and building telemetry/logging to quantify tracking errors. Currently in an ADAS simulation team enabling algorithm teams with SIL/HIL and feature validation.”

Authentication, C++, Data Pipelines, Docker, Gazebo, Git, +91

Varun Gattamaneni

Screened

Mid-level GenAI Engineer specializing in LLM fine-tuning, RAG, and MLOps

Glassboro, NJ | 5y exp
HCLTech | Rowan University

“Healthcare-focused LLM engineer who deployed a production triage and clinical knowledge retrieval assistant using RAG and LangGraph-orchestrated multi-agent workflows. Emphasizes clinical safety and compliance with robust hallucination controls, HIPAA/PHI protections (tokenization, encryption, audit logging, zero-retention), and human-in-the-loop escalation; reports a 75% latency reduction in a healthcare agent system.”

Python, Pandas, NumPy, R, SQL, Bash, +150

Maneesh Bilalpur

Screened

Mid-level AI Researcher specializing in multimodal LLMs and human-centered AI

Pittsburgh, PA | 7y exp
University of Pittsburgh | University of Pittsburgh

“Has production deployment experience delivering computer-vision systems on AWS (Docker + S3) including a GDPR-focused face/license-plate obfuscation pipeline and a semantic-segmentation project aimed at reducing annotation time. Worked closely with DevOps and frontend teams and partnered with CEO/CMO to present an AI-driven annotation workflow to non-technical VC stakeholders.”

Large Language Models (LLMs), Deep Learning, Transformers, Computer Vision, Natural Language Processing, Model Deployment, +60

Venkatesh Sanaboina

Screened

Senior AI/ML Engineer specializing in Generative AI, LLMs, and MLOps

Tampa, FL | 9y exp
Verizon | Jawaharlal Nehru Technological University

“Telecom (Verizon) AI/ML practitioner who built a production multimodal system that ingests messy customer issue reports (calls, chats, emails, screenshots, videos) and turns them into confidence-scored incident summaries with reproducible steps and evidence links. Also built KPI/alarm-to-ticket correlation to rank likely root-cause domains (RAN/Core/Transport), cutting triage from hours to minutes and improving MTTR.”

A/B Testing, Agile, Amazon Redshift, Amazon S3, Amazon SageMaker, Anomaly Detection, +168

Pavan Yarlagadda

Screened

Junior Robotics Software Engineer specializing in ROS2 autonomy

Buffalo, NY | 1y exp
University at Buffalo | University at Buffalo

“Graduate student researcher on the EARTH project (college collaboration with Moog) working on robotics for an arm/bucket system. Implemented waypoint-based path planning, built an Apriltag data pipeline, and developed ROS 2 tooling including a joystick-to-DeltaCAN teleop node; exploring reinforcement learning policies trained from Tera simulator + ROS 2 bag data to optimize trajectory planning under varying pressure/load conditions.”

Artificial Intelligence, C++, CI/CD, Deep Learning, Distributed Systems, Gazebo, +102

Palaniappan Yeagappan

Screened

Junior Robotics Engineer specializing in autonomous driving and SLAM

Bengaluru, India | 2y exp
Cognizant | Northeastern University

“Robotics software engineer focused on real-time state estimation and perception pipelines, with hands-on C++/ROS work improving LiDAR+IMU odometry stability via an iterative EKF and careful timing/synchronization fixes. Has integrated LIO-SAM, built multi-robot communication bridges (ROS + custom UDP with heartbeat/fallback), and uses Gazebo + Docker for repeatable testing, backed by CI/CD experience maintaining Azure DevOps pipelines at Cognizant.”

Gazebo, PyTorch, TensorFlow, Python, C++, MATLAB, +174

Ashok Sai Doredla

Screened

Mid-level AI/ML Engineer specializing in Generative AI and production ML systems

United States | 5y exp
CVS Health | University of Maryland, Baltimore County

“At CVS Health, the candidate productionized a RAG-based LLM solution in a regulated healthcare setting, emphasizing reliable data pipelines, LoRA fine-tuning, monitoring, safety guardrails, and A/B testing. They have hands-on experience troubleshooting real-time RAG failures (e.g., chunking/embedding issues) and regularly lead developer-focused demos/workshops while translating technical architecture into business value for stakeholders.”

A/B Testing, Asynchronous Processing, AWS, AWS Lambda, Azure Blob Storage, Azure Functions, +142

Ketan Verma

Screened

Junior Applied AI Engineer specializing in data pipelines and ML systems

College Station, TX | 2y exp
Elysi | Texas A&M University

“Built an end-to-end wafer-data anomaly detection and reporting system at Samsung using PySpark, Random Forest models, SQL, and Grafana to help engineers track faults and take corrective action. Also has strong UX prototyping and validation practices in Figma plus hands-on front-end/full-stack experience (HTML/CSS/TypeScript), including a student project recognized as best design out of 25 teams, and early-stage startup experience pivoting a product based on user interviews into a real-time in-context feedback overlay.”

Python, SQL, C++, Java, Git, PySpark, +59

Varun Senthil Kumar

Screened

Intern Robotics & Cloud/DevOps Engineer specializing in autonomous systems

Dubai, UAE | 1y exp
Dubizzle | Arizona State University

“Robotics-focused engineer with hands-on projects ranging from a solo Dobot Magician Lite tic-tac-toe system (computer vision + minimax) to integrating an LLM with a Dobot arm for real-time pick-and-place via structured action outputs and validation. Also brings prior full-time DevOps experience (Docker/Kubernetes and CI/CD) and has used ROS/Gazebo for simulation work, including exploring improvements to crowd-aware navigation using human-trajectory datasets.”

AWS, Azure DevOps, Blender, C, C++, CI/CD, +51

Abhinav Vengala

Screened

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps

Chantilly, VA | 3y exp
Verizon | University of North Texas

“LLM/agentic systems engineer who built a production "Agentic AI Diagnostic Assistant" for network engineers, using a multi-agent Llama 2 + LangChain architecture with RAG over telemetry/incident data in DynamoDB and confidence-based deferrals to reduce hallucinations. Also has strong MLOps/orchestration experience (Airflow, EventBridge, Spark, Docker, SageMaker/ECS) at multi-terabyte/day scale and delivered multilingual NLP analytics (fine-tuned BERT/spaCy) for support operations through hands-on stakeholder workshops.”

Python, NumPy, Pandas, SciPy, PyTorch, TensorFlow, +116

Vasanthi N.

Screened

Senior AI/ML Engineer and Data Scientist specializing in Generative AI and MLOps

Los Angeles, CA | 9y exp
Pacific Community Bank | Aurora University

“ML/NLP practitioner focused on financial-services document intelligence and compliance workflows—built an end-to-end pipeline to classify documents and extract financial entities from loan applications, emails, and statements stored in S3/internal databases. Strong in entity resolution/record linkage and in productionizing pipelines with GitHub Actions CI/CD, testing, data validation, and Docker, plus semantic search using OpenAI embeddings and a vector database.”

A/B Testing, Agile, Anomaly Detection, API Integration, AWS, AWS Glue, +137