Vetted Reinforcement Learning Professionals

Pre-screened and vetted.

Ti Wu - Junior Full-Stack Developer specializing in web apps and reinforcement learning in Hsinchu, Taiwan

Ti Wu

Screened

Junior Full-Stack Developer specializing in web apps and reinforcement learning

Hsinchu, Taiwan1y exp
Industrial Technology Research InstituteUniversity of Wisconsin–Madison

Built an AI basketball shooting coach that analyzes player form against NBA players and recruited 30+ beta users via Reddit to drive iterative UI/workflow improvements. Also has internship experience building an administrative server and coordinating API/database compatibility with another client server, emphasizing communication and integration quality.

View profile
Young Joon Suh - Senior Research Scientist specializing in AI for autonomous driving and semiconductors in Seoul, Korea

Senior Research Scientist specializing in AI for autonomous driving and semiconductors

Seoul, Korea5y exp
Korea Institute of Science and TechnologySan José State University

Robotics perception engineer focused on autonomous driving 3D detection, integrating PETR embeddings into BEVFormer and tackling hard orientation/temporal alignment issues in multi-camera BEV pipelines. Uses Gazebo with custom sensor plugins to validate calibration, timing, and transforms, and blends synthetic labels with real imagery for scalable 3D box generation.

View profile
Narendhiran Saravanane - Junior Robotics Software Engineer specializing in ROS 2, controls, and applied AI in Denver, USA

Junior Robotics Software Engineer specializing in ROS 2, controls, and applied AI

Denver, USA2y exp
DreamFace TechnologiesArizona State University

Robotics software engineer with 2+ years across ROS1/ROS2 projects spanning humanoid behavior engines and agricultural robots. Built an LLM-driven, ROS2-lifecycle-based decision system plus micro-ROS firmware on Teensy for modular sensors/motors, adding health monitoring that improved reliability 10x. Strong simulation/testing and deployment discipline (Gazebo, 95% coverage, Docker + AWS Greengrass/ECR, CI/CD) and demonstrated localization expertise with EKF sensor fusion achieving <0.5% error.

View profile
Utkarsh Gupta - Junior Robotics & ML Engineer specializing in perception, navigation, and VLA models in Los Angeles, CA

Utkarsh Gupta

Screened

Junior Robotics & ML Engineer specializing in perception, navigation, and VLA models

Los Angeles, CA1y exp
PSI Lab, USCUSC

Robotics software engineer with hands-on AGV/AMR experience at ERIC Robotics, building ROS2-based LiDAR perception and localization on NVIDIA Jetson for real-time deployment. Improved unstable localization in challenging environments (e.g., tunnels/bushes along rail tracks) via scan-matching, filtering, and consistency checks, and cut latency by moving from rclpy to rclcpp and leveraging CUDA. Comfortable across the stack from simulation (MuJoCo/Isaac Sim/Gazebo, domain randomization) to deployment tooling (Docker, basic CI) and distributed ROS2/DDS systems.

View profile
Sri Harsha patallapalli - Mid-level Machine Learning & Data Infrastructure Engineer specializing in MLOps on AWS in Boston, MA

Mid-level Machine Learning & Data Infrastructure Engineer specializing in MLOps on AWS

Boston, MA5y exp
Dextr.aiNortheastern University

Built and deployed a fine-tuned Qwen 2.5 14B model into production at Dextr.ai as the backbone for hotel-operations agentic workflows, running on AWS EKS with Triton and TensorRT-LLM. Demonstrates strong cost-aware LLM engineering (QLoRA, FP8/BF16 on H100) plus rigorous benchmarking/observability (Prometheus, LangSmith) with reported sub-30ms TTNT. Previously handled long-running ETL orchestration with Airflow at GE Healthcare and Lowe's.

View profile
SS

Mid-level AI Engineer specializing in LLMs, RAG, and content automation

Los Angeles, CA3y exp
Cloud9USC

AI/LLM engineer who built a production autonomous GenAI content ecosystem that generates short-form scripts, extracts viral highlights from long-form video, and dubs content into 33+ languages. Focused on making LLM outputs production-safe via schema enforcement, token-to-time alignment, critic-agent verification, and scalable async orchestration—cutting manual workflows by ~90% and saving $200k+ annually.

View profile
TM

Tejal Mane

Screened

Mid-level Machine Learning Engineer specializing in GenAI, LLMs, and real-time ML systems

Moundsville, WV4y exp
CitiusTechUniversity of Michigan

Built and deployed a production long-form article summarization system using BART/T5/PEGASUS, tackling real-world constraints like token limits, latency/quality tradeoffs, and factual drift via chunking/merge logic and constrained decoding. Uses pragmatic Python-based pipeline orchestration (scheduled jobs, modular scripts, logging/retries) and iterates with stakeholder feedback to make outputs genuinely useful for content workflows.

View profile
JL

Jiajun Long

Screened

Junior Robotics Researcher specializing in vision-based manipulation and learning-based control

Urbana, IL3y exp
University of Illinois Urbana-ChampaignUniversity of Illinois Urbana-Champaign

Robotics software candidate with experience spanning simulation (MuJoCo, Gazebo, Webots) and ROS1/ROS2 development, including hardware-oriented work on a hexapod and a Mecademic Meca500 R3 arm. Built a visually guided interactive indoor robot system using a CV pipeline plus POMDP + imitation learning with PPO-based residual RL, and has practical debugging experience improving LiDAR SLAM stability and migrating sensor interfaces from ROS1 to ROS2.

View profile
IK

Junior ML Engineer specializing in energy forecasting and battery optimization

San Carlos, CA3y exp
ElecricFishUniversity of Michigan

Backend/ML engineer working on a battery energy storage system operations dashboard: built a Flask backend integrated with OAuth and a separate FastAPI optimization/simulation service, deployed via Docker CI/CD to Azure Container Apps. Strong in productionizing ML (AzureML to batch endpoints) and in performance/scalability patterns (Postgres indexing/JSONB, per-unit data isolation, async throttling + caching for year-long CPU-intensive simulations across 40+ scenarios).

View profile
AM

Junior AI/ML Engineer specializing in LLM applications and RAG systems

College Park, MD1y exp
CNPCUniversity of Maryland, College Park

Built and deployed LLM-powered agentic systems including a multi-agent travel planning assistant using LangChain, RAG (FAISS), real-time APIs, and a supervisor agent to manage coordination and reduce hallucinations. Also developed a Text-to-SQL system with schema-aware validation guardrails, and collaborated with drilling domain experts at CNPC USA to build an ML model predicting rate of penetration (ROP).

View profile
Rishitha reddy katamareddy - Mid-level Generative AI & Machine Learning Engineer specializing in agentic LLM systems in USA

Mid-level Generative AI & Machine Learning Engineer specializing in agentic LLM systems

USA4y exp
OptumUniversity at Buffalo

Built and deployed a production agentic LLM knowledge assistant that answers complex questions over internal documents, APIs, and databases using a RAG architecture (FAISS/Pinecone) and LangChain/LangGraph orchestration. Emphasizes production-grade reliability and hallucination control through grounding, confidence thresholds, validation, retries/fallbacks, and full observability (logging/metrics/traces) with continuous evaluation and feedback loops.

View profile
Kunal Kulkarni - Intern AI/ML Researcher specializing in computer vision and data engineering in Palo Alto, CA

Intern AI/ML Researcher specializing in computer vision and data engineering

Palo Alto, CA1y exp
TieSetUCLA

Built a production-oriented multimodal RAG "Fix Assistant" with FastAPI, Tavily search, BM25 + cross-encoder reranking, and a local Phi-3.5 model, emphasizing strict grounding and fallback/verification modes to prevent hallucinations. Also has hands-on federated learning experience using STADLE to orchestrate edge-node training and aggregation for EV telemetry data, plus experience communicating AI results to non-technical stakeholders (traffic RL/congestion outcomes).

View profile
Wilson Harron - Director-level AI/ML & Computer Vision Engineer specializing in robotics and multimodal AI in Los Angeles, CA

Wilson Harron

Screened

Director-level AI/ML & Computer Vision Engineer specializing in robotics and multimodal AI

Los Angeles, CA15y exp
silvr.aiUniversity of Guelph

Candidate is not currently pursuing entrepreneurship (no business plan and no capital raised) and is not familiar with the VC/accelerator landscape. They show pragmatic, problem-first thinking about evaluating startup ideas—prioritizing real customer pain points and the quality of the founding team—and are open to working for others rather than founding "at all costs."

View profile
PS

Polam Srija

Screened

Mid-level AI/ML Engineer specializing in Generative AI and FinTech

Texas, USA3y exp
Fidelity InvestmentsUniversity of North Carolina at Charlotte

AI Engineer with hands-on ownership of a production multi-agent RAG platform in financial services, spanning experimentation, architecture, deployment, monitoring, and iterative optimization. Stands out for measurable impact: 35% retrieval relevance improvement and nearly 50% reduction in manual operational analysis effort, plus strong experience making enterprise LLM systems safer and more reliable in production.

View profile
PS

Entry-Level Robotics Researcher specializing in autonomous vehicles, SLAM, and motion planning

West Lafayette, IN1y exp
Purdue UniversityPurdue University

Robotics/AV engineer with strong ROS2 and autonomy stack integration experience, including bringing Autoware Universe up on a real Lexus autonomous vehicle platform. Also built a hierarchical reinforcement learning proof-of-concept for Boston Dynamics Spot (navigation + manipulation) and tackled sim-to-real challenges by implementing PD torque conversion for Jetson-based hardware; improved localization accuracy via GNSS+EKF fusion with a reported 28% drift reduction.

View profile
HV

Junior Machine Learning & Edge AI Engineer specializing in IoT and robotics

3y exp
Amazon Web ServicesUniversity at Buffalo

Robotics/ROS2-focused early-career engineer who built a stereo visual-odometry SLAM system for autonomous navigation and optimized it to run reliably in real time on Raspberry Pi. Strong in sensor fusion (camera+IMU), ROS2 debugging/profiling, and distributed robotics/IoT pipelines (ROS2 + MQTT + cloud), with added experience extracting WiFi CSI for sensing/localization and shipping via Docker + GitHub Actions CI/CD.

View profile
TD

Junior Robotics Engineer specializing in computer vision and SLAM

Boston, MA2y exp
Northeastern UniversityNortheastern University

Robotics software engineer focused on ROS2 autonomy, with hands-on work building a monocular visual odometry system on KITTI (including GPS-based scale correction and RViz trajectory visualization) and an end-to-end Gazebo simulation integrating URDF, slam_toolbox, and Nav2. Demonstrates strong practical debugging skills around TF frames, lifecycle nodes, and Gazebo plugin/version compatibility.

View profile
NA

Niveditha A

Screened

Mid-level AI/ML Engineer specializing in healthcare ML and LLM/RAG systems

USA4y exp
UnitedHealth GroupBowling Green State University

AI/LLM engineer with recent production experience at UnitedHealth Group building an end-to-end RAG system over structured EMR data and unstructured clinical notes, including evidence retrieval, GPT/LLaMA-based reasoning, and a validation layer for reliability. Strong in orchestration (Kubeflow/Airflow/MLflow), prompt engineering for noisy healthcare text, and rigorous evaluation/monitoring with gold-standard benchmarking, plus close collaboration with clinical operations stakeholders.

View profile
HR

Mid-level Data Engineer specializing in scalable ETL, streaming analytics, and cloud data platforms

Remote, USA7y exp
Dreamline AICalifornia State University, Fullerton

At Dreamline AI, built and productionized an AWS-based incentive intelligence platform that uses Llama-2/GPT-4 to extract eligibility rules from unstructured state policy documents into structured JSON, then processes them with Glue/PySpark and serves results via Lambda/SageMaker/API Gateway. Designed state-specific ingestion connectors plus schema validation and automated checks/alerts to handle frequent policy/format changes without breaking the pipeline, and partnered with business/analytics stakeholders to deliver interpretable eligibility decisions via explanations and dashboards.

View profile
PM

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps in Financial Services

Austin, TX5y exp
Charles SchwabUniversity of Central Missouri

ML/LLM engineer at Charles Schwab who built a production loan-advisor chatbot integrated with internal knowledge and loan-calculator APIs, adding strict numeric validation to prevent rate hallucinations and optimizing context to control costs. Also runs ~40 Airflow DAGs orchestrating retraining/ETL/drift monitoring with an automated Snowflake→SageMaker→auto-deploy pipeline, and uses rigorous testing plus canary rollouts tied to business metrics and compliance constraints.

View profile
MM

Executive Enterprise Architect & CTO specializing in cloud, digital transformation, and AI/ML

Chicago, IL21y exp
WindyCity TraderDePaul University

Senior enterprise architecture and engineering leader (Sr. Director / Principal Architect) who has owned enterprise IT strategy and governance for a $100M budget and partnered directly with C-suite stakeholders. Led a cruise-industry employee/crew digital transformation, scaling to 10 agile teams (~70 people) using SAFe/TOGAF and making architecture decisions optimized for low-connectivity environments (local database to avoid internet authentication).

View profile
Molli Dinesh - Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps in Remote, USA

Molli Dinesh

Screened

Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps

Remote, USA4y exp
Marsh McLennanIllinois Institute of Technology

Built an AI-driven insurance policy summarization platform at Marsh, taking it end-to-end from messy PDF ingestion/OCR and custom extraction through LLM fine-tuning and AWS SageMaker deployment. Delivered measurable impact (25% reduction in manual review time, 99% uptime) and demonstrated strong production MLOps/LLMOps practices with Airflow/Step Functions orchestration, rigorous evaluation (ROUGE + human review), and continuous monitoring for drift, latency, and hallucinations.

View profile
Prateek Pravanjan - Junior Machine Learning Engineer specializing in LLM evaluation and GenAI pipelines in Remote

Junior Machine Learning Engineer specializing in LLM evaluation and GenAI pipelines

Remote1y exp
MercorStevens Institute of Technology

LLM/agent engineer who built a production LangGraph multi-agent orchestrator connecting GitHub and APM/observability signals with a chain-of-verification loop for root-cause analysis. Emphasizes pragmatic architecture (start simple with state summaries), performance tuning (async LLM calls, Docker), and rigorous evaluation (LLM-as-judge, adversarial testing, hallucination/instruction adherence metrics, tool-call tracing) while iterating with non-technical stakeholders via A/B testing.

View profile
Ashwini Ramesh Kumar - Junior AI Software Engineer specializing in LLMs, RAG, and agent workflows in Remote

Junior AI Software Engineer specializing in LLMs, RAG, and agent workflows

Remote1y exp
UMass Chan Medical SchoolUniversity of Massachusetts Amherst

Backend/ML-leaning engineer who built a content-based event recommender for FlowMingle using embeddings + HNSW vector search on Google Cloud, with Firebase as the backend and a managed recommendation lifecycle (15 recs/user, daily async generation, weekly deletion) now serving 1500+ users. Also led a cost-driven migration of ConvAI services to Azure AI using parallel request testing from a Unity client, with post-migration monitoring via logs and model evals; contributed to a Massachusetts law-enforcement conversation analysis system by expanding ingestion to PDF/TXT/Excel and multi-file inputs.

View profile

Need someone specific?

AI Search