Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Reinforcement Learning Professionals

Pre-screened and vetted.

Reinforcement Learning Python PyTorch Docker TensorFlow SQL

Narendhiran Saravanane

Screened

Junior Robotics Software Engineer specializing in ROS 2, controls, and applied AI

Denver, USA2y exp

DreamFace TechnologiesArizona State University

“Robotics software engineer with 2+ years across ROS1/ROS2 projects spanning humanoid behavior engines and agricultural robots. Built an LLM-driven, ROS2-lifecycle-based decision system plus micro-ROS firmware on Teensy for modular sensors/motors, adding health monitoring that improved reliability 10x. Strong simulation/testing and deployment discipline (Gazebo, 95% coverage, Docker + AWS Greengrass/ECR, CI/CD) and demonstrated localization expertise with EKF sensor fusion achieving <0.5% error.”

Python C C++MATLAB Git Bash+106

View profile

Utkarsh Gupta

Screened

Junior Robotics & ML Engineer specializing in perception, navigation, and VLA models

Los Angeles, CA1y exp

PSI Lab, USCUSC

“Robotics software engineer with hands-on AGV/AMR experience at ERIC Robotics, building ROS2-based LiDAR perception and localization on NVIDIA Jetson for real-time deployment. Improved unstable localization in challenging environments (e.g., tunnels/bushes along rail tracks) via scan-matching, filtering, and consistency checks, and cut latency by moving from rclpy to rclcpp and leveraging CUDA. Comfortable across the stack from simulation (MuJoCo/Isaac Sim/Gazebo, domain randomization) to deployment tooling (Docker, basic CI) and distributed ROS2/DDS systems.”

C++Classification Keras Linux Machine Learning Matplotlib+80

View profile

Sri Harsha patallapalli

Screened

Mid-level Machine Learning & Data Infrastructure Engineer specializing in MLOps on AWS

Boston, MA5y exp

Dextr.aiNortheastern University

“Built and deployed a fine-tuned Qwen 2.5 14B model into production at Dextr.ai as the backbone for hotel-operations agentic workflows, running on AWS EKS with Triton and TensorRT-LLM. Demonstrates strong cost-aware LLM engineering (QLoRA, FP8/BF16 on H100) plus rigorous benchmarking/observability (Prometheus, LangSmith) with reported sub-30ms TTNT. Previously handled long-running ETL orchestration with Airflow at GE Healthcare and Lowe's.”

Python Java C++SQL JavaScript Bash+113

View profile

Samarth Saxena

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and content automation

Los Angeles, CA3y exp

Cloud9USC

“AI/LLM engineer who built a production autonomous GenAI content ecosystem that generates short-form scripts, extracts viral highlights from long-form video, and dubs content into 33+ languages. Focused on making LLM outputs production-safe via schema enforcement, token-to-time alignment, critic-agent verification, and scalable async orchestration—cutting manual workflows by ~90% and saving $200k+ annually.”

Python SQL Scala TypeScript Bash Java+162

View profile

Tejal Mane

Screened

Mid-level Machine Learning Engineer specializing in GenAI, LLMs, and real-time ML systems

Moundsville, WV4y exp

CitiusTechUniversity of Michigan

“Built and deployed a production long-form article summarization system using BART/T5/PEGASUS, tackling real-world constraints like token limits, latency/quality tradeoffs, and factual drift via chunking/merge logic and constrained decoding. Uses pragmatic Python-based pipeline orchestration (scheduled jobs, modular scripts, logging/retries) and iterates with stakeholder feedback to make outputs genuinely useful for content workflows.”

Agile Apache Hadoop Apache Kafka AWS CI/CD Classification+112

View profile

Jiajun Long

Screened

Junior Robotics Researcher specializing in vision-based manipulation and learning-based control

Urbana, IL3y exp

University of Illinois Urbana-ChampaignUniversity of Illinois Urbana-Champaign

“Robotics software candidate with experience spanning simulation (MuJoCo, Gazebo, Webots) and ROS1/ROS2 development, including hardware-oriented work on a hexapod and a Mecademic Meca500 R3 arm. Built a visually guided interactive indoor robot system using a CV pipeline plus POMDP + imitation learning with PPO-based residual RL, and has practical debugging experience improving LiDAR SLAM stability and migrating sensor interfaces from ROS1 to ROS2.”

Python C C++MATLAB Visual Studio Code Computer Vision+73

View profile

Iaroslav Kovalchuk

Screened

Junior ML Engineer specializing in energy forecasting and battery optimization

San Carlos, CA3y exp

ElecricFishUniversity of Michigan

“Backend/ML engineer working on a battery energy storage system operations dashboard: built a Flask backend integrated with OAuth and a separate FastAPI optimization/simulation service, deployed via Docker CI/CD to Azure Container Apps. Strong in productionizing ML (AzureML to batch endpoints) and in performance/scalability patterns (Postgres indexing/JSONB, per-unit data isolation, async throttling + caching for year-long CPU-intensive simulations across 40+ scenarios).”

Azure Machine Learning Bash CI/CD C C++Computer Vision+78

View profile

Rishitha reddy katamareddy

Screened

Mid-level Generative AI & Machine Learning Engineer specializing in agentic LLM systems

USA4y exp

OptumUniversity at Buffalo

“Built and deployed a production agentic LLM knowledge assistant that answers complex questions over internal documents, APIs, and databases using a RAG architecture (FAISS/Pinecone) and LangChain/LangGraph orchestration. Emphasizes production-grade reliability and hallucination control through grounding, confidence thresholds, validation, retries/fallbacks, and full observability (logging/metrics/traces) with continuous evaluation and feedback loops.”

Agentic AI Generative AI Large Language Models (LLMs)LangChain LangGraph Multi-Agent Systems+175

View profile

Kunal Kulkarni

Screened

Intern AI/ML Researcher specializing in computer vision and data engineering

Palo Alto, CA1y exp

TieSetUCLA

“Built a production-oriented multimodal RAG "Fix Assistant" with FastAPI, Tavily search, BM25 + cross-encoder reranking, and a local Phi-3.5 model, emphasizing strict grounding and fallback/verification modes to prevent hallucinations. Also has hands-on federated learning experience using STADLE to orchestrate edge-node training and aggregation for EV telemetry data, plus experience communicating AI results to non-technical stakeholders (traffic RL/congestion outcomes).”

AWS Bash C C++CI/CD Computer Vision+128

View profile

Wilson Harron

Screened

Director-level AI/ML & Computer Vision Engineer specializing in robotics and multimodal AI

Los Angeles, CA15y exp

silvr.aiUniversity of Guelph

“Candidate is not currently pursuing entrepreneurship (no business plan and no capital raised) and is not familiar with the VC/accelerator landscape. They show pragmatic, problem-first thinking about evaluating startup ideas—prioritizing real customer pain points and the quality of the founding team—and are open to working for others rather than founding "at all costs."”

Machine Learning Computer Vision Large Language Models (LLMs)Reinforcement Learning OCR ETL+75

View profile

Polam Srija

Screened

Mid-level AI/ML Engineer specializing in Generative AI and FinTech

Texas, USA3y exp

Fidelity InvestmentsUniversity of North Carolina at Charlotte

“AI Engineer with hands-on ownership of a production multi-agent RAG platform in financial services, spanning experimentation, architecture, deployment, monitoring, and iterative optimization. Stands out for measurable impact: 35% retrieval relevance improvement and nearly 50% reduction in manual operational analysis effort, plus strong experience making enterprise LLM systems safer and more reliable in production.”

Python SQL Java C C++JavaScript+176

View profile

Amaan Mohammed

Screened

Entry-level Machine Learning Engineer specializing in generative AI and applied ML

College Park, MD1y exp

CNPCUniversity of Maryland, College Park

“Built and deployed LLM-powered agentic systems including a multi-agent travel planning assistant using LangChain, RAG (FAISS), real-time APIs, and a supervisor agent to manage coordination and reduce hallucinations. Also developed a Text-to-SQL system with schema-aware validation guardrails, and collaborated with drilling domain experts at CNPC USA to build an ML model predicting rate of penetration (ROP).”

Python R SQL Go TypeScript PyTorch+143

View profile

Indrajeet Patwardhan

Screened

Intern Data Scientist specializing in machine learning and predictive modeling

Irvine, CA2y exp

Trilemma FoundationUC Irvine

“Built across data, backend, analytics, and visualization-heavy applications, including a nonprofit financial forecasting app, large-scale insurance model analysis at Mercury Insurance, and a publicly deployed soccer analytics dashboard. Stands out for combining machine learning, large-dataset SQL work, and practical production improvements like cutting dashboard load times to under two seconds and refactoring codebases for smoother team handoff.”

Machine Learning Predictive Modeling Data Engineering Neural Networks Deep Learning Reinforcement Learning+105

View profile

Prashanth Sankaranarayanan

Screened

Entry-Level Robotics Researcher specializing in autonomous vehicles, SLAM, and motion planning

West Lafayette, IN1y exp

Purdue UniversityPurdue University

“Robotics/AV engineer with strong ROS2 and autonomy stack integration experience, including bringing Autoware Universe up on a real Lexus autonomous vehicle platform. Also built a hierarchical reinforcement learning proof-of-concept for Boston Dynamics Spot (navigation + manipulation) and tackled sim-to-real challenges by implementing PD torque conversion for Jetson-based hardware; improved localization accuracy via GNSS+EKF fusion with a reported 28% drift reduction.”

C++Git JavaScript Linux MATLAB Neural networks+118

View profile

Harsha vardhan reddy Yerranagu

Screened

Junior Machine Learning & Edge AI Engineer specializing in IoT and robotics

3y exp

Amazon Web ServicesUniversity at Buffalo

“Robotics/ROS2-focused early-career engineer who built a stereo visual-odometry SLAM system for autonomous navigation and optimized it to run reliably in real time on Raspberry Pi. Strong in sensor fusion (camera+IMU), ROS2 debugging/profiling, and distributed robotics/IoT pipelines (ROS2 + MQTT + cloud), with added experience extracting WiFi CSI for sensing/localization and shipping via Docker + GitHub Actions CI/CD.”

Linux Git Docker Python C C+++106

View profile

Tejaswini Dilip Deore

Screened

Junior Robotics Engineer specializing in computer vision and SLAM

Boston, MA2y exp

Northeastern UniversityNortheastern University

“Robotics software engineer focused on ROS2 autonomy, with hands-on work building a monocular visual odometry system on KITTI (including GPS-based scale correction and RViz trajectory visualization) and an end-to-end Gazebo simulation integrating URDF, slam_toolbox, and Nav2. Demonstrates strong practical debugging skills around TF frames, lifecycle nodes, and Gazebo plugin/version compatibility.”

Artificial Intelligence C++Computer Vision Containerization Docker Git+87

View profile

Niveditha A

Screened

Mid-level AI/ML Engineer specializing in healthcare ML and LLM/RAG systems

USA4y exp

UnitedHealth GroupBowling Green State University

“AI/LLM engineer with recent production experience at UnitedHealth Group building an end-to-end RAG system over structured EMR data and unstructured clinical notes, including evidence retrieval, GPT/LLaMA-based reasoning, and a validation layer for reliability. Strong in orchestration (Kubeflow/Airflow/MLflow), prompt engineering for noisy healthcare text, and rigorous evaluation/monitoring with gold-standard benchmarking, plus close collaboration with clinical operations stakeholders.”

Python NumPy Pandas JSON SQL PostgreSQL+152

View profile

Hrishikesh Raghunath

Screened

Mid-level Data Engineer specializing in scalable ETL, streaming analytics, and cloud data platforms

Remote, USA7y exp

Dreamline AICalifornia State University, Fullerton

“At Dreamline AI, built and productionized an AWS-based incentive intelligence platform that uses Llama-2/GPT-4 to extract eligibility rules from unstructured state policy documents into structured JSON, then processes them with Glue/PySpark and serves results via Lambda/SageMaker/API Gateway. Designed state-specific ingestion connectors plus schema validation and automated checks/alerts to handle frequent policy/format changes without breaking the pipeline, and partnered with business/analytics stakeholders to deliver interpretable eligibility decisions via explanations and dashboards.”

A/B Testing Amazon CloudWatch Amazon Kinesis Amazon Redshift Amazon S3 Amazon SageMaker+114

View profile

Pooja Murigappa

Screened

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps in Financial Services

Austin, TX5y exp

Charles SchwabUniversity of Central Missouri

“ML/LLM engineer at Charles Schwab who built a production loan-advisor chatbot integrated with internal knowledge and loan-calculator APIs, adding strict numeric validation to prevent rate hallucinations and optimizing context to control costs. Also runs ~40 Airflow DAGs orchestrating retraining/ETL/drift monitoring with an automated Snowflake→SageMaker→auto-deploy pipeline, and uses rigorous testing plus canary rollouts tied to business metrics and compliance constraints.”

Amazon DynamoDB Apache Airflow Apache Kafka Apache Spark AWS AWS Glue+183

View profile

Mueed MOHAMMED

Screened

Executive Enterprise Architect & CTO specializing in cloud, digital transformation, and AI/ML

Chicago, IL21y exp

WindyCity TraderDePaul University

“Senior enterprise architecture and engineering leader (Sr. Director / Principal Architect) who has owned enterprise IT strategy and governance for a $100M budget and partnered directly with C-suite stakeholders. Led a cruise-industry employee/crew digital transformation, scaling to 10 agile teams (~70 people) using SAFe/TOGAF and making architecture decisions optimized for low-connectivity environments (local database to avoid internet authentication).”

.NET Android iOS Microsoft SQL Server Microsoft Azure Salesforce+129

View profile

Molli Dinesh

Screened

Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps

Remote, USA4y exp

Marsh McLennanIllinois Institute of Technology

“Built an AI-driven insurance policy summarization platform at Marsh, taking it end-to-end from messy PDF ingestion/OCR and custom extraction through LLM fine-tuning and AWS SageMaker deployment. Delivered measurable impact (25% reduction in manual review time, 99% uptime) and demonstrated strong production MLOps/LLMOps practices with Airflow/Step Functions orchestration, rigorous evaluation (ROUGE + human review), and continuous monitoring for drift, latency, and hallucinations.”

Python Pandas NumPy Scikit-learn R SQL+132

View profile

Prateek Pravanjan

Screened

Junior Machine Learning Engineer specializing in LLM evaluation and GenAI pipelines

Remote1y exp

MercorStevens Institute of Technology

“LLM/agent engineer who built a production LangGraph multi-agent orchestrator connecting GitHub and APM/observability signals with a chain-of-verification loop for root-cause analysis. Emphasizes pragmatic architecture (start simple with state summaries), performance tuning (async LLM calls, Docker), and rigorous evaluation (LLM-as-judge, adversarial testing, hallucination/instruction adherence metrics, tool-call tracing) while iterating with non-technical stakeholders via A/B testing.”

PyTorch Transformers NumPy Scikit-learn Model evaluation Pandas+135

View profile

Ashwini Ramesh Kumar

Screened

Junior AI Software Engineer specializing in LLMs, RAG, and agent workflows

Remote1y exp

UMass Chan Medical SchoolUniversity of Massachusetts Amherst

“Backend/ML-leaning engineer who built a content-based event recommender for FlowMingle using embeddings + HNSW vector search on Google Cloud, with Firebase as the backend and a managed recommendation lifecycle (15 recs/user, daily async generation, weekly deletion) now serving 1500+ users. Also led a cost-driven migration of ConvAI services to Azure AI using parallel request testing from a Unity client, with post-migration monitoring via logs and model evals; contributed to a Massachusetts law-enforcement conversation analysis system by expanding ingestion to PDF/TXT/Excel and multi-file inputs.”

Python C++SQL PL/SQL Git Docker+112

View profile

Shivam Goel

Screened

Senior Robotics Researcher specializing in neurosymbolic robot learning and manipulation

Medford, MA9y exp

Tufts UniversityTufts University

“Robotics software researcher who led a Boston Dynamics SPOT project on non-prehensile manipulation of heavy boxes, combining MuJoCo-based RL, ViT-based perception, and SPOT SDK control; the work is under review for ICRA 2026. Also built a ROS planning-and-learning stack on a LoCoBot using PDDL task planning, RTAB-Map SLAM, MoveIt motion planning, and RL to recover from execution failures.”

Reinforcement Learning Computer Vision Python C++PyTorch TensorFlow+69

View profile

Machine Learning Engineers Software Engineers Research Assistants Data Scientists Teaching Assistants AI Engineers AI & Machine Learning Engineering Education Data & Analytics

Need someone specific?

AI Search

Related

Need someone specific?