“Research Extern at Google DeepMind and former AWS Software Development Engineer Intern with a strong focus on practical, trustworthy AI engineering. Built a multi-agent RAG system for personalized news headline generation using a fine-tuned Flan-T5 model, parallel critic agents, FAISS retrieval, and style embeddings, while also leading a 3-person team on the project.”

Python Java SQL JavaScript R PyTorch+102

View profile

Chanakya rudru

Screened

Senior Machine Learning Engineer specializing in conversational AI and Generative AI

San Francisco, CA6y exp

Scale AIDallas Baptist University

“ML/AI engineer with experience at Uber and Scale AI, focused on customer service automation across both classical NLP and generative AI systems. Has owned systems from experimentation through production on AWS, including LLM fine-tuning, RAG optimization, safety evaluation, and internal Python platform tooling that improved consistency and engineering velocity.”

Python Java C++R JavaScript TypeScript+111

View profile

Yashshree Patil

Screened

Mid-Level Software Development Engineer specializing in full-stack systems and ML

Seattle, WA3y exp

Amazon Web ServicesWestcliff University

“AWS engineer who productionized an internal ML-driven data pipeline from a notebook prototype into a scalable, observable Python service (schema validation, deduplication, idempotency, safe retries, versioned transforms, CloudWatch alarms), reducing manual effort and improving data accuracy/trust. Experienced diagnosing workflow issues in real time (e.g., upstream schema changes) and partnering with account managers/support to unblock adoption of seller-facing Marketplace features by demonstrating reliability with concrete metrics.”

Python Java C++SQL Ruby C+92

View profile

Kella Dhanush Venkata Sai

Screened

Junior ML Engineer specializing in Generative AI and LLM applications

Thousand Oaks, California3y exp

NVIDIACalifornia Lutheran University

“Built a production internal knowledge assistant using a RAG pipeline over large spreadsheets, PDFs, and support documents, using transformer embeddings stored in FAISS. Focused on real-world production challenges—format normalization, retrieval quality, hallucination reduction (context-only + citations), and latency—using hybrid retrieval, quantization, and containerized deployment, and communicated the workflow to non-technical stakeholders using simple analogies.”

Python NumPy Pandas Scikit-Learn Matplotlib Seaborn+95

View profile

Shardul Jeurkar

Screened

Intern Applied AI/Software Engineer specializing in computer vision and full-stack platforms

San Francisco Bay Area, CA1y exp

BoschCarnegie Mellon University

“Built production LLM systems focused on reliability and safety, including a plain-English deployment tool that generates validated plans and provisions to Kubernetes while preventing unsafe actions via schema enforcement and plan/execute separation. Also created multi-LLM workflows (LangGraph) and stakeholder-friendly demos at Bosch, including a PyQt/FastAPI/CUDA app comparing SAM2 vs SAMWISE for on-device object detection with intuitive UX for business users.”

Agile Automation Bash C C++Data ingestion+90

View profile

Chappidi Sasi

Screened

Mid-level Machine Learning Engineer specializing in GPU-accelerated LLM training and inference

Bay Area, CA5y exp

NVIDIAWebster University

“ML/LLM engineer with production experience building a multi-GPU LLM inference platform using TensorRT and vLLM, achieving ~40% p95 latency reduction through batching/KV caching, quantization, and CUDA/runtime tuning. Also has end-to-end orchestration experience (Kubernetes, Airflow) and has delivered real-time fraud detection systems at Accenture in close collaboration with non-technical risk and product stakeholders.”

A/B Testing Apache Spark AWS AWS Lambda BigQuery Claude+141

View profile

Yash Jajoo

Screened

Senior Software Engineer specializing in AI and FinTech platforms

New York City, NY8y exp

Walter AINew York University

“Built a production LLM pipeline at Walter AI that scans massive user inboxes, identifies financial newsletters, and extracts trading strategies into structured JSON for downstream paper-trading workflows. Stands out for combining agent architecture with strong production discipline—cutting scan time from 20 to 5 minutes, reducing LLM costs by 90%, and achieving 3-second P99 latency while handling messy, inconsistent email data at scale.”

Python JavaScript TypeScript Java FastAPI Flask+96

View profile

Samuel Prabhakar Vara

Screened

Mid-level AI Engineer specializing in machine learning and healthcare research

Philadelphia, PA4y exp

The Wharton School, University of PennsylvaniaUniversity of Pennsylvania

“Backend engineer with end-to-end ownership of scientific and AI-powered systems, including neuron imaging pipelines at Monell Chemical Senses Center and an LLM-based structured information extraction platform for Wharton and PSG. Stands out for turning messy, compute-heavy workflows into reliable production backends with measurable impact, including saving researchers over 50 hours per week.”

Python Django Flask Java SQL MySQL+96

View profile

Machine Learning Engineers Software Engineers Data Scientists Research Assistants Software Developers AI Engineers AI & Machine Learning Engineering Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?