“Candidate is deeply focused on AI-native software development, using a deliberate planner/implementer agent workflow with tools like Cursor, Claude, and Kimi. They also built a personal project called Config Proctor, an AI-agent-driven Terraform/AWS self-healing system that identifies infrastructure configuration gaps and proposes fixes.”

Python TypeScript Java JavaScript Bash REST APIs+175

View profile

Yuan-Hsuan Wen

Screened

Intern Software Engineer specializing in AI agents, RAG pipelines, and semiconductor systems

Taipei, Taiwan3y exp

NVIDIAUSC

“Built a web-based interface that connects an internal bug system to an LLM for initial debugging and issue classification, aiming to boost QA and software engineer efficiency while balancing latency and accuracy. Worked as a one-person project and managed constraints like limited hardware and difficulty extracting team debugging context, relying on manager communication and rapid modeling to validate direction.”

Machine Learning Artificial Intelligence LangChain TensorFlow PyTorch Python+59

View profile

Rana Taki

Screened

Junior Mechanical Engineering & Software Developer specializing in aviation autonomy and retrieval systems

Stanford, CA2y exp

Stanford UniversityStanford University

“Robotics/embedded builder who trained an aviation-specific LLM and deployed it offline on an NVIDIA Jetson for an in-flight voice assistant, solving performance and cabling constraints with NVMe storage and Bluetooth. Also has hands-on Raspberry Pi/Arduino robot builds (including a cigarette-butt picking prototype with hydraulic actuation) plus Docker-based FEA work using FEniCS/Gmsh and strong CI/CD + automated testing practices.”

Python C C++MATLAB JavaScript Swift+97

View profile

Kenil Tanna

Screened

Staff-level Machine Learning Engineer specializing in LLMs and MLOps for Financial Services

New York, NY7y exp

JPMorgan ChaseIIT Guwahati

“Machine learning/NLP practitioner at J.P. Morgan who led development of a production RAG system and an entity resolution pipeline for complex financial data. Deep hands-on experience with embeddings (Sentence-BERT), vector search (FAISS/pgvector), LLM fine-tuning (LoRA/PEFT), and rigorous evaluation (human-in-the-loop + A/B testing) backed by strong MLOps on AWS (Docker/Kubernetes, MLflow, Prometheus/Datadog).”

Python R SQL JavaScript REST APIs gRPC+124

View profile

Sai supriya

Screened

Mid-level AI/ML Engineer specializing in LLM alignment, safety, and scalable inference

St. Louis, MO7y exp

AnthropicSaint Louis University

“Built and productionized an AWS-hosted, Kubernetes-orchestrated RAG assistant that enables natural-language Q&A over internal document repositories with grounded answers and citations. Demonstrates strong applied LLM engineering: hallucination mitigation, hybrid retrieval + re-ranking, and rigorous evaluation via benchmarks and A/B testing, plus real-world scaling of compute-heavy inference with dynamic batching and monitoring.”

Apache Spark AWS CI/CD Data Ingestion Data Pipelines Data Preprocessing+127

View profile

Nishitha Thummala

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference

San Francisco, CA6y exp

PerplexityUniversity of Nebraska Omaha

“Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.”

Python FastAPI Flask Django gRPC JavaScript+167

View profile

Kowshika M

Screened

Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety

Santa Clara, CA5y exp

NVIDIAOregon State University

“AI/LLM engineer with production experience at NVIDIA, where they fine-tuned and deployed a financial-services chatbot and cut latency ~50% using TensorRT + NVIDIA Triton, scaling via Docker/Kubernetes. Also has consulting experience at Accenture delivering a predictive maintenance solution for a logistics network, bridging non-technical stakeholders with actionable dashboards.”

A/B Testing Ansible Apache Kafka Apache Spark Automated Testing AWS+113

View profile

Geetika Jain

Screened

Mid-Level Software Engineer specializing in Azure AI and full-stack development

Park City, UT6y exp

NICEUniversity of Texas at Dallas

“Hands-on AI/LLM engineer who built a RAG-based product feature end-to-end, including prompt engineering, safety guardrails, and an automated adversarial + load-testing harness. Diagnosed real production issues (null responses) via Azure logs/metrics and drove an architectural fix by separating model deployments to address token/quota limits. Also runs internal developer enablement through short theory-to-hands-on AI workshops after completing a Microsoft AI certification.”

C#Java TypeScript PowerShell Kotlin HTML+67

View profile

Dilpreet Singh

Screened

Executive CTO and Founder specializing in AI platforms and hyper-scale SaaS

South San Francisco, CA26y exp

Deep OriginUC Berkeley

“CTO-minded builder seeking to join a startup; previously created an AI-driven platform that abstracted away DevOps and infrastructure for drug discovery researchers. Emphasizes high-leverage, zero-to-one execution with managed cloud/open-source tooling, and a strong reliability/reproducibility mindset validated against existing scientific pipelines.”

Agentic AI Large Language Models (LLMs)LangChain Retrieval-Augmented Generation (RAG)Machine learning Predictive modeling+128

View profile

Nikhil Reddy

Screened

Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms

San Francisco, CA5y exp

NVIDIASaint Louis University

“Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.”

Python Java Spring Boot JavaScript TypeScript React+129

View profile

Krishna Reddy

Screened

Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants

New York, NY6y exp

StripeIndiana Wesleyan University

“Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.”

Agile Amazon Bedrock Apache Hadoop Apache Hive Apache Kafka Apache Spark+143

View profile

Anagha Ram

Screened

Intern AI/ML Engineer specializing in NLP, LLMs, and semantic search

Los Altos, CA2y exp

Columbia UniversityCornell University

“Built and deployed a production RAG-based semantic search and summarization system for large legal/technical document sets, owning the full backend (embeddings, vector store, chunking, prompting) and driving a reported 40–60% reduction in manual review time. Experienced with LangChain/LlamaIndex plus Airflow/Temporal-style orchestration, and applies rigorous evaluation/monitoring (A/B tests, drift detection, staged rollouts) to keep agentic systems reliable. Also partnered with a supply-chain manager at TE Connectivity to deliver an AI inventory recommendation tool projected to drive millions in value.”

Anomaly Detection AWS C Data Structures Django Generative AI+123

View profile

Software Engineers Machine Learning Engineers Data Scientists AI Engineers Research Assistants Software Developers AI & Machine Learning Engineering Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?