“Robotics software engineer (master’s student) who placed 3rd in the CMU VLA challenge and presented at IROS, building an LLM-powered language system (Gemini 2.5) for mobile-robot scene Q&A and language-based navigation. Hands-on ROS1/ROS2 experience including ros2_control + PILZ planning for a KUKA arm, plus simulation (Gazebo) and containerized submissions with Docker.”

Python C C++MATLAB PyTorch TensorFlow+98

View profile

LuYao Chen

Screened

Junior Software/ML Engineer specializing in AI systems, cloud infrastructure, and applied research

Los Angeles, CA3y exp

University of Southern CaliforniaUSC

“Backend/infra-focused engineer with experience spanning Go-based MCP servers for an AI-assisted Kubernetes on-call diagnosis chatbot and a Python/Flask PagerDuty automation integration. Previously at Tesla, optimized high-volume battery test data in PostgreSQL using JSONB, partitioning, and a timestamp normalization pipeline; also built PyTorch PINN training workflows and achieved a 20x speedup via batch vectorization.”

Python Go C C++TypeScript SQL+57

View profile

Nishitha Thummala

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference

San Francisco, CA6y exp

PerplexityUniversity of Nebraska Omaha

“Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.”

Python FastAPI Flask Django gRPC JavaScript+167

View profile

Kowshika M

Screened

Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety

Santa Clara, CA5y exp

NVIDIAOregon State University

“AI/LLM engineer with production experience at NVIDIA, where they fine-tuned and deployed a financial-services chatbot and cut latency ~50% using TensorRT + NVIDIA Triton, scaling via Docker/Kubernetes. Also has consulting experience at Accenture delivering a predictive maintenance solution for a logistics network, bridging non-technical stakeholders with actionable dashboards.”

A/B Testing Ansible Apache Kafka Apache Spark Automated Testing AWS+113

View profile

Nikhil Reddy

Screened

Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms

San Francisco, CA5y exp

NVIDIASaint Louis University

“Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.”

Python Java Spring Boot JavaScript TypeScript React+129

View profile

Krishna Sahith Poruri

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

CA, USA4y exp

AnthropicCalifornia State University, Long Beach

“ML/LLM engineer who built a production RAG system (GPT-4 + FAISS + FastAPI) to deliver fast, grounded answers from proprietary documents, optimizing for sub-200ms latency and high-concurrency scale. Strong MLOps/observability background: drift monitoring with Prometheus + Streamlit, automated retraining via Airflow, Kubernetes autoscaling, and MLflow-managed model lifecycle, plus inference cost reduction through quantization and structured pruning.”

Python SQL R C++Git Classification+101

View profile