Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in LLM systems and cloud MLOps
“Built a production LLM-powered fraud detection platform at Wells Fargo, combining OpenAI/Hugging Face models with RAG-based explanations to make flagged transactions interpretable for risk and compliance teams. Delivered low-latency, real-time inference at high scale on AWS (SageMaker + EKS), with strong observability and security controls, reducing manual reviews and false positives in a regulated environment.”
Mid-level Software Engineer specializing in Agentic AI and RAG systems
“Built and shipped a production AI-powered Q&A/RAG onboarding assistant at One Community Global that unified knowledge across Notion, Google Docs, and Slack, cutting volunteer onboarding time by 45%. Demonstrates strong end-to-end ownership: LangChain agent orchestration integrated into a FastAPI backend, rigorous evaluation (200-query dataset, ~85% accuracy), and production feedback/monitoring with source-attributed answers to build user trust.”
Mid-level AI/ML Engineer specializing in MLOps and production ML systems
“Backend/ML engineer who has shipped high-scale real-time systems across e-commerce and healthcare: built a PharmEasy real-time recommendation engine for ~2M monthly users (cut feature latency 5 min→30 sec; +15% cross-sell) and architected a HIPAA-compliant multimodal clinical diagnostic workflow (DICOM+EHR) with XAI, MLOps (MLflow/Airflow/K8s), and drift/monitoring guardrails supporting 10k+ daily predictions.”
Junior Software Engineer specializing in AI, LLM systems, and full-stack development
“Product-focused full-stack engineer at startup (Zippy) who shipped a production multi-agent AI system for restaurant operations plus payments workflows. Built end-to-end: RAG grounded on a Notion knowledge base, structured function-calling task routing, FastAPI/JWT multi-tenant backend, and a polished React+TypeScript owner dashboard. Has real production incident experience (duplicate Stripe webhooks) and reports ~94% task-routing accuracy under load.”
Intern LLM/GenAI Engineer specializing in RAG, agentic systems, and low-latency inference
“Interned at Larsen & Toubro where they built and deployed an agentic RAG document question-answering system to reduce time spent searching documents and improve trustworthiness. Implemented ReAct-style multi-step orchestration with LangChain/LlamaIndex plus evidence-bounded generation, grounding/citations, and rigorous evaluation—cutting latency ~40%, hallucinations ~35%, and unsafe outputs ~40% while collaborating closely with non-technical business/ops stakeholders.”
Mid-level Robotics & ML Engineer specializing in perception, control, and scalable systems
“Robotics software engineer/researcher focused on perception, SLAM, and sensor fusion, with hands-on experience taking systems from simulation to embedded/real-time deployment. Led transparent-surface (glass) detection using GDNet and achieved a major real-time speedup (~7–9 FPS to ~30 FPS) while preserving >90% recall, and has built ROS-based EKF GPS-IMU fusion plus profiled/optimized Visual SLAM for performance and memory stability. Also brings production-style deployment skills via Docker/Kubernetes orchestration of ML inference services with autoscaling and model update rollouts.”
Mid-Level Software Engineer specializing in embedded RTOS and applied AI
“Master’s student and Deep Learning teaching assistant who teaches LLM/VLM fine-tuning (including LoRA) and built a Hugging Face LLM fine-tuned for unit conversion, improving reliability by analyzing synthetic data and filling missing number-system conversion examples. Also implemented the Raft consensus protocol using gRPC in a distributed systems course with correctness validated by unit tests.”
Mid-level AI/ML Engineer specializing in GenAI and predictive modeling
“Built and deployed a GPT-4-powered medical assistant for clinical staff to reduce time spent searching guidelines and EHR information, with a strong emphasis on safety and compliance. Uses strict RAG, confidence thresholds, and fallback behaviors to prevent hallucinations, and runs production-grade workflows orchestrated with LangChain/LangGraph plus Docker/Kubernetes/MLflow and monitoring for reliability and cost.”
Intern Data Scientist specializing in ML, NLP, and MLOps for healthcare and enterprise AI
“Built a production multi-cloud LLM-driven IT ticket automation system using LangGraph, Azure + Pinecone RAG, and an Ollama-hosted LLM on AWS, with Terraform-managed infra and PostgreSQL audit/state tracking for reliability. Also partnered with UW School of Medicine & Public Health students to deliver a glioma survival risk-ranking model, translating clinical feedback into practical pipeline improvements (imputation, site harmonization) and stakeholder-friendly visualizations.”
Mid-level Data Scientist specializing in AI/ML, MLOps, and LLM-powered analytics
“Built and deployed a production LLM-powered document Q&A system enabling natural-language querying of large PDFs, focusing on retrieval quality (overlapped chunking) and low-latency performance (optimized embeddings + vector search). Experienced with scaling ML/LLM workflows using async/batch processing, caching, cloud storage, and orchestration via Apache Airflow with robust testing, monitoring, and failure handling.”
Mid-level Cloud Engineer specializing in AWS & Azure infrastructure automation
“Backend/platform engineer (American Express) who built a Flask-based orchestration layer to automate infrastructure provisioning and integrated Azure AD/JWT RBAC security. Strong in PostgreSQL/SQLAlchemy performance optimization (70%+ query-time reduction) and scalable async/event-driven architectures, including ML inference pipelines (SageMaker/Azure ML/Hugging Face) and high-throughput job queues (Celery/Redis) with reliability patterns like DLQs and idempotency.”
Senior Software Engineer specializing in cloud-native microservices and healthcare integrations
“Backend engineer at Cerebrone.ai building cloud-native Flask microservices for an AI-driven automation platform on GCP (Cloud Run/App Engine), including dedicated inference services integrating OpenAI and internal ML pipelines. Demonstrated strong performance and scalability wins across Postgres/SQLAlchemy optimization, multi-tenant (healthcare/HIPAA-grade) data isolation, and high-throughput background processing with Celery/Redis/RabbitMQ, with multiple quantified latency/CPU/throughput improvements.”
Junior Applied AI Engineer specializing in LLMs, RAG, and agentic systems
“Co-founded a healthcare AI startup building and deploying software directly with end users, emphasizing rapid shipping, deep user interviews, and workflow-first adoption. Has hands-on production deployment experience on AWS (including diagnosing a silent AWS App Runner failure caused by an ARM vs amd64 Docker build mismatch) and is motivated by customer-facing, travel-heavy roles to keep engineering tightly connected to real-world usage.”
Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps
“Built a production RAG-based “unified assistant” that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.”
Mid-level AI/ML Engineer specializing in Generative AI and healthcare data
“Built and deployed a production RAG-based document Q&A system on Azure OpenAI to help business teams search thousands of PDFs/Word files, using Qdrant vector search, MongoDB, and a Flask API. Demonstrates strong production engineering (streaming large-file ingestion, parallel preprocessing, monitoring/retries) plus systematic prompt/embedding/chunking experimentation to improve accuracy and reduce hallucinations, and has hands-on orchestration experience with ADF/Airflow/Databricks/Synapse.”
Mid-level Data Scientist specializing in ML, MLOps, and Generative AI
“ML/NLP engineer who built a RAG-based technical assistant for Caterpillar field engineers, transforming PDF keyword search into intent-based semantic retrieval across manuals, logs, sensor reports, and technician notes. Strong in productionizing data/ML systems (Airflow, PySpark) with rigorous preprocessing, entity resolution, and evaluation—delivering measurable gains in accuracy, relevance, and duplicate reduction.”
Senior AI/ML Engineer specializing in Generative AI and RAG
“ML/NLP practitioner at Morf Health focused on unifying fragmented healthcare data by linking structured patient/encounter records with unstructured clinical notes. Has hands-on experience with transformer embeddings, vector databases, and domain fine-tuning, plus rigorous evaluation (precision/recall) and human-in-the-loop validation with clinical SMEs to make pipelines production-grade.”
Mid-Level Software Engineer specializing in backend systems and cloud-native platforms
“Software engineer with experience across TCS, Rakuten, and USC who has owned production integrations and data pipelines end-to-end. Notably improved a trading platform payment flow by replacing fragile polling with a webhook-driven status system with robust fallbacks, and has shipped LLM-assisted design-to-webpage automation plus evaluation-driven prompt iteration (NYT Connections).”
Mid-level Backend Software Engineer specializing in distributed microservices
“Internship at ActiveVM where they tackled large-scale Spring Boot 2→3/library migrations across hundreds of downstream products by combining OpenRewrite (AST-based recipes) with an LLM/RAG-based classifier that routed risky files to human experts. Reported ~70% reduction in manual effort and 90%+ accuracy after testing across multiple branches and cutovers; also built a CTR-driven book recommendation capstone showcased at the Google office in Cambridge.”
Mid-level Full-Stack Software Engineer specializing in Generative AI
“Full-stack engineer who shipped an end-to-end speech capability for an LLM chatbot UI, integrating OpenAI APIs and publishing via Google Apigee with client documentation. Has experience operating deployments with Jenkins/Kubernetes/Docker and monitoring with Datadog, and has worked in an innovation-center environment building rapid prototypes under ambiguity with tight stakeholder feedback loops.”
“Backend engineer focused on productionizing LLM systems: built a FastAPI-based RAG and multi-agent automation platform deployed with Docker/Kubernetes, prioritizing safe execution and reduced hallucinations. Experienced in refactoring monolithic ML services with feature-flagged incremental rollouts, and implementing JWT/RBAC plus row-level security (e.g., Supabase) for secure, scalable APIs.”
Senior Java Full-Stack & DevOps Engineer specializing in cloud-native microservices
“Software engineer with a CS/Computer Engineering background who has worked on ML/NLP (Hugging Face, clinical NLP, text generation and structured extraction) and has a school robotics project integrating a trained ML model with microprocessor-controlled hardware to drive motor movement and writing. Currently focused on building and deploying applications and ML models to AWS/Azure using Docker, Kubernetes, and CI/CD; targeting ~$150K compensation.”
Mid-level Software Engineer specializing in systems, cloud, and applied machine learning
“Robotics software engineer focused on ROS 2 localization/SLAM: built a particle-filter (Monte Carlo) localization system in Python with likelihood-field modeling to handle noisy LiDAR and dynamic environments. Strong in debugging ROS 2 integration issues (tf2 frame sync, DDS/QoS message reliability) and in profiling/optimizing pipelines to reach real-time performance (~10 Hz) using precomputation and KD-trees.”
Junior Machine Learning Engineer specializing in computer vision and LLM applications
“Built and led an autonomous driving software effort for Formula Student, owning the full autonomy stack (perception, planning, control) orchestrated in ROS. Implemented stereo depth + YOLO object detection, RRT/RRT* planning, and a robust SLAM pipeline (Kalman filter, submapping) while leveraging Gazebo simulation and modern deployment tooling (Docker/Kubernetes, AWS, GitHub Actions CI/CD).”