Pre-screened and vetted.
Senior Data Scientist specializing in healthcare analytics and scalable ML pipelines
Senior Full-Stack & AI Engineer specializing in LLM integrations and cloud-native systems
“Backend/data engineer with hands-on production experience building FastAPI Python APIs and AWS-native platforms (Lambda/API Gateway, SQS, ECS Fargate) with Terraform + GitHub Actions CI/CD and strong reliability practices (JWT/RBAC, retries/timeouts, structured errors/logging). Also built AWS Glue ETL pipelines (S3/RDS to curated S3/Athena) with schema evolution and data quality controls, modernized legacy processing via parallel-run validation and phased cutovers, and has demonstrated SQL tuning impact (seconds to <200ms) plus incident ownership for batch pipeline SLAs.”
Mid-level AI/ML Engineer specializing in agentic AI and production ML systems
“ML/AI engineer with hands-on experience shipping production computer vision and GenAI systems, including a fabric defect detection platform that combined vision models with agentic LLM workflows to reach 89% human-inspector agreement at 200 ms latency. Also built a RAG-based code QA tool for developers and emphasizes production monitoring, evaluation, caching, and reusable Python service design.”
Mid-level AI/ML Engineer specializing in LLMs, NLP, and analytics automation
“AI/ML Engineer (TCS) who built and deployed a production LLM-powered audit transaction validation service to reduce manual review of unstructured transaction records and comments. Implemented a LangChain/Python pipeline for extraction/normalization and discrepancy detection, with strong production reliability practices (decision logging, dashboards, labeled eval sets) and a human-in-the-loop auditor feedback loop to improve precision/recall under strict data-sensitivity and near-real-time constraints.”
Mid-level AI/ML Engineer specializing in LLM systems and cloud MLOps
“Built a production LLM-powered fraud detection platform at Wells Fargo, combining OpenAI/Hugging Face models with RAG-based explanations to make flagged transactions interpretable for risk and compliance teams. Delivered low-latency, real-time inference at high scale on AWS (SageMaker + EKS), with strong observability and security controls, reducing manual reviews and false positives in a regulated environment.”
Mid-level Software Engineer specializing in AI platforms and full-stack systems
“Built and shipped a production AI-powered Q&A/RAG onboarding assistant at One Community Global that unified knowledge across Notion, Google Docs, and Slack, cutting volunteer onboarding time by 45%. Demonstrates strong end-to-end ownership: LangChain agent orchestration integrated into a FastAPI backend, rigorous evaluation (200-query dataset, ~85% accuracy), and production feedback/monitoring with source-attributed answers to build user trust.”
Mid-level Software Engineer specializing in AI backend and FinTech
Mid-level AI/ML Engineer specializing in MLOps and production ML systems
“Backend/ML engineer who has shipped high-scale real-time systems across e-commerce and healthcare: built a PharmEasy real-time recommendation engine for ~2M monthly users (cut feature latency 5 min→30 sec; +15% cross-sell) and architected a HIPAA-compliant multimodal clinical diagnostic workflow (DICOM+EHR) with XAI, MLOps (MLflow/Airflow/K8s), and drift/monitoring guardrails supporting 10k+ daily predictions.”
Junior Software Engineer specializing in AI, LLM systems, and full-stack development
“Product-focused full-stack engineer at startup (Zippy) who shipped a production multi-agent AI system for restaurant operations plus payments workflows. Built end-to-end: RAG grounded on a Notion knowledge base, structured function-calling task routing, FastAPI/JWT multi-tenant backend, and a polished React+TypeScript owner dashboard. Has real production incident experience (duplicate Stripe webhooks) and reports ~94% task-routing accuracy under load.”
Intern LLM/GenAI Engineer specializing in RAG, agentic systems, and low-latency inference
“Interned at Larsen & Toubro where they built and deployed an agentic RAG document question-answering system to reduce time spent searching documents and improve trustworthiness. Implemented ReAct-style multi-step orchestration with LangChain/LlamaIndex plus evidence-bounded generation, grounding/citations, and rigorous evaluation—cutting latency ~40%, hallucinations ~35%, and unsafe outputs ~40% while collaborating closely with non-technical business/ops stakeholders.”
Mid-level Robotics & ML Engineer specializing in perception, control, and scalable systems
“Robotics software engineer/researcher focused on perception, SLAM, and sensor fusion, with hands-on experience taking systems from simulation to embedded/real-time deployment. Led transparent-surface (glass) detection using GDNet and achieved a major real-time speedup (~7–9 FPS to ~30 FPS) while preserving >90% recall, and has built ROS-based EKF GPS-IMU fusion plus profiled/optimized Visual SLAM for performance and memory stability. Also brings production-style deployment skills via Docker/Kubernetes orchestration of ML inference services with autoscaling and model update rollouts.”
Mid-Level Software Engineer specializing in embedded RTOS and applied AI
“Master’s student and Deep Learning teaching assistant who teaches LLM/VLM fine-tuning (including LoRA) and built a Hugging Face LLM fine-tuned for unit conversion, improving reliability by analyzing synthetic data and filling missing number-system conversion examples. Also implemented the Raft consensus protocol using gRPC in a distributed systems course with correctness validated by unit tests.”
Mid-level AI/ML Engineer specializing in GenAI and predictive modeling
“Built and deployed a GPT-4-powered medical assistant for clinical staff to reduce time spent searching guidelines and EHR information, with a strong emphasis on safety and compliance. Uses strict RAG, confidence thresholds, and fallback behaviors to prevent hallucinations, and runs production-grade workflows orchestrated with LangChain/LangGraph plus Docker/Kubernetes/MLflow and monitoring for reliability and cost.”
Senior Software Engineer specializing in cloud-native microservices and healthcare integrations
“Backend engineer at Cerebrone.ai building cloud-native Flask microservices for an AI-driven automation platform on GCP (Cloud Run/App Engine), including dedicated inference services integrating OpenAI and internal ML pipelines. Demonstrated strong performance and scalability wins across Postgres/SQLAlchemy optimization, multi-tenant (healthcare/HIPAA-grade) data isolation, and high-throughput background processing with Celery/Redis/RabbitMQ, with multiple quantified latency/CPU/throughput improvements.”
Junior Applied AI Engineer specializing in LLMs, RAG, and agentic systems
“Co-founded a healthcare AI startup building and deploying software directly with end users, emphasizing rapid shipping, deep user interviews, and workflow-first adoption. Has hands-on production deployment experience on AWS (including diagnosing a silent AWS App Runner failure caused by an ARM vs amd64 Docker build mismatch) and is motivated by customer-facing, travel-heavy roles to keep engineering tightly connected to real-world usage.”
Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps
“Built a production RAG-based “unified assistant” that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.”
Mid-level AI/ML Engineer specializing in Generative AI and healthcare data
“Built and deployed a production RAG-based document Q&A system on Azure OpenAI to help business teams search thousands of PDFs/Word files, using Qdrant vector search, MongoDB, and a Flask API. Demonstrates strong production engineering (streaming large-file ingestion, parallel preprocessing, monitoring/retries) plus systematic prompt/embedding/chunking experimentation to improve accuracy and reduce hallucinations, and has hands-on orchestration experience with ADF/Airflow/Databricks/Synapse.”
Mid-level Data Scientist specializing in ML, MLOps, and Generative AI
“ML/NLP engineer who built a RAG-based technical assistant for Caterpillar field engineers, transforming PDF keyword search into intent-based semantic retrieval across manuals, logs, sensor reports, and technician notes. Strong in productionizing data/ML systems (Airflow, PySpark) with rigorous preprocessing, entity resolution, and evaluation—delivering measurable gains in accuracy, relevance, and duplicate reduction.”
Mid-Level Software Engineer specializing in backend systems and cloud-native platforms
“Software engineer with experience across TCS, Rakuten, and USC who has owned production integrations and data pipelines end-to-end. Notably improved a trading platform payment flow by replacing fragile polling with a webhook-driven status system with robust fallbacks, and has shipped LLM-assisted design-to-webpage automation plus evaluation-driven prompt iteration (NYT Connections).”
Mid-level Backend Software Engineer specializing in distributed microservices
“Internship at ActiveVM where they tackled large-scale Spring Boot 2→3/library migrations across hundreds of downstream products by combining OpenRewrite (AST-based recipes) with an LLM/RAG-based classifier that routed risky files to human experts. Reported ~70% reduction in manual effort and 90%+ accuracy after testing across multiple branches and cutovers; also built a CTR-driven book recommendation capstone showcased at the Google office in Cambridge.”
Junior Software Engineer specializing in cloud microservices and database systems
“Grad student who co-developed a safety-oriented mental health LLM consulting agent using RAG + Gemini and Hugging Face emotion detection to assess user crisis level and adapt responses. Implemented a key reliability improvement for CRISIS scenarios by bypassing generative output and returning direct, emotionless, knowledge-base guidance to seek immediate real-world help.”
Senior AI/ML Engineer specializing in Generative AI and RAG
“ML/NLP practitioner at Morf Health focused on unifying fragmented healthcare data by linking structured patient/encounter records with unstructured clinical notes. Has hands-on experience with transformer embeddings, vector databases, and domain fine-tuning, plus rigorous evaluation (precision/recall) and human-in-the-loop validation with clinical SMEs to make pipelines production-grade.”
Mid-Level Full-Stack Software Engineer specializing in cloud, distributed systems, and LLM apps
“Built and owned a hackathon project (Gritto) with a Python/FastAPI backend that routes user text through a sequence of Gemini agents to produce structured JSON outputs. Has hands-on production deployment experience using Docker/Docker Compose, GitHub Actions CI/CD, AWS App Runner, MongoDB, and secrets management (Doppler + migration to AWS Secrets Manager), plus implemented a chat-like experience via multiple HTTP requests when SSE wasn’t viable.”