Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in LLMs, NLP, and analytics automation
“AI/ML Engineer (TCS) who built and deployed a production LLM-powered audit transaction validation service to reduce manual review of unstructured transaction records and comments. Implemented a LangChain/Python pipeline for extraction/normalization and discrepancy detection, with strong production reliability practices (decision logging, dashboards, labeled eval sets) and a human-in-the-loop auditor feedback loop to improve precision/recall under strict data-sensitivity and near-real-time constraints.”
Director-level Technology Leader specializing in cloud-native platforms, AI/ML, and SaaS
“Engineering leader (Director/VP level) who has repeatedly aligned product and engineering through ROI-driven quarterly roadmaps and strong stakeholder communication, including board presentations. Built a parallel cloud team to migrate an on-prem product to the cloud, credited with delivering $9M ARR, and led a Python monolith-to-serverless event-driven microservices transformation. Currently manages distributed teams across Mexico, India, and the US using pod-based structures, clear KPIs, and a supportive accountability culture.”
Mid-level Software Engineer specializing in Agentic AI and RAG systems
“Built and shipped a production AI-powered Q&A/RAG onboarding assistant at One Community Global that unified knowledge across Notion, Google Docs, and Slack, cutting volunteer onboarding time by 45%. Demonstrates strong end-to-end ownership: LangChain agent orchestration integrated into a FastAPI backend, rigorous evaluation (200-query dataset, ~85% accuracy), and production feedback/monitoring with source-attributed answers to build user trust.”
Mid-level AI Engineer specializing in GenAI and RAG systems
“AI engineer who built a production e-commerce system that analyzes product images alongside sales and demographic data to generate actionable creative recommendations, now used by 20+ clients. Also built orchestrated document/agent pipelines (Airflow, LangGraph) including a compliance drift detector auditing 401 compliance documents, with an emphasis on traceability, logging, and production integration.”
Mid-level Software Engineer specializing in SRE, observability, and LLM-powered automation
Intern Software Engineer specializing in full-stack development and applied AI
“Internship experience building an end-to-end medical AI pipeline that extracts and normalizes messy medical PDFs, fine-tunes BioBERT to classify tumor-related statements (including negation/ambiguity handling), and integrates image-model outputs (MedSAM/GroundingDINO) for tumor localization and classification. Also worked on an LLM/RAG system to draft IPO prospectuses using retrieved regulatory/financial sources (including SEC EDGAR) with structured prompts to reduce hallucinations.”
Intern LLM/GenAI Engineer specializing in RAG, agentic systems, and low-latency inference
“Interned at Larsen & Toubro where they built and deployed an agentic RAG document question-answering system to reduce time spent searching documents and improve trustworthiness. Implemented ReAct-style multi-step orchestration with LangChain/LlamaIndex plus evidence-bounded generation, grounding/citations, and rigorous evaluation—cutting latency ~40%, hallucinations ~35%, and unsafe outputs ~40% while collaborating closely with non-technical business/ops stakeholders.”
Senior AI Engineer specializing in production GenAI systems
“AI engineer who has shipped production LLM systems end-to-end, including a natural-language-to-SQL analytics copilot for career advisors that achieved ~95% query success through schema grounding, access controls, and automated regression testing with golden queries. Also builds LangGraph-orchestrated multi-step agents (resume analysis, recommendations) and RAG pipelines (PDF ingestion + FAISS) and partners closely with non-technical users to drive adoption and trust.”
Junior AI/ML Engineer specializing in anomaly detection and LLM/RAG systems
“Built and productionized a tool-first, multi-agent framework that augments an anomaly detection model with domain context to generate trustworthy, evidence-backed anomaly explanations (including false-positive likelihood). Architected the platform to be model/orchestration/vectorDB agnostic (e.g., GPT + CrewAI + ChromaDB vs Claude + LangGraph + other vector DB) with strong performance, reliability, and OpenTelemetry-based observability. Also built a personal LangGraph-based "mock interviewer" agent that asynchronously fuses voice + live code input using state reducers, stop conditions, and fallback routing.”
Mid-level AI/ML Engineer specializing in GenAI and predictive modeling
“Built and deployed a GPT-4-powered medical assistant for clinical staff to reduce time spent searching guidelines and EHR information, with a strong emphasis on safety and compliance. Uses strict RAG, confidence thresholds, and fallback behaviors to prevent hallucinations, and runs production-grade workflows orchestrated with LangChain/LangGraph plus Docker/Kubernetes/MLflow and monitoring for reliability and cost.”
Intern Data Scientist specializing in ML, NLP, and MLOps for healthcare and enterprise AI
“Built a production multi-cloud LLM-driven IT ticket automation system using LangGraph, Azure + Pinecone RAG, and an Ollama-hosted LLM on AWS, with Terraform-managed infra and PostgreSQL audit/state tracking for reliability. Also partnered with UW School of Medicine & Public Health students to deliver a glioma survival risk-ranking model, translating clinical feedback into practical pipeline improvements (imputation, site harmonization) and stakeholder-friendly visualizations.”
Intern Software Engineer specializing in AI/LLMs and full-stack development
“AI/ML infrastructure-focused engineer who has built production RAG systems from scratch (Supabase/pgvector + OpenAI embeddings) and iterated using formal eval metrics to improve retrieval quality. Also debugged real-time audio issues in a LiveKit-based pipeline by correlating packet loss with VAD behavior, and has deep experience building brittle, customer-specific financial platform integrations in Python/Playwright (2FA, redirects, token refresh, rate limits).”
Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems
“LLM/ML engineer who has shipped an enterprise RAG-based Q&A system (LangChain/LlamaIndex, FAISS + Azure Cognitive Search, GPT-3.5/4 via OpenAI/Azure OpenAI) to production on Docker + Kubernetes/OpenShift, tackling hallucinations, retrieval quality, latency/cost, and RBAC/IAM security. Also partnered with operations leaders to turn manual reporting into an LLM-powered summarization and forecasting dashboard driven by real KPIs and iterative stakeholder feedback.”
Mid-level Data Scientist specializing in AI/ML, MLOps, and LLM-powered analytics
“Built and deployed a production LLM-powered document Q&A system enabling natural-language querying of large PDFs, focusing on retrieval quality (overlapped chunking) and low-latency performance (optimized embeddings + vector search). Experienced with scaling ML/LLM workflows using async/batch processing, caching, cloud storage, and orchestration via Apache Airflow with robust testing, monitoring, and failure handling.”
Mid-level Forward Deployed Engineer specializing in AI automation for finance and data platforms
“LLM/agentic workflow specialist with healthcare deployment experience who has taken LLM-based automation from prototype to production using operator-in-the-loop validation, RAG-style retrieval, RBAC, and monitoring for sensitive data compliance. Demonstrated real-time incident resolution (retrieval timeouts due to network/proxy misconfig) and strong GTM support—hands-on developer workshops and sales demos translating technical safeguards and real-time ETL into measurable ROI (70% ops reduction, ~$200K/year savings).”
Mid-level Solutions Architect / Full-Stack Developer specializing in LLM-enabled applications
“LLM/agentic systems practitioner focused on taking customer prototypes to production by hardening reliability (APIs, monitoring, security) and adding guardrails, evals, and incremental rollouts. Experienced diagnosing RAG/agent failures via structured tracing and fixing retrieval-quality issues (freshness checks, filters, schema enforcement). Also supports pre-sales by leading developer demos/workshops and building targeted POCs to address scalability/reliability objections and drive adoption.”
Mid-level AI Solutions Engineer specializing in enterprise GenAI and automation
“Built and shipped multiple production LLM/agentic systems, including an agentic RAG NL-to-SQL analytics app that cut manual reporting from 9 hours/week to 15 minutes by grounding on schema-aware retrieval and robust fallback/monitoring. Also implemented a LangChain supervisor-orchestrated enterprise IT automation agent that routes requests for search, identity validation, and action execution, and created a RAG search tool spanning Jira/Confluence/SharePoint for operations stakeholders.”
Mid-level AI/ML Engineer specializing in real-time anomaly detection and AI agents
“Built a production real-time anomaly detection platform for high-frequency trading at HSBC, using a streaming stack (Pulsar + Spark Structured Streaming + AWS Lambda) and a transformer-based model combining time-series and numerical signals. Experienced in MLOps and safe deployment (Kubernetes, canary releases, MLflow/Grafana monitoring) and in aligning model performance with risk/compliance expectations through SLA-driven tuning and stakeholder-friendly dashboards.”
Mid-level AI/ML Engineer specializing in healthcare, fraud detection, and recommender systems
“Healthcare-focused applied ML/LLM engineer who has deployed production systems including an LLM medical documentation assistant that summarizes unstructured EHR notes into physician-ready structured outputs. Experienced building secure, compliant pipelines (PHI minimization, RBAC, encryption) and scaling via Docker/Kubernetes/Azure ML, plus orchestrating ETL/ML workflows with Airflow and Kubeflow; also built an LLM-driven clinical coding assistant at Centene with measurable performance metrics.”
Mid-level AI/ML Engineer specializing in Generative AI and healthcare data
“Built and deployed a production RAG-based document Q&A system on Azure OpenAI to help business teams search thousands of PDFs/Word files, using Qdrant vector search, MongoDB, and a Flask API. Demonstrates strong production engineering (streaming large-file ingestion, parallel preprocessing, monitoring/retries) plus systematic prompt/embedding/chunking experimentation to improve accuracy and reduce hallucinations, and has hands-on orchestration experience with ADF/Airflow/Databricks/Synapse.”
Mid-level Data Scientist specializing in ML, MLOps, and Generative AI
“ML/NLP engineer who built a RAG-based technical assistant for Caterpillar field engineers, transforming PDF keyword search into intent-based semantic retrieval across manuals, logs, sensor reports, and technician notes. Strong in productionizing data/ML systems (Airflow, PySpark) with rigorous preprocessing, entity resolution, and evaluation—delivering measurable gains in accuracy, relevance, and duplicate reduction.”
Mid-level Full-Stack Engineer specializing in cloud-native systems and LLM applications
“Customer-support/engineering background spanning Informatica PowerCenter ETL and IBM demos/workshops, with hands-on experience hardening data workflows for production (error tables/reject links, validation, restart strategies, alerting, performance tuning). Also demonstrates a clear, systems-level approach to diagnosing LLM/agentic workflow issues (prompt/RAG/tooling/memory) using instrumentation and iterative fixes, and has partnered with sales on POCs by defining success metrics and mapping solutions to customer architectures.”
Mid-level Backend Software Engineer specializing in distributed microservices
“Internship at ActiveVM where they tackled large-scale Spring Boot 2→3/library migrations across hundreds of downstream products by combining OpenRewrite (AST-based recipes) with an LLM/RAG-based classifier that routed risky files to human experts. Reported ~70% reduction in manual effort and 90%+ accuracy after testing across multiple branches and cutovers; also built a CTR-driven book recommendation capstone showcased at the Google office in Cambridge.”