Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in MLOps and production ML systems
“Backend/ML engineer who has shipped high-scale real-time systems across e-commerce and healthcare: built a PharmEasy real-time recommendation engine for ~2M monthly users (cut feature latency 5 min→30 sec; +15% cross-sell) and architected a HIPAA-compliant multimodal clinical diagnostic workflow (DICOM+EHR) with XAI, MLOps (MLflow/Airflow/K8s), and drift/monitoring guardrails supporting 10k+ daily predictions.”
Intern LLM/GenAI Engineer specializing in RAG, agentic systems, and low-latency inference
“Interned at Larsen & Toubro where they built and deployed an agentic RAG document question-answering system to reduce time spent searching documents and improve trustworthiness. Implemented ReAct-style multi-step orchestration with LangChain/LlamaIndex plus evidence-bounded generation, grounding/citations, and rigorous evaluation—cutting latency ~40%, hallucinations ~35%, and unsafe outputs ~40% while collaborating closely with non-technical business/ops stakeholders.”
Mid-level Machine Learning Engineer specializing in healthcare NLP and MLOps
“ML/AI practitioner in healthcare (Syneos Health) who has deployed production clinical NLP and risk models. Built a BERT-based physician-note information extraction system on Docker + AWS SageMaker (reported ~42% retrieval improvement) and automated retraining/deployment with Airflow and drift detection, while partnering closely with clinicians to drive adoption (reported ~18% readmission reduction).”
Mid-level AI/ML Engineer specializing in GenAI and predictive modeling
“Built and deployed a GPT-4-powered medical assistant for clinical staff to reduce time spent searching guidelines and EHR information, with a strong emphasis on safety and compliance. Uses strict RAG, confidence thresholds, and fallback behaviors to prevent hallucinations, and runs production-grade workflows orchestrated with LangChain/LangGraph plus Docker/Kubernetes/MLflow and monitoring for reliability and cost.”
Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems
“LLM/ML engineer who has shipped an enterprise RAG-based Q&A system (LangChain/LlamaIndex, FAISS + Azure Cognitive Search, GPT-3.5/4 via OpenAI/Azure OpenAI) to production on Docker + Kubernetes/OpenShift, tackling hallucinations, retrieval quality, latency/cost, and RBAC/IAM security. Also partnered with operations leaders to turn manual reporting into an LLM-powered summarization and forecasting dashboard driven by real KPIs and iterative stakeholder feedback.”
Mid-level AI/ML Engineer specializing in real-time anomaly detection and AI agents
“Built a production real-time anomaly detection platform for high-frequency trading at HSBC, using a streaming stack (Pulsar + Spark Structured Streaming + AWS Lambda) and a transformer-based model combining time-series and numerical signals. Experienced in MLOps and safe deployment (Kubernetes, canary releases, MLflow/Grafana monitoring) and in aligning model performance with risk/compliance expectations through SLA-driven tuning and stakeholder-friendly dashboards.”
Junior Applied AI Engineer specializing in LLMs, RAG, and agentic systems
“Co-founded a healthcare AI startup building and deploying software directly with end users, emphasizing rapid shipping, deep user interviews, and workflow-first adoption. Has hands-on production deployment experience on AWS (including diagnosing a silent AWS App Runner failure caused by an ARM vs amd64 Docker build mismatch) and is motivated by customer-facing, travel-heavy roles to keep engineering tightly connected to real-world usage.”
Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps
“Built a production RAG-based “unified assistant” that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.”
Mid-level AI/ML Engineer specializing in Generative AI and healthcare data
“Built and deployed a production RAG-based document Q&A system on Azure OpenAI to help business teams search thousands of PDFs/Word files, using Qdrant vector search, MongoDB, and a Flask API. Demonstrates strong production engineering (streaming large-file ingestion, parallel preprocessing, monitoring/retries) plus systematic prompt/embedding/chunking experimentation to improve accuracy and reduce hallucinations, and has hands-on orchestration experience with ADF/Airflow/Databricks/Synapse.”
Mid-level AI/ML Engineer specializing in healthcare analytics and MLOps
“AI/ML engineer at Cigna Healthcare building a production, HIPAA-compliant LLM-powered clinical insights platform that summarizes unstructured medical notes using a fine-tuned transformer + RAG on AWS. Demonstrates strong end-to-end MLOps and cloud optimization (distillation, Spot/Lambda/Auto Scaling) with quantified outcomes (~28% accuracy lift, ~40% less manual review, ~25% lower ops cost) and strong clinician-facing explainability via SHAP and dashboards.”
Mid-level Generative AI Engineer specializing in LLM systems and RAG
“Currently at Huntington Bank, built a production-grade RAG system that helps business/operations teams get grounded answers from large volumes of internal enterprise documents. Owns ingestion and FastAPI backend, tuned hybrid BM25+vector retrieval and chunking for relevance, and evaluates reliability with metrics and observability (LangSmith, CloudWatch, Prometheus/Grafana) while partnering closely with non-technical stakeholders.”
Senior AI/ML Engineer specializing in Generative AI and RAG
“ML/NLP practitioner at Morf Health focused on unifying fragmented healthcare data by linking structured patient/encounter records with unstructured clinical notes. Has hands-on experience with transformer embeddings, vector databases, and domain fine-tuning, plus rigorous evaluation (precision/recall) and human-in-the-loop validation with clinical SMEs to make pipelines production-grade.”
Mid-level AI/ML Engineer specializing in fraud detection, NLP, and MLOps
“Built a production real-time fraud detection and customer-support automation platform at Citibank, tackling extreme class imbalance (reported ~1:5000) and strict latency constraints. Combines hands-on MLOps (Airflow, Kubernetes, MLflow; Snowflake/Spark/S3 integrations; CI/CD model promotion) with cross-functional delivery to Risk & Compliance focused on interpretability and reducing false positives.”
Mid-level AI/ML Engineer specializing in healthcare, fraud detection, and recommender systems
“Healthcare-focused applied ML/LLM engineer who has deployed production systems including an LLM medical documentation assistant that summarizes unstructured EHR notes into physician-ready structured outputs. Experienced building secure, compliant pipelines (PHI minimization, RBAC, encryption) and scaling via Docker/Kubernetes/Azure ML, plus orchestrating ETL/ML workflows with Airflow and Kubeflow; also built an LLM-driven clinical coding assistant at Centene with measurable performance metrics.”
Mid-level Data Scientist specializing in LLM development and scalable ML pipelines
“Built and deployed production LLM pipelines for evidence-based scoring in two domains: biomedical literature mining (scoring ~2700 drug compounds vs gene targets/mechanisms) and long-horizon news analytics (35 years of Chinese articles). Emphasizes reliability at scale (retries/checkpointing/validation), rigorous empirical model benchmarking (GPT-4o/mini/5), and translating results into stakeholder-friendly visual narratives.”
Senior AI Engineer specializing in production GenAI systems
“AI engineer who has shipped production LLM systems end-to-end, including a natural-language-to-SQL analytics copilot for career advisors that achieved ~95% query success through schema grounding, access controls, and automated regression testing with golden queries. Also builds LangGraph-orchestrated multi-step agents (resume analysis, recommendations) and RAG pipelines (PDF ingestion + FAISS) and partners closely with non-technical users to drive adoption and trust.”
Junior Machine Learning Researcher specializing in knowledge distillation
“Built and shipped LLM-powered agents including a production RAG research assistant that cut research lookup time from ~20 minutes to ~10–20 seconds using caching, retrieval thresholds, and citation-enforced grounded answers. Also designed multi-step, tool-calling workflows with stateful critique/revision loops and pragmatic monitoring (retry/schema-failure/low-confidence signals) plus normalization/validation layers for messy notes/spreadsheet-style data.”
Mid-level AI/ML Engineer specializing in financial analytics and production ML systems
“Analytics candidate with experience in financial transaction and fraud detection projects, combining SQL data preparation, Python-based automation, and dashboarding. They have owned projects from stakeholder alignment and metric definition through rollout, with emphasis on reducing false positives, improving operational efficiency, and making analytics outputs easy for business teams to adopt.”
Mid-level AI/ML Engineer specializing in LLMs, MLOps, and healthcare-fintech AI
“Built and owned a production GPT-4 RAG assistant for clinical and enterprise query resolution, taking it from initial experiment to deployment, monitoring, and iterative improvement. Their work cut resolution time from 45 minutes to under 2 minutes, achieved roughly 95% accuracy, and scaled to thousands of additional monthly queries while emphasizing safety and trust in a sensitive clinical domain.”
Mid-level AI/ML Engineer specializing in NLP, MLOps, and FinTech
“ML/AI engineer with production experience at S&P Global and Accenture, focused on regulated, enterprise-grade systems. Built end-to-end financial risk and credit default models with >90% precision and 12% fewer false positives, and is currently developing secure RAG pipelines on AWS SageMaker for enterprise insight extraction.”
Staff Machine Learning Engineer specializing in NLP, LLMs, and document intelligence
“ML/AI engineer at PNC who has shipped enterprise-grade RAG and document intelligence systems for compliance and policy workflows. Stands out for combining LLM product thinking with production rigor—owning FastAPI/Kubernetes deployments, monitoring, evaluation, and human-feedback loops that drove measurable gains like 40% faster policy search and 30% faster compliance review.”
Mid-level AI/ML Engineer specializing in GenAI, RAG, and enterprise ML systems
“ML/AI engineer with hands-on experience at Morgan Stanley building production fraud detection and enterprise RAG systems. Stands out for owning systems end-to-end—from experimentation and deployment to monitoring and iteration—and for delivering measurable impact, including an 18% reduction in fraud false positives, 40% lower inference latency, and internal tooling that reduced model deployment time from days to hours.”
Mid-level AI/ML Engineer specializing in Generative AI for Financial Services
“ML/AI engineer with strong financial-services domain experience who has built production systems spanning trade anomaly detection, investment-research RAG, and agentic LLM workflows. Particularly compelling for teams needing someone who can take ML/GenAI from prototype to monitored production while balancing compliance, latency, cost, and reliability.”
Mid-level Generative AI Engineer specializing in LLMs and enterprise AI
“Built and owned an enterprise LLM/RAG document intelligence platform for PNC Financial Services in a compliance-heavy environment, focused on grounded answers over internal finance and policy documents. Stands out for combining GenAI product delivery with production engineering discipline, delivering 60% faster document review and materially better answer quality while creating reusable FastAPI-based AI services for multiple teams.”