Pre-screened and vetted.
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“Built and deployed a production RAG system for financial/compliance teams using GPT-4, Claude, and local models to retrieve and summarize thousands of internal documents with strong security controls (role-based retrieval, PII masking). Drove significant operational gains (30+ hours/week saved, ~35% productivity lift, ~45% faster responses) and orchestrated end-to-end ingestion/embedding/index refresh pipelines with Airflow, S3, and SageMaker while partnering closely with compliance stakeholders on auditability and traceability.”
Mid-level AI & Machine Learning Engineer specializing in Generative AI and MLOps
“Built a production GPT-4/LangChain/Pinecone RAG “AI Copilot” at Northern Trust to automate financial report generation and analyst Q&A over internal structured (SQL warehouse) and unstructured policy data. Focused on real-world production challenges—grounding and latency—achieving major speed gains (seconds to milliseconds) via MiniLM embedding optimization and Redis caching, and implemented rigorous testing/evaluation with MLflow-backed metrics while aligning compliance and finance stakeholders for deployment.”
Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps
“AI software engineer with experience spanning LLM/RAG production systems and regulated fintech infrastructure. Built an end-to-end natural-language-to-SQL analytics assistant (Weaviate + GPT-4 + Supabase) shipped as an API with 92% accuracy and major time savings for non-technical users, and also owned demand-forecasting and CI/CD/containerization improvements for a Bank of America core banking deployment at Infosys.”
Intern Software Engineer specializing in full-stack and LLM/RAG systems
“Full-stack engineer who built "Workstream AI," an AI-powered engineering visibility product that converts GitHub activity into real-time insights using an event-driven microservices stack (RabbitMQ/Postgres/Express) and GPT-4 with a React frontend. Previously a Founding SWE at a health & wellness startup, building data-driven user management tooling, and also delivered a real-time shuttle tracking/ride request system using Java Spring Boot/Hibernate + React; comfortable owning production deployment details (AWS EC2, DNS, SSL).”
“AI/ML engineer with banking domain experience (M&T Bank) who built a production credit-risk prediction and reporting platform combining ML models (XGBoost/TensorFlow) with a RAG pipeline (LangChain + GPT-4) over compliance documents. Delivered measurable impact (≈20% better risk detection/precision, 50% less manual reporting) and productionized workflows on Vertex AI/Kubeflow with CI/CD and monitoring; also implemented embedding-based semantic search using FAISS/Pinecone.”
Mid-level Generative AI Engineer specializing in LLM agents and RAG
“GenAI/LLM engineer who built and deployed a production RAG system for enterprise document search and decision support, cutting manual lookup time by 40%+. Experienced with LangChain/LangGraph agent orchestration plus Airflow/Prefect for ingestion and incremental reindexing, with a strong focus on reliability (testing, observability) and stakeholder-driven metrics.”
Mid-level AI/ML Engineer specializing in NLP, RAG, and MLOps for FinTech
“ML/LLM engineer with production experience building a compliant RAG-based virtual assistant at Intuit, optimizing embeddings and FAISS retrieval (including PCA) for low-latency, privacy-controlled search and deploying via AWS SageMaker containers. Also built scalable Airflow+MLflow pipelines using Docker and KubernetesExecutor, cutting training cycles by 37%, and partnered with civil engineers/project managers at Aegis Infra to deliver predictive maintenance for construction equipment.”
Mid-level AI Engineer & Data Scientist specializing in LLMs, RAG, and multimodal systems
“LLM/GenAI engineer who built a production AI-powered credit risk policy summarization and compliance alerting platform at HCL Tech, focused on factual accuracy and auditability for a financial client. Implemented a multi-retriever LangChain RAG architecture with citations-only prompting, fallback agents, and human-in-the-loop legal review—cutting manual review time by 35% and scaling to 12 teams.”
Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions
“AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail Maven dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.”
Mid-level Applied AI Engineer specializing in agentic LLM workflows
“AI engineer with production experience building a LangGraph-based, stateful multi-agent system at MetLife to automate complex insurance claims adjudication, integrating document discovery, Azure Document Intelligence OCR/extraction, and health data analysis. Strong in agent orchestration and production deployment (Docker + FastAPI REST APIs), with a structured approach to reliability, evaluation, and stakeholder-driven requirements.”
Junior AI & Data Engineer specializing in LLM systems and analytics platforms
“Backend/ML engineer who built a job-search automation SaaS using a modular Selenium ETL pipeline, rigorous testing/observability, and a cost-optimized two-pass LLM ranking approach. Has led high-integrity data extraction from messy multi-city PDF records (95% integrity) and managed modular production rollouts for a 20+ engineer team, with a strong security focus (deny-by-default, row-level access control) in an AI-assisted grading platform.”
Mid-level AI Engineer specializing in LLM agents and RAG for health-tech
“Backend engineer with health-tech AI platform experience who designed a modular FastAPI/PostgreSQL architecture supporting real-time user data and swap-in AI workflows. Has hands-on production experience with observability (CloudWatch, structured logging, LangSmith/LangGraph/LangChain tracing), secure auth (OAuth2/JWT, RBAC, RLS), and careful data-pipeline migrations using parallel runs and rollback planning.”
Mid-level AI/ML Engineer specializing in financial risk, fraud analytics, and forecasting
“Built and productionized an LLM-powered financial intelligence and forecasting platform at Northern Trust using a RAG architecture (LangChain + Hugging Face + FAISS) with end-to-end MLOps (Docker/Kubernetes, Airflow, MLflow). Emphasized regulatory-grade explainability (SHAP/Power BI) and hallucination control (retrieval-only grounding), achieving ~30% forecasting accuracy improvement and ~65% reduction in analyst research time, with sub-second inference and 95% uptime on EKS/AKS.”
Mid-level AI/ML Engineer specializing in production RAG systems and MLOps
“Built and deployed a GPT-4 + Pinecone RAG system that lets users query large internal document collections with grounded, cited answers. Demonstrates strong applied LLM engineering (chunking experiments, hallucination controls, metadata recency boosting) plus production-minded evaluation/monitoring and performance tuning (rate-limit mitigation via pooling/batching). Also effective at translating complex AI concepts to non-technical stakeholders through prototypes and live demos, helping secure client sponsorship.”
Senior LLM Engineer specializing in Generative AI, RAG, and multimodal assistants
“GenAI/NLP engineer with experience building classification and summarization pipelines in PyTorch and deploying multimodal GPT-4-style workflows. Has integrated LLM applications across OpenAI, Azure OpenAI, and Amazon Bedrock, and uses LangChain/LlamaIndex/Semantic Kernel to orchestrate RAG and agent workflows with production-focused evaluation metrics like task success rate and groundedness.”
Mid-level ML Engineer specializing in NLP and Generative AI
“Healthcare AI/ML engineer with Epic experience who built and deployed a HIPAA-compliant GPT-4 RAG clinical assistant over large medical document sets, emphasizing privacy controls and low-latency performance. Also automated end-to-end retraining and deployment of patient risk models using orchestration/CI-CD (Jenkins, SageMaker, MLflow), cutting deployment time from hours to minutes while improving reliability.”
Mid-level GenAI/ML Engineer specializing in LLM systems and RAG chatbots
“Built and shipped a production agentic LLM analytics platform that lets non-SQL business users query relational databases in plain English via a RAG + LangChain/LangGraph workflow and FastAPI service. Emphasizes safety and reliability with guardrails (validation/access control), testing/evaluation frameworks, and performance optimization (caching, monitoring, Dockerized scalable deployment), reducing dependency on data teams and speeding analytics turnaround.”
Mid-level AI/ML Engineer specializing in Generative AI and MLOps
“Built and shipped a production RAG assistant using GPT-4, LangChain, and Pinecone/FAISS to search 50K+ institutional documents, with a strong focus on groundedness and hallucination reduction through retrieval optimization and re-ranking. Pairs this with a metrics-driven evaluation/monitoring approach (BLEU/ROUGE, manual sampling, logging) and workflow automation via Airflow, and has experience translating stakeholder needs into iterative AI prototypes.”
Mid-level Machine Learning Engineer specializing in cloud-native generative AI for healthcare
“AI engineer at Cleveland Clinic building production LLM/NLP systems for radiology documentation, focused on HIPAA-aware, real-time performance across ~298 campuses. Re-architected infrastructure with AWS event-driven services to handle scaling and improved SLA compliance ~40%, and complements this with a personal multi-agent debate system (CrewAI) using local Llama/Mistral plus rigorous evaluation (A/B tests, red teaming, observability).”
Senior AI/ML Engineer specializing in financial risk, fraud detection, and GenAI analytics
“AI/ML engineer with experience at Northern Trust and Persistent Systems building production LLM + RAG systems for regulated financial use cases, including liquidity forecasting, anomaly detection, and credit scoring. Emphasizes compliance-first design with explainability (SHAP), traceability (MLflow), and hallucination controls (FAISS + citation-grounded prompting), and has delivered drift-triggered retraining pipelines using Airflow and Kubernetes while translating model outputs into business-ready marketing segments.”
Mid-level Machine Learning Engineer specializing in data security and GenAI systems
“Built Hexagon’s production Text-to-CAD Copilot that converts text and rough sketches into editable CAD code, combining GraphRAG (Neo4j/LangChain) with a Gemini-powered vision module and multi-agent geometric validation—cutting manual modeling from a day to ~45 seconds and driving retrieval latency below 50ms. Also has large-scale GCP data/ML orchestration experience (Airflow/Cloud Composer, Dataflow, Pub/Sub, Snowflake) processing 50M+ daily records with drift monitoring and automated reliability controls.”
Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps
“Data professional with ~4 years of experience, most recently at AIG (insurance), building ML/NLP systems for fraud detection and policy automation using transformers, CNNs, and clustering/anomaly detection. Also developed a RAG-based knowledge retrieval system, iterating across embedding models and moving to production based on precision and latency SLAs, then containerizing and deploying with SageMaker and CI/CD.”
Mid-level Data Engineer specializing in multi-cloud real-time data pipelines
“Data engineer with healthcare/clinical trial domain experience who owned a 100TB+/month AWS pipeline end-to-end (Glue/S3/Redshift/Airflow) and drove measurable outcomes (20% lower latency, 99.9% reliability, 40% less manual reporting). Also built production data services and API-based ingestion on GCP (Cloud Run/Functions/BigQuery) with strong validation, versioning, and safe migration practices, and launched an early-stage RAG solution (LangChain + GPT-4) for researchers.”
Mid-level Data Scientist / ML Engineer specializing in Generative AI, RAG, and MLOps
“Built and productionized a RAG-based LLM research assistant for biomedical and regulatory document search using Mixtral 7B on SageMaker, LangChain, and Milvus, cutting research time by ~40%. Has hands-on multi-cloud MLOps experience across AWS/Azure/GCP with Kubeflow/Airflow/Composer plus Terraform + ArgoCD, and applies rigorous evaluation/monitoring (latency, accuracy, hallucinations). Also partnered with a non-technical PM to deliver an insurance policy Q&A chatbot that reduced customer response time by 30%+.”