Pre-screened and vetted.
“LLM engineer who has deployed production RAG systems for regulated document QA (PDFs/knowledge bases), emphasizing grounded answers with citations, RBAC, monitoring, and continuous feedback. Demonstrates deep practical expertise in retrieval quality (semantic chunking, hybrid BM25+embeddings, re-ranking), reliability (guardrails, deterministic workflows), and measurable evaluation (golden sets, log replay, A/B tests) while partnering closely with compliance/operations stakeholders.”
Mid-level AI/ML Engineer specializing in LLM fine-tuning, RAG, and MLOps
“AI/ML engineer with HP experience building and productionizing an LLM-powered document intelligence platform (LangChain + Pinecone) to deliver semantic search and contextual Q&A across millions of enterprise support documents. Demonstrates strong MLOps and scaling expertise (Airflow, Kubernetes autoscaling, Triton GPU inference, monitoring with Prometheus/W&B) plus a structured approach to evaluation (A/B tests, shadow deployments, failover) and effective collaboration with non-technical stakeholders.”
Mid-Level Software Engineer specializing in secure cloud microservices and FinTech
“Built and owned major parts of a real-time distributed AI fraud-detection pipeline (ingestion, inference microservice integration, and automated action layer), optimizing latency and observability and reducing false positives by ~35%. Understands ROS/ROS2 concepts (nodes/topics/services) and planned hands-on ramp-up via ROS2 pub/sub exercises and Gazebo simulation, but has not worked on physical robots or ROS in production.”
Mid-level Generative AI Engineer specializing in LLM agents and RAG systems
“Built and deployed a production LLM/RAG knowledge assistant integrating internal docs, wikis, and ticket histories to reduce tribal-knowledge dependency and repetitive questions. Emphasizes reliability via grounding + a validation layer, and achieved major latency gains (>50%) through vector index optimization, caching, quantization, and selective re-validation. Comfortable orchestrating end-to-end LLM/data workflows with Airflow, Prefect, and Dagster, including monitoring and alerting.”
Senior Data Scientist / ML Engineer specializing in cloud ML pipelines and GenAI
“ML/NLP practitioner with experience building a transformer-failure prediction system that combines sensor signals with unstructured maintenance comments using LLM-based extraction and similarity validation. Strong emphasis on production readiness—data leakage controls, SQL-driven data quality tiers, and rigorous bias/fairness validation (including contract/spec evaluation across diverse company profiles).”
Mid-level Machine Learning Engineer specializing in financial AI, NLP, and MLOps
“AI/ML engineer with experience at Accenture and Morgan Stanley, building production LLM systems (GPT-3 summarization) and finance-focused ML models (credit risk and trading anomaly detection). Combines MLOps depth (Docker/Kubernetes, AWS SageMaker/Glue/Lambda, MLflow, A/B testing, drift monitoring) with practical domain adaptation techniques like few-shot prompting and RAG/knowledge-base integration.”
Mid-level AI/ML Engineer specializing in Generative AI and Conversational AI
“GenAI Engineer at Infosys who built and deployed a production multi-agent RAG system for a top-tier bank, scaling to ~50,000 queries/day with 99.9% uptime. Drove measurable gains (45% accuracy improvement, 30% API cost reduction) through open-source LLM fine-tuning, Pinecone indexing/retrieval optimization, and AWS-based MLOps/monitoring, and has experience enabling adoption via developer workshops and customer-facing collaboration.”
Mid-level Software Engineer specializing in cloud-native microservices and AI-powered web applications
“Backend engineer who built and owned an AI-powered SMS survey platform for a nonprofit serving at-risk communities (internet-limited users), using Cloudflare Workers + Twilio and a state-machine survey engine. Scaled it to ~10k active users with near-zero downtime, added English/Spanish support, and iteratively improved LLM behavior (Claude 3.7 Sonnet) to handle nuanced, real-world SMS responses reliably.”
Mid-level AI/ML Engineer specializing in NLP, LLMs, and risk modeling
“GenAI/LLM engineer who architected and deployed a production RAG “research assistant” for JPMorgan Chase’s regulatory compliance team, focused on safety-critical behavior (mandatory citations, refusal when evidence is missing). Deep hands-on experience with LlamaIndex, Pinecone, Hugging Face embeddings, LangGraph agent workflows, and metric-driven evaluation (golden sets, TruLens), including a reported 28% relevancy lift via cross-encoder re-ranking.”
Senior AI/ML Engineer specializing in Generative AI, agentic systems, and RAG
“Built and deployed an agentic RAG assistant in production to automate enterprise knowledge search and multi-step workflows with tool calling, tackling real-world issues like hallucinations, retrieval accuracy, and latency. Demonstrates strong LLMOps and orchestration depth (MLflow, Airflow, LangGraph/LangChain/LlamaIndex) plus a metrics-driven approach to agent testing/evaluation and cross-functional delivery with business stakeholders.”
Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems
“GenAI/LLM engineer with production deployments in both fintech and retail: built an AI-powered mortgage document analysis/automated underwriting pipeline at Fannie Mae (OCR + custom LLM) cutting underwriting review from 3–4 hours to under an hour with privacy-by-design controls. Also helped build Sephora’s GenAI product advisory bot using LangChain-orchestrated RAG (Azure GPT-4, Azure AI Search, MySQL HeatWave vector search), focusing on grounding, evaluation, and compliance-aware architecture choices.”
Senior Full-Stack Engineer specializing in AI/LLM and cloud-native SaaS
“Software engineer with strong end-to-end ownership across frontend, backend, data, and infrastructure, including real-time systems (Kafka/Postgres) and observability (Datadog). Built and productionized an AI-native RAG support assistant (OpenAI embeddings + Pinecone) with prompt/guardrail design, achieving 48% agent adoption and 30% faster responses. Experienced in legacy modernization and reliability work using feature flags, event/transaction replay, and rapid embedded delivery.”
Mid-level Applied AI/ML Engineer specializing in LLMs, RAG, and fraud/anomaly detection
“Built and productionized an internal LLM-powered document Q&A system at Morgan Stanley using a LangChain-based RAG pipeline (FAISS + OpenAI) with AWS ingestion (S3/Lambda), handling 100k+ pages and cutting lookup time ~35% while keeping responses under 3 seconds. Strong on reliability: automated evals/CI (pytest + GitHub Actions), CloudWatch monitoring, drift detection (prompt drift and fraud-model drift), and security controls (IAM + app-level authorization) in a financial-services environment.”
Mid-level GenAI/ML Engineer specializing in LLM applications and RAG systems
“GenAI/LLMOps practitioner who deployed a production RAG-based customer service and knowledge retrieval system for a global bank using LangChain, FAISS/Azure Cognitive Search, GPT-4/Claude, and Guardrails—driving a reported 35% Q&A accuracy lift while reducing handle time and escalations. Also partnered with non-technical leaders at CVS Health to deliver ML-driven supply chain risk and inventory insights via anomaly detection, NLG summaries, and stakeholder-friendly dashboards.”
Junior Data Infrastructure Software Engineer specializing in distributed pipelines and AI extraction
Senior Backend Software Engineer specializing in Supply Chain and Generative AI
Mid-level AI/ML Engineer specializing in fraud detection and Generative AI
Mid-level AI/ML Engineer specializing in fraud detection, risk modeling, and real-time ML systems
Junior AI/ML Engineer specializing in Computer Vision and LLM/RAG systems
Junior Software Engineer specializing in DevOps/SRE and ML platform infrastructure
Mid-level Applied Scientist specializing in production GenAI and RAG systems
Mid-level Data Scientist / Data Engineer specializing in Azure ML and scalable data pipelines