Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in LLM fine-tuning, RAG, and MLOps
“AI/ML engineer with HP experience building and productionizing an LLM-powered document intelligence platform (LangChain + Pinecone) to deliver semantic search and contextual Q&A across millions of enterprise support documents. Demonstrates strong MLOps and scaling expertise (Airflow, Kubernetes autoscaling, Triton GPU inference, monitoring with Prometheus/W&B) plus a structured approach to evaluation (A/B tests, shadow deployments, failover) and effective collaboration with non-technical stakeholders.”
“GenAI/data engineering practitioner with production experience across Equinix, Optum, and Citibank—built an Azure OpenAI (GPT-4) + LangChain document intelligence platform processing 1.5M+ docs/month and a HIPAA-compliant Airflow healthcare pipeline handling 5M+ claims/day. Also delivered a real-time fraud detection + explainability system using LightGBM and a fine-tuned T5 NLG component, improving fraud accuracy by 15%+ while partnering closely with compliance stakeholders.”
“Backend/AI engineer who built a real-time vector database system for high-frequency financial data using Kafka/Flink on Kubernetes, achieving sub-100ms similarity search at 10k+ concurrent load and resolving tricky duplication issues with idempotency/versioning. Also shipped an end-to-end LLM-based travel itinerary feature (profiling + prompt workflows + APIs) with a focus on quality consistency and low latency.”
Mid-level AI Engineer specializing in Generative AI, MLOps, and NLP for finance and healthcare
“Built and deployed a secure, production LLM-based document summarization and risk-highlighting tool for financial auditors, running inside a private Azure environment to protect confidential data. Focused on reliability (hallucination mitigation via retrieval-based prompts and source citations) and validated performance through comparisons to auditor summaries plus a user pilot, cutting review time by about half.”
“Built and deployed a production LLM-powered RAG assistant for semiconductor manufacturing failure analysis, reducing engineer triage effort by grounding outputs in retrieved evidence and gating responses with SPC + ML signals (LSTM anomaly scores, XGBoost probabilities). Experienced with LangChain/LangGraph to ship reliable, observable multi-step agents with branching/fallback logic, and evaluates impact using both technical metrics and business KPIs like mean time to triage and downtime reduction.”
Mid-Level Software Engineer specializing in Payments and Financial Services
“Software engineer with hands-on experience improving performance and reliability in financial workflows (settlements/loan processing), spanning React/TypeScript and Angular frontends plus Spring Boot microservices. Has delivered measurable latency improvements using PostgreSQL optimization and Redis caching, and has operated Kafka-based systems at scale with idempotent processing and backoff/retry strategies while iterating internal ops tooling with support/finance teams.”
Mid-level Generative AI Engineer specializing in enterprise RAG and multimodal NLP
“Built and deployed a production LLM/RAG chatbot at Wells Fargo for securely querying regulated financial and compliance documents, emphasizing low hallucination rates, explainability, and strict governance. Experienced with LangChain multi-agent orchestration plus Airflow/Prefect pipelines for ingestion, embeddings, evaluation, and retraining, and partnered closely with compliance/operations to drive adoption through demos and feedback-driven retrieval rules.”
Mid-Level Software Engineer specializing in secure cloud microservices and FinTech
“Built and owned major parts of a real-time distributed AI fraud-detection pipeline (ingestion, inference microservice integration, and automated action layer), optimizing latency and observability and reducing false positives by ~35%. Understands ROS/ROS2 concepts (nodes/topics/services) and planned hands-on ramp-up via ROS2 pub/sub exercises and Gazebo simulation, but has not worked on physical robots or ROS in production.”
Mid-level Generative AI Engineer specializing in LLM agents and RAG systems
“Built and deployed a production LLM/RAG knowledge assistant integrating internal docs, wikis, and ticket histories to reduce tribal-knowledge dependency and repetitive questions. Emphasizes reliability via grounding + a validation layer, and achieved major latency gains (>50%) through vector index optimization, caching, quantization, and selective re-validation. Comfortable orchestrating end-to-end LLM/data workflows with Airflow, Prefect, and Dagster, including monitoring and alerting.”
Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps
“Built and deployed an AWS-based LLM/RAG ticket triage and knowledge retrieval system (Pinecone/FAISS + Step Functions + MLflow) that cut support resolution time by 20%. Demonstrates strong production focus on hallucination reduction, PII security, and low-latency orchestration, with measurable evaluation improvements (e.g., ~25% grounding accuracy gain via re-ranking) and proven collaboration with support operations stakeholders.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps
“Red Hat ML/LLM engineer who designed and deployed a production LLM-powered customer support automation system using RAG, improving latency by 30% via PEFT and vector search optimization. Built security and governance into retrieval (access-level filtering, encrypted Pinecone/ChromaDB) and delivered SHAP-based explainability via a dashboard for non-technical stakeholders. Experienced orchestrating distributed ML/RAG pipelines across AWS SageMaker and OpenShift with Airflow/Prefect, plus multi-agent workflows using CrewAI and LangGraph.”
Mid-level Full-Stack Developer specializing in cloud microservices and internal tooling
“LLM/RAG engineer who has shipped production systems in high-stakes domains (fraud analytics at Mastercard and security compliance as a CI/CD gate). Strong focus on reliability: hybrid retrieval for latency, citation-backed outputs for trust, and code-driven eval/regression pipelines using golden datasets. Also built scalable OCR-based ingestion for messy classroom artifacts (handwriting, PDFs, whiteboard photos) using Go/Python and cloud services.”
Mid-level AI/ML Engineer specializing in GenAI, LLMs, RAG, and MLOps
“Built and deployed a production LLM-powered RAG document intelligence/Q&A system for healthcare prior authorization, reducing manual medical document review time and improving decision efficiency. Strong in end-to-end LLM application engineering (LangChain/LangGraph), retrieval quality improvements (hybrid search, embedding tuning, chunking strategies), and rigorous evaluation/monitoring for reliability.”
Intern Full-Stack Software Engineer specializing in AI/ML and AWS cloud platforms
“Full-stack engineer who built an LLM-powered productivity web app (LifeOS) end-to-end with TypeScript/Next.js, Prisma, and Postgres, emphasizing fast iteration with stable API contracts and an isolated AI service boundary. Also built a security/compliance login-verification workflow at Medpace used within an internal admin portal for thousands of employees, and has AWS experience orchestrating batch GPU workloads with robust retry/idempotency patterns.”
“Built and productionized an LLM-powered PDF document Q&A system to eliminate manual searching through long documents, focusing on scalability and answer reliability. Implemented semantic chunking (using headings/paragraphs/tables), overlap, and preprocessing/quality checks to reduce hallucinations, and orchestrated the end-to-end pipeline with Airflow using retries, alerts, and parallel tasks.”
Mid-level Machine Learning Engineer specializing in financial AI, NLP, and MLOps
“AI/ML engineer with experience at Accenture and Morgan Stanley, building production LLM systems (GPT-3 summarization) and finance-focused ML models (credit risk and trading anomaly detection). Combines MLOps depth (Docker/Kubernetes, AWS SageMaker/Glue/Lambda, MLflow, A/B testing, drift monitoring) with practical domain adaptation techniques like few-shot prompting and RAG/knowledge-base integration.”
Mid-level AI/ML Engineer specializing in Generative AI and Conversational AI
“GenAI Engineer at Infosys who built and deployed a production multi-agent RAG system for a top-tier bank, scaling to ~50,000 queries/day with 99.9% uptime. Drove measurable gains (45% accuracy improvement, 30% API cost reduction) through open-source LLM fine-tuning, Pinecone indexing/retrieval optimization, and AWS-based MLOps/monitoring, and has experience enabling adoption via developer workshops and customer-facing collaboration.”
Mid-level Software Engineer specializing in cloud-native microservices and AI-powered web applications
“Backend engineer who built and owned an AI-powered SMS survey platform for a nonprofit serving at-risk communities (internet-limited users), using Cloudflare Workers + Twilio and a state-machine survey engine. Scaled it to ~10k active users with near-zero downtime, added English/Spanish support, and iteratively improved LLM behavior (Claude 3.7 Sonnet) to handle nuanced, real-world SMS responses reliably.”
Intern AI Engineer specializing in LLM agents, RAG, and applied biostatistics
“Siemens AI engineer who shipped production multi-agent LLM systems across cybersecurity and sustainability, including a vulnerability automation agent that cut manual work 70%. Deep in orchestration (LangGraph supervisor-worker state machines), reliability engineering (async fault tolerance, retries, spike handling), and rigorous evaluation (offline benchmarks, LLM-as-a-Judge improving label agreement 28.9%) with measurable production guardrails.”
Mid-level AI/ML Engineer specializing in NLP, LLMs, and risk modeling
“GenAI/LLM engineer who architected and deployed a production RAG “research assistant” for JPMorgan Chase’s regulatory compliance team, focused on safety-critical behavior (mandatory citations, refusal when evidence is missing). Deep hands-on experience with LlamaIndex, Pinecone, Hugging Face embeddings, LangGraph agent workflows, and metric-driven evaluation (golden sets, TruLens), including a reported 28% relevancy lift via cross-encoder re-ranking.”
Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps
“AI/ML engineer who has shipped production AI systems end-to-end, including an automated multi-channel (Gmail/WhatsApp/voice) candidate interviewing workflow and an enterprise RAG knowledge search platform. Demonstrates strong production rigor (monitoring, A/B tests, guardrails, schema validation, shadow testing) with quantified impact: ~60–70% reduction in interview evaluation time and ~20–30% relevance gains in RAG retrieval.”
Mid-level AI/ML Engineer specializing in financial risk, fraud detection, and GenAI
“GenAI/ML engineer in Citigroup’s finance environment who has deployed production RAG systems for investment banking under strict privacy and model-risk constraints. Built an internal-VPC Llama2 + Pinecone + LangChain solution with NER redaction and citation-based verification to prevent hallucinations, delivering major time savings, and also partnered with global finance executives to ship an AI early-warning indicator for treasury/liquidity risk.”
Mid-level Machine Learning Engineer specializing in GPU-accelerated LLMs and MLOps
“Built and deployed a production LLM-powered decision-support system for supply-chain planners that explains demand forecast changes using grounded retrieval from sales, promotion, inventory, and supplier data. Implemented strict anti-hallucination guardrails and latency optimizations, deployed as a real-time AWS API with monitoring, and reported ~15% forecast accuracy improvement and ~12% supply-chain risk reduction. Experienced orchestrating data/ML/LLM workflows with Airflow, LangChain/LangGraph-style patterns, and AWS Step Functions while partnering closely with non-technical business users via demos and example-based requirements.”
Mid-Level Backend Software Engineer specializing in payments, fraud systems, and AI agent infrastructure
“Early-career engineer who owned an end-to-end objective assessment/coding contest platform at an edtech startup, using Postgres + S3 and Redis (queues + ZSET) to decouple and scale code submission processing with worker sandboxes. Also implemented idempotency controls and set up monitoring and CI/CD while the rest of the team focused on curriculum.”