Pre-screened and vetted.
Senior AI Python Engineer specializing in Generative AI and MLOps
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and NLP
“Backend engineer who built and migrated a large-scale document intelligence platform used by legal, healthcare, and insurance clients, processing millions of pages. Experienced moving from a monolithic, LLM-heavy approach to a modular FastAPI service architecture with ML classification + RAG, strong validation/auditability, and enterprise security (JWT/OAuth, RBAC, PostgreSQL RLS) with zero-downtime incremental rollouts.”
Junior AI Engineer specializing in RAG pipelines and agentic AI systems
“Built and shipped production RAG/agentic systems in high-stakes domains (biomedical and legal), including an enterprise biomedical document retrieval platform over ~10k scientific docs and a multilingual African-law assistant at the World Bank. Deep hands-on experience with LangChain/LangGraph/LlamaIndex and evaluation tooling (LLM-as-a-judge, safety/hallucination detection), with measurable gains in retrieval quality and hallucination reduction.”
Mid-level GenAI Engineer specializing in AI agents and RAG systems
“Built and deployed a production LLM-based RAG agent platform adopted by multiple business teams (Marketing, GTM, Recruiting, Customer Support) to automate knowledge search, Q&A, and content generation. Emphasizes production-grade reliability (grounding/validation/guardrails), rigorous evaluation/monitoring, and cost-aware scaling via model tiering, prompt/retrieval optimization, and caching using LangChain/LangGraph orchestration.”
Mid-level Data Scientist specializing in Generative AI and multimodal systems
“Recent J&J intern who built a conversational RAG agent and led a shift from a monolithic model to a modular RAG workflow, cutting response time from several days to under a second by tackling data fragmentation, context retention, and embedding/latency optimization. Also worked on a large (7B-parameter) multimodal VQA pipeline for healthcare research and stays current via NeurIPS/ICLR and open-source contributions.”
Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms
“Built and shipped a production RAG-based assistant that lets parents ask natural-language questions about their child’s learning progress, using pgvector retrieval (child-id filtered) and Redis caching to hit ~180ms latency. Implemented real-world guardrails and compliance (Llama Guard, COPPA, retrieval thresholds, fallbacks) with 99.5% uptime, and ran human-in-the-loop eval loops that improved satisfaction from 3.8 to 4.2 while serving 60k+ monthly users and reducing costs significantly.”
Mid-level AI Engineer specializing in LLMs, RAG, and healthcare AI
“Built and scaled an AI-powered voice/chat patient engagement platform at Penn Medicine from early prototype into production clinical workflows, focusing on latency, edge cases, and user trust. Strong in LLM reliability engineering (structured prompts, validation/fallbacks), real-time troubleshooting with observability, and cross-functional enablement through pilots, demos, and sales/customer partnership.”
Mid-level Full-Stack Engineer specializing in cloud-native microservices and AI automation
“Software engineer/product owner who has led end-to-end delivery of AI and content-management platforms, including building RAG-based reliability improvements and migrating fragile systems to containerized AWS ECS/Kubernetes with Terraform-managed CI/CD. Experienced designing event-driven microservices (SQS/SNS/RabbitMQ), scaling queue consumers with autoscaling, and creating internal Python tooling to standardize data connectors (e.g., BigQuery/Airtable/internal APIs) to speed iteration.”
Mid-level Generative AI Engineer specializing in LLMs and RAG systems
“Built and shipped a production RAG-based enterprise knowledge assistant to replace slow/inaccurate search across millions of documents, using LangChain orchestration with GPT-4/LLaMA and vector databases. Strong focus on production constraints—latency, hallucination control, and cost—using hybrid retrieval, guardrails, LLM-as-judge validation, and model routing, and has experience translating non-technical stakeholder pain points into measurable outcomes.”
Senior Machine Learning Engineer specializing in LLMs, speech AI, and RAG systems
“AI engineer with production experience building multilingual speech-to-speech translation pipelines (ASR + LLM) for enterprise/media, focused on reliability at scale. Has hands-on orchestration experience (including IBM Watson contexts) and emphasizes production evaluation/monitoring using a mix of traditional metrics and LLM-based evaluators to catch quality regressions while balancing latency and cost.”
Senior AI Engineer specializing in Agentic AI and distributed systems
“LLM/agentic workflow engineer with healthcare domain experience who built a HIPAA-compliant multi-agent RAG system for clinical review automation at UnitedHealth Group, achieving 92% precision and cutting latency 40% through async orchestration and Redis semantic caching. Also has strong data engineering orchestration background (Airflow on AWS EMR with Great Expectations) and a proven clinician-in-the-loop feedback process that improved model faithfulness by 18%.”
Mid-level AI Engineer specializing in GenAI, LLM integration, and RAG pipelines
“Built and led deployment of an autonomous, self-correcting multi-agent knowledge retrieval and validation system at HCA Healthcare to reduce heavy manual research/validation in clinical/compliance documentation. Deeply focused on production reliability and cost—used LangGraph StateGraph orchestration plus ONNX/CUDA/quantization to cut GPU costs by 25%, and partnered with the Compliance VP using real-time contradiction-rate dashboards to hit a 40% automation goal without compromising compliance.”
AI & Full-Stack Software Engineer specializing in LLM-powered applications
“Full-stack engineer focused on productionizing LLM applications, including an Android privacy-policy risk summarization app (Kotlin/React Native + FastAPI + Ollama) that cut response times from ~10s to ~5–6s via batching, caching, async, and event-driven architecture. Currently at PRGX building an LLM-based legal contract clause extraction system, partnering closely with legal/procurement SMEs to create schemas, labeled datasets, and evaluation pipelines that improved accuracy from 70% to 85%. Also has experience architecting real-time voice/LLM systems with streaming microservices (Kafka, Kubernetes, gRPC/WebSockets) and an avatar chatbot pipeline (TalkingHead, Google TTS, AnythingLLM).”
Senior Full-Stack Developer specializing in cloud-native microservices and AI/ML analytics
“Full-stack/backend engineer with deep insurance claims domain experience who built and operated a microservices + ETL platform (Java/Spring Boot + Python + Kafka/Databricks) processing 1M+ daily transactions. Combines production-grade reliability (99.7% uptime, zero-downtime blue/green releases, strong observability) with customer-facing UI delivery (AngularJS/React+TS dashboards and a hackathon-winning research chatbot).”
Mid-level AI/ML Engineer specializing in LLM systems, RAG, and MLOps
“Built a production, real-time clinical documentation system at HCA that converts doctor–patient conversations into structured clinical summaries using speech-to-text, LLM summarization, and RAG. Demonstrated measurable gains from medical-domain fine-tuning (clinical concept recall +18%, ROUGE-L 0.62 to 0.74) while meeting HIPAA constraints via PHI anonymization and encryption, and deployed via Docker/FastAPI with CI/CD and monitoring.”
Senior AI/ML Engineer specializing in Generative AI, LLMs, and MLOps
“Telecom (Verizon) AI/ML practitioner who built a production multimodal system that ingests messy customer issue reports (calls, chats, emails, screenshots, videos) and turns them into confidence-scored incident summaries with reproducible steps and evidence links. Also built KPI/alarm-to-ticket correlation to rank likely root-cause domains (RAN/Core/Transport), cutting triage from hours to minutes and improving MTTR.”
Mid-level AI/ML & Full-Stack Engineer specializing in LLM agents and medical RAG systems
“Full-stack engineer at an early-stage startup building an agentic AI application for enterprise systems, combining customer-facing Next.js/React UI work (30% faster load times) with backend/workflow orchestration using FastAPI + n8n, Redis, and RabbitMQ. Previously at Deloitte USI, built BDD Selenium/Java automation and managed 200+ defects end-to-end using JIRA/JAMA to support on-time production releases.”
Mid-Level Full-Stack Software Developer specializing in cloud-native microservices and AI/ML
“Backend engineer who optimized an AI-driven portfolio analytics/insights platform at Fidelity, addressing latency and traffic growth by moving services toward microservices, improving service communication, and tuning API/DB performance. Experienced scaling Python/FastAPI services with Docker + Kubernetes autoscaling, and strengthening security/privacy for sensitive client portfolio data used in LLM-based reporting.”
Mid-level AI/ML Engineer specializing in NLP and Generative AI
“Built and deployed a production LLM-powered RAG assistant for healthcare teams (care managers/support) to answer questions from clinical and policy documentation, emphasizing trustworthiness via improved retrieval, reranking, and strict grounding prompts to reduce hallucinations. Also has hands-on orchestration experience with Apache Airflow for end-to-end ETL/ML workflows and applies rigorous testing/metrics (hallucination rate, tool-call accuracy, latency, cost) to ensure reliable AI agent behavior.”
Mid-level Full-Stack Developer specializing in web platforms and cloud (AWS)
“Full-stack engineer with financial services experience (Lincoln Financial) who owned a customer-facing financial portal end-to-end using TypeScript/React and Node/Express. Has hands-on microservices and RabbitMQ event-driven workflows, addressing scale issues like retries/duplicates with idempotency and traceable logging, and built an internal real-time ops/support dashboard to improve monitoring and incident response.”
Senior AI/ML Engineer and Data Scientist specializing in Generative AI and MLOps
“ML/NLP practitioner focused on financial-services document intelligence and compliance workflows—built an end-to-end pipeline to classify documents and extract financial entities from loan applications, emails, and statements stored in S3/internal databases. Strong in entity resolution/record linkage and in productionizing pipelines with GitHub Actions CI/CD, testing, data validation, and Docker, plus semantic search using OpenAI embeddings and a vector database.”
Mid-level Data Scientist / ML Engineer specializing in FinTech and Healthcare ML systems
“AI/LLM engineer who has shipped production RAG systems (including a 250K-document compliance knowledge tool on AWS) and focuses on reliability via citations, guardrails, and rigorous evaluation (Ragas/Opik/DeepEval). Also built a LangGraph-orchestrated webcrawler agent that cut research paper extraction from hours to minutes, and collaborated with clinical teams to deliver patient volume forecasting with an optimization layer for staffing.”
Senior Full-Stack Software Engineer specializing in civic tech and AI/RAG systems