Pre-screened and vetted.
“ServiceNow engineer who built and launched a production LLM-powered ticket resolution/knowledge assistant using RAG (LangChain + Hugging Face embeddings + vector search) integrated into internal support dashboards via REST APIs. Optimized the system from ~6–8s to ~2–3s latency while improving usability with concise, cited answers and guardrails (grounding + similarity thresholds), delivering ~30–35% reduction in manual ticket investigation effort.”
Mid-level GenAI/ML Engineer specializing in LLM applications and enterprise automation
“Built and shipped a production LLM-powered healthcare support agent at UnitedHealthGroup, using LangChain + FAISS RAG on AWS SageMaker with CloudWatch monitoring and human-in-the-loop fallbacks for safety. Strong focus on reliability engineering (confidence gating, retries/timeouts, caching) and continuous evaluation loops; reported ~40% improvement in query resolution efficiency while reducing manual support workload.”
Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and cloud MLOps
“Built and deployed a production LLM + vector search clinical decision support system at UnitedHealth Group, retrieving medical evidence and patient context in real time for prior authorization and risk scoring. Strong in end-to-end RAG architecture (Hugging Face embeddings, Pinecone/FAISS, SageMaker, Redis) plus orchestration (Airflow/Kubeflow) and rigorous evaluation/monitoring, with demonstrated ability to align solutions with clinical operations stakeholders.”
Mid-level Machine Learning Engineer specializing in LLMs and NLP classification systems
“Internship experience building a production RAG+LLM pipeline to map messy card transaction descriptions to merchant brands, including a custom modified-ROUGE evaluation approach for weak/variant ground truth. Improved scalability and cost by moving from a managed LLM endpoint (e.g., Bedrock) to self-hosted vLLM, and orchestrated massive embedding backfills (5,000+ files, 10B+ rows) using an Airflow-triggered SQS + ECS worker architecture with robust retry/DLQ handling.”
Mid-level Full-Stack Engineer specializing in AI/ML data platforms for biotech and FinTech
“AI/ML full-stack practitioner in a small-scale manufacturing/lab operations environment who deployed a production ML system to improve blood cell order fulfillment by predicting yield/success from donor characteristics. Experienced building custom multi-agent orchestration (Python, LangChain/LangGraph, MCP) and balancing reliability, data quality constraints, and token/ROI economics while communicating tradeoffs to VP-level business stakeholders.”
Junior Software Engineer specializing in video streaming and processing systems
“Software engineering intern at China Telecom who built and continuously evolved a real-time transaction platform ("Smart Tangerine") focused on strong consistency and peak-hour concurrency. Implemented microservices with Redis and RabbitMQ to decouple heavy processing and cut latency (~80ms to ~30ms), and led a zero-downtime migration from a monolith using strangler pattern, dual-write, and traffic shadowing.”
Mid-level Full-Stack Software Engineer specializing in Java/Spring Boot and cloud microservices
“Backend-focused Python/Flask engineer who has built authentication/profile services with clean modular architecture (blueprints + service layer) and tuned SQLAlchemy/Postgres for scale using indexing, query rewrites, and pagination. Has production-style integration experience for AI/ML via TensorFlow Serving and OpenAI APIs (batching, rate limiting, caching), plus multi-tenant data isolation and high-throughput background processing with Celery/Redis and idempotent jobs.”
“LLM engineer who has deployed production RAG systems for regulated document QA (PDFs/knowledge bases), emphasizing grounded answers with citations, RBAC, monitoring, and continuous feedback. Demonstrates deep practical expertise in retrieval quality (semantic chunking, hybrid BM25+embeddings, re-ranking), reliability (guardrails, deterministic workflows), and measurable evaluation (golden sets, log replay, A/B tests) while partnering closely with compliance/operations stakeholders.”
“Backend/AI engineer who built a real-time vector database system for high-frequency financial data using Kafka/Flink on Kubernetes, achieving sub-100ms similarity search at 10k+ concurrent load and resolving tricky duplication issues with idempotency/versioning. Also shipped an end-to-end LLM-based travel itinerary feature (profiling + prompt workflows + APIs) with a focus on quality consistency and low latency.”
Junior Machine Learning Engineer specializing in LLMs, RAG, and medical imaging
“At Fileread, the candidate built and deployed an LLM-powered legal document classification and retrieval layer for an agentic extraction system that turns unstructured legal PDFs into structured tables with line-level citations. They productionized a RAG-style pipeline (ingestion, embeddings, retrieval, reranking, generation) and report 95%+ F1 across 70+ legal categories, emphasizing rigorous evaluation and close collaboration with legal domain experts for high-stakes precision.”
Entry Machine Learning Engineer specializing in NLP, computer vision, and recommender systems
“Built and shipped an end-to-end podcast recommendation system exposed via a Flask API and React UI, explicitly balancing relevance, diversity (MMR), and safety constraints while meeting ~200ms latency targets. Also implemented a production-style RAG/information-extraction pipeline using web retrieval, spaCy NER, and fine-tuned SpanBERT with guardrails and evaluation loops (precision/recall/F1) to tune confidence thresholds and improve reliability.”
Mid-level AI/ML Engineer specializing in LLM systems, MLOps, and Healthcare AI
“Built and shipped a production-grade agentic RAG system at CVS Health for patient adherence and medication recommendations, processing 20k+ patient records/day. Strong focus on real-world reliability: hybrid retrieval tuned with re-ranking (<400ms latency), strict JSON/schema validation and tool guardrails, and monitoring/drift detection that reduced MTTD from 6 days to 18 hours while improving recommendation accuracy (+8%) and cutting escalations (~23%).”
Mid Software Engineer specializing in distributed cloud-native backend systems
“Backend/AI workflow engineer who built production-grade orchestration systems for hardware security verification at Silicon Assurance (Nextflow/Python/Postgres) and a multi-agent LLM-driven regulatory code checking system at the University of Florida. Emphasizes reliability: strict plan/execute/verify boundaries, queue-based isolation, and strong observability/auditability with Prometheus/Grafana and persisted prompts/tool calls.”
Mid-level Software Engineer specializing in backend, AI, and full-stack systems
“Built and shipped production LLM agents including an internal RAG-based compliance classification system at SAIL (FastAPI/Redis/Docker) designed to handle real failure modes and scale to ~10k LLM calls/hour, achieving ~93% pipeline accuracy with reduced hallucination risk via multi-model orchestration and strict grounding. Also architected “Elara,” a state-machine-driven conversational appointment booking agent using structured JSON outputs and backend function execution for reliability, and has experience normalizing messy OTA/PMS data at RateGain.”
Entry-level Software Engineer specializing in full-stack and FinTech systems
“Software engineer with AppFolio payments experience who shipped an end-to-end payment adoption rollout during a short internship, including full-stack implementation, stakeholder coordination, and real-time monitoring. Also built an insurance service AI agent inspired by his father's agency, combining structured document ingestion, RAG, and agentic integrations to improve reliability on dense policy documents.”
Mid-level Machine Learning Engineer specializing in data science and cloud systems
“ML engineer who independently pitched and built a recommendation engine at Danske Bank in a legacy fintech environment, creating compliant data pipelines and deployment infrastructure from scratch and delivering a 62% engagement lift with 70%+ advisor adoption. Also worked at AWS on classification and GenAI-powered reporting systems, with strengths spanning production ML, platform setup, monitoring, and research-to-production optimization.”
Senior Machine Learning Engineer specializing in NLP and generative AI
“ML/AI engineer focused on production NLP and voice AI systems in the restaurant tech space, with hands-on work spanning ASR, intent classification, LLM fine-tuning, and deployment monitoring at Presto AI. They highlight a 15% improvement in full-AI ordering rate and also built a restaurant sentiment analysis product at Wisely that they say became a standout feature in a $10M acquisition context.”
Mid-level AI/ML Engineer specializing in Generative AI and financial services
“ML/AI engineer with hands-on experience shipping regulated financial AI systems at JPMC and Capgemini, spanning credit risk, fraud detection, and generative AI assistants. Stands out for combining modern LLM/RAG architectures with strong MLOps, real-time infrastructure, and explainability/compliance practices, while delivering measurable business impact in latency, accuracy, cost, and risk reduction.”
Mid-level AI/ML Engineer specializing in GenAI, RAG, and healthcare ML
“Built an end-to-end GenAI/RAG platform for financial compliance and research at BlackRock, focused on safe, auditable answers in a highly regulated environment. Combines strong LLM engineering depth with production platform skills and delivered clear business impact, including reducing research/compliance turnaround from hours to seconds, improving retrieval relevance by 22%, and cutting inference costs by 75%.”
Senior Software Engineer specializing in cloud infrastructure and platform engineering
“Backend engineer with deep experience in security and access-management platforms at JPMorgan Chase, including owning automation for migrating 50+ engineering teams from CyberArk to HashiCorp Vault. Stands out for combining regulated-environment rigor, infrastructure automation, and production operations with practical AI integration in internal access workflows.”
Intern software engineer specializing in AI, full-stack, and applied ML
“Backend/ML-focused engineer with experience spanning fintech, sales enablement, and medtech, including a Capital One capstone and a Singapore medtech startup internship. Stands out for owning end-to-end AI/backend systems, from a GenAI sales pitch platform that cut prep time by 50% to an ultrasound-guidance MVP for non-expert operators in a highly ambiguous domain.”
Mid-level Software Development Engineer specializing in cloud-native AI/ML systems
“AI/ML-focused engineer with practical experience building RAG-based and multi-agent systems, including architectures for retrieval, reasoning, context processing, and response generation. Stands out for combining LLM productivity gains with disciplined software engineering practices like validation, monitoring, and reproducibility.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“Engineer with impactful experience at Palo Alto Networks and Optum, focused on production automation and AI-powered internal tools. Built and owned an end-to-end RAG knowledge system adopted by 1000+ internal users with roughly 75% faster response times, and also transformed a legacy Optum coverage-feed workflow from 500+ minutes to under 3 minutes through data standardization and microservices refactoring.”