Pre-screened and vetted.
Mid-level AI Engineer & Data Scientist specializing in LLMs, RAG, and multimodal systems
“LLM/GenAI engineer who built a production AI-powered credit risk policy summarization and compliance alerting platform at HCL Tech, focused on factual accuracy and auditability for a financial client. Implemented a multi-retriever LangChain RAG architecture with citations-only prompting, fallback agents, and human-in-the-loop legal review—cutting manual review time by 35% and scaling to 12 teams.”
Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions
“AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail Maven dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.”
Mid-level Data Scientist specializing in GenAI, LLM-to-SQL, and analytics platforms
“LLM/agentic AI builder who led end-to-end integration of an LLM system into a business intelligence product, creating a scalable, metadata-driven RAG/agent pipeline with an orchestrator that routes queries to specialized agents (including DB-backed quantitative querying). Also built an LLM-to-SQL chatbot and partnered with non-technical stakeholders to capture domain context and improve SQL generation, using automated LLM-based testing to evaluate reliability.”
Mid-level Applied AI Engineer specializing in agentic LLM workflows
“AI engineer with production experience building a LangGraph-based, stateful multi-agent system at MetLife to automate complex insurance claims adjudication, integrating document discovery, Azure Document Intelligence OCR/extraction, and health data analysis. Strong in agent orchestration and production deployment (Docker + FastAPI REST APIs), with a structured approach to reliability, evaluation, and stakeholder-driven requirements.”
Mid-level Full-Stack Java Developer specializing in microservices and cloud (AWS/Azure)
“Backend/full-stack Java engineer at PNC Bank specializing in real-time fraud detection systems. Built event-driven Spring Boot + Kafka microservices with PostgreSQL/Redis performance tuning, and shipped a production LLM-powered RAG feature for fraud analysts with strong guardrails (grounded internal data, structured prompts with references, human-in-the-loop) plus an evaluation loop using labeled historical fraud cases.”
Junior AI/ML & Full-Stack Engineer specializing in LLMs and RAG systems
“Forward-deployed engineer who built a production AI drone-control chatbot that lets users fly a drone via natural language while viewing a real-time feed. Implemented RAG over drone SDK documentation (vector DB + top-k retrieval) and LoRA fine-tuning, with a focus on latency, token efficiency, and cost reduction, and regularly works with non-technical clients to integrate and explain AI system architecture.”
Junior Machine Learning Engineer specializing in NLP, computer vision, and MLOps
“ML/LLM engineer with Meta experience building production AI systems for near real-time user-report classification and summarization under strict latency (<250ms), safety, cost, and privacy constraints. Has hands-on MLOps/orchestration experience (Airflow, Spark, MLflow, Kubernetes, Docker, GitHub Actions) plus observability (Prometheus/Grafana) and applies rigorous evaluation, staged rollouts, and A/B testing to keep agent workflows reliable in production.”
Intern AI/ML Engineer specializing in computer vision and time-series forecasting
“Undergrad who built a production RAG chatbot for a messy college website using OpenAI embeddings + FAISS, overcoming hard-to-crawl/non-selectable site content and strict API budget limits. Applies information-retrieval best practices (section-based chunking with overlap, precision/recall evaluation) and reliability techniques (edge-case testing, similarity thresholds, fallback responses), and has experience scaling similar indexing work to ~300,000 Wikipedia pages.”
Mid-level AI Engineer specializing in multi-agent systems and RAG
“Built and shipped a production LangGraph-based multi-agent LLM analytics/decision copilot that answers questions across SQL/BI systems and unstructured docs, emphasizing grounded, tool-verified outputs with citations and confidence gating. Deep hands-on experience with orchestration (LangGraph, CrewAI, OpenAI Assistants, MCP) plus real-world latency/cost optimization (vLLM batching/KV caching, speculative decoding, quantization) and rigorous eval/observability. Partnered closely with business/ops stakeholders to deliver explainable reporting automation, cutting manual reporting time by 50%+.”
Junior Product Manager specializing in AI-enabled analytics products
“Product/full-stack engineer with analytics-dashboard experience at Kantar, owning features end-to-end from React/Next.js UI through Postgres data modeling and query optimization. Built a multidimensional filters/tags module that cut analyst discovery time by ~60% and also implemented durable backend workflows for bulk report generation with retries and idempotency, validated via EXPLAIN ANALYZE and production monitoring.”
Junior Machine Learning Engineer specializing in NLP and multimodal transformers
“Built and deployed LLM-powered agentic chatbot and text-to-SQL systems using LangGraph/LangChain (and Bedrock), structuring workflows as DAGs with planning/replanning and validation to improve tool-calling reliability and reduce hallucinations. Operates production feedback loops with online/offline metrics, drift detection, and LangSmith-based evaluation pipelines, and regularly partners with business stakeholders and clinicians using slide decks and visual charts.”
Mid-level AI/ML Engineer specializing in predictive modeling and cloud ML pipelines
“LLM engineer/data engineer who has deployed production RAG systems for internal-document Q&A, building end-to-end ingestion, embedding, vector search, and FastAPI serving while actively reducing hallucinations and latency through rigorous retrieval tuning and caching. Also experienced in orchestrating cloud data pipelines (Airflow, AWS Glue, Azure Data Factory) and partnering with non-technical business teams to deliver AI solutions like automated document review.”
Mid-level AI/ML Engineer specializing in Generative AI and intelligent automation
“LLM engineer who built and productionized a system to classify GitHub commits (performance vs non-performance) using zero-/few-shot approaches over commit messages and diffs, working at ~5M-record scale on multi-node NVIDIA GPUs. Experienced orchestrating end-to-end LLM pipelines with Airflow and GitHub Actions, and emphasizes reliability via testing, guardrails, and observability while collaborating closely with non-technical product stakeholders.”
Mid-level AI/ML Engineer specializing in production ML, RAG systems, and MLOps
“Built and shipped a widely adopted, production-grade RAG internal search assistant that unified scattered engineering knowledge, deployed as a FastAPI service on Kubernetes with FAISS + LangChain. Demonstrates deep practical expertise in retrieval tuning (chunking, hybrid search, re-ranking) and in making LLM workflows reliable in production via guardrails, monitoring, and evaluation, plus strong cross-functional delivery with non-technical operations teams.”
Junior Software Engineer specializing in distributed systems and applied AI
“Early-career full-stack builder who created an AI interview-prep platform used by 200+ students, tested it with a 25-student study group, and earned recognition through the CUNY Startup accelerator, including prize money and local college adoption. Has also shipped compliance-sensitive AI products in healthcare marketing and operational tools like invoice approval systems, showing unusual breadth across AI, UX, and backend systems.”
“Backend-focused engineer with banking-domain deployment experience who has owned releases end-to-end, from discovery and API/database implementation through post-launch stabilization. Brings a reliability-first mindset across distributed systems, incident response, and messy real-world data handling, and has also applied that foundation to retrieval-based LLM workflows in production-oriented cloud environments.”
Mid-level Software Engineer specializing in AI pipelines and enterprise integrations
“Candidate has 4 years of experience and appears strongest in customer-facing implementation and AI-enabled workflow automation. They describe owning deployments end-to-end, putting an LLM support assistant with RAG and function calling into production, and improving support operations with a 30% reduction in resolution time and 25% gain in agent productivity.”
Senior Full-Stack Software Engineer specializing in backend systems and cloud-native APIs
“Full-stack engineer with startup-style ownership across backend, frontend, and AI systems, spanning Java/Spring, React, Node/TypeScript, and LLM-powered retrieval. Shipped a workspace intelligence layer using LangChain, OpenAI, and Pinecone to paying customers, while also improving core product metrics like workspace creation success (+30%), latency (450ms to 280ms), and deployment cycle time (-40%).”
Mid-level Full-Stack Software Engineer specializing in cloud-deployed web apps and APIs
“Software engineer who has shipped both core web platform features (secure user authentication/profile management) and production LLM systems. Built an internal documentation knowledge assistant using a full RAG pipeline (OpenAI embeddings, vector DB, semantic search, reranking) with evaluation loops and a scalable document-ingestion pipeline for PDFs/FAQs, iterating based on metrics and user feedback.”
Mid-level AI Engineer specializing in LLM agents and RAG for health-tech
“Backend engineer with health-tech AI platform experience who designed a modular FastAPI/PostgreSQL architecture supporting real-time user data and swap-in AI workflows. Has hands-on production experience with observability (CloudWatch, structured logging, LangSmith/LangGraph/LangChain tracing), secure auth (OAuth2/JWT, RBAC, RLS), and careful data-pipeline migrations using parallel runs and rollback planning.”
Mid-level AI/ML Engineer specializing in financial risk, fraud analytics, and forecasting
“Built and productionized an LLM-powered financial intelligence and forecasting platform at Northern Trust using a RAG architecture (LangChain + Hugging Face + FAISS) with end-to-end MLOps (Docker/Kubernetes, Airflow, MLflow). Emphasized regulatory-grade explainability (SHAP/Power BI) and hallucination control (retrieval-only grounding), achieving ~30% forecasting accuracy improvement and ~65% reduction in analyst research time, with sub-second inference and 95% uptime on EKS/AKS.”
Junior Machine Learning Engineer specializing in computer vision and generative AI
“CoreAI intern at The Home Depot who improved the Magic Apron Assistant by building a production video ingestion + RAG retrieval system for long videos (uploads and YouTube), including a graph-based retrieval module to speed up and improve relevance. Experienced with Kubernetes orchestration (HPA) and production reliability practices like caching, monitoring, regression testing, and stakeholder-driven requirements.”
Junior Software Engineer specializing in Full-Stack and GenAI/LLM applications
“LLM/RAG practitioner building clinician-facing AI search and Q&A inside EHR workflows, focused on trust, latency, and safety (grounded answers with citations, PHI controls, encryption/audit logs). Demonstrated real-time incident response for production LLM systems (e.g., fixing a metadata-filter deployment regression to prevent irrelevant results/cross-patient leakage) and strong demo/enablement skills for mixed technical and clinical stakeholders; also shipped a multi-model RAG tool at OrbeX Labs with upload/search/audit features for day-to-day adoption.”
Mid-level ML Engineer specializing in NLP and Generative AI
“Healthcare AI/ML engineer with Epic experience who built and deployed a HIPAA-compliant GPT-4 RAG clinical assistant over large medical document sets, emphasizing privacy controls and low-latency performance. Also automated end-to-end retraining and deployment of patient risk models using orchestration/CI-CD (Jenkins, SageMaker, MLflow), cutting deployment time from hours to minutes while improving reliability.”