Pre-screened and vetted.
Director-level AI & Data Science leader specializing in GenAI, LLMs, and MLOps
“ML/NLP engineer currently working in NYC on a system that connects complex unstructured data sources to deliver personalized insights, using embeddings + vector DB retrieval and a RAG architecture (LangChain, Pinecone/OpenSearch). Strong focus on production constraints—especially low-latency retrieval—using FAISS/ANN, PCA, index partitioning, and Redis caching, plus PEFT fine-tuning (LoRA/QLoRA) and KPI/SLA-driven promotion to production.”
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and Conversational AI
“Built a production RAG-based GenAI copilot backend at Aetna using Python/FastAPI, GPT-4, LangChain, and Azure AI Search, deployed on AKS with Prometheus/Grafana observability. Owned the system end-to-end (ingestion through deployment) and improved peak-time reliability by addressing vector search and embedding bottlenecks with Redis caching, index optimization, and async processing. Also added anti-hallucination guardrails via retrieval confidence thresholds.”
Principal Data Scientist specializing in NLP and Generative AI
“ML/NLP practitioner with experience building an embedding-based ad matching and search system at Vericast (BERT embeddings + similarity search) to replace a third-party taxonomy approach, evaluated via a human-curated gold standard. Also built a custom NER pipeline at Allstate for auto accident claims calls using a bidirectional LSTM and achieved 90%+ F1, with a strong emphasis on production-grade ML workflows (testing, CI/CD, orchestration, versioning, validation).”
Intern Software Engineer specializing in edge AI deployment and distributed systems
“Full-stack engineer who built an enterprise search platform (Codlens) delivering natural-language Q&A over Jira/Slack using embeddings, vector DB search, re-ranking (RRF), and LLM responses with source grounding. Also designed and benchmarked a distributed IAM system with Postgres transaction-log replication and Raft-based quorum consistency, reporting ~253 TPS at ~60ms latency in a multi-node setup. Experience spans early-stage startups (Zetic AI, Sagwara Capital) and large-scale orgs (Akamai, Atlassian).”
Mid-level AI Engineer specializing in GenAI, NLP, and MLOps
“LLM/agentic-systems engineer with PayPal experience hardening an LLM-powered fraud support assistant from prototype to production, focusing on low-latency distributed architecture, rigorous evaluation/testing, and security/compliance. Comfortable in customer-facing and GTM contexts—runs technical demos/workshops, builds tailored pilots, and aligns sales/CS with engineering to close deals and drive adoption.”
Mid-level AI/ML Engineer specializing in Generative AI, NLP, and Computer Vision
“Built an LLM-powered learning assistant (EduQuizPro/EduCrest Pro) that uses RAG over URLs and PDFs to generate quizzes, notes, and explanations for students/professors. Emphasizes production robustness—implemented dependency fallbacks (FAISS/Sentence Transformers/Gradio), CLI-safe mode, and NumPy-based indexing—along with a custom orchestration layer to keep multi-step AI workflows reliable.”
Mid-level Software Engineer specializing in embedded AI and full-stack systems
“Robotics software engineer who built and owned core navigation components for a TurtleBot in ROS/ROS2 and Gazebo, including an RRT-based planner, waypoint-to-velocity motion planning, and PID trajectory tracking. Demonstrates strong real-time debugging skills (control-loop timing under CPU load), costmap/occupancy-grid tuning, and distributed ROS2 communication design using DDS/QoS, plus Docker and CI/CD automation experience from Keysight.”
Intern Software Engineer specializing in LLM agents and full-stack development
“Embedded C++ engineer with Bosch automotive infotainment experience, owning real-time audio middleware modules with strict latency/memory constraints. Strong in profiling/optimizing deterministic behavior, debugging hardware-specific intermittent issues, and building automated test + CI pipelines; currently ramping up on ROS2 concepts (DDS, nodes/topics/services) to transition toward robotics.”
Senior Data Engineer specializing in cloud analytics and data modernization
“Candidate has hands-on experience delivering production data and AI systems, including an AWS-based real-time data platform for a financial client at Deloitte and a production RAG workflow that cut manual search time by 40%. They stand out for combining strong data engineering depth with practical LLM governance, incident debugging, and stakeholder management across business and risk/compliance teams.”
Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps
“Internship experience shipping production AI systems: built an end-to-end RAG platform (Python/FastAPI + LangChain/LangGraph + vector search) to answer support questions from unstructured internal docs, with a strong focus on hallucination prevention through confidence gating and rigorous offline/online evaluation. Also delivered an AI-driven personalization/analytics feature using an unsupervised clustering pipeline, iterating with PMs to align statistically strong clusters with actionable business segmentation.”
Junior AI/ML Engineer specializing in agentic AI, RAG, and voice systems
“Full-stack AI product engineer who has owned production-grade document intelligence and agent systems at meaningful scale, including a copilot used by 10,000+ users and 1M+ queries. Particularly strong in combining React/TypeScript product work with Python/FastAPI, RAG, knowledge graphs, observability, and performance tuning—cutting latency from ~7 seconds to 0.5 milliseconds while improving trust through citations and human review.”
Mid-level Full-Stack Java Developer specializing in cloud microservices and AI-driven platforms
“Software engineer with Intuit experience shipping an end-to-end real-time financial insights product on AWS, using event-driven architecture with Kafka and Spark Streaming to process millions of records with low latency. Also delivers customer-facing React + TypeScript dashboards and has hands-on production operations experience, including resolving a database scaling incident via read replicas, query tuning, and connection pooling.”
Mid-level Data Scientist / Machine Learning Engineer specializing in fraud, risk, and MLOps
“AI/ML practitioner with Northern Trust experience who has shipped production LLM systems (internal support assistant) using RAG, vector databases, orchestration (LangChain/custom pipelines), and rigorous monitoring/feedback loops. Also built AI-driven fraud detection/risk monitoring solutions in a regulated financial environment, emphasizing explainability (SHAP), audit readiness, and stakeholder trust through dashboards and clear communication.”
Mid-level AI/ML Engineer specializing in fraud detection and risk analytics in Financial Services
“At JPMorgan Chase, built and deployed a production LLM-powered RAG knowledge assistant to help fraud investigators and risk analysts quickly navigate regulatory updates and internal policies, reducing investigation delays and compliance risk. Strong focus on secure retrieval (RBAC filtering), reliability (layered testing + observability), and production constraints (latency/SLOs), with Airflow-orchestrated, auditable ML pipelines.”
Mid-level AI/ML Engineer specializing in NLP, MLOps, and scalable data pipelines
“Built and shipped a production LLM-powered personalized client engagement assistant in the financial domain, balancing real-time recommendations with strict privacy/compliance requirements. Demonstrates strong MLOps/LLMOps depth (Airflow + MLflow, containerized microservices, drift monitoring) and a privacy-by-design approach validated in collaboration with risk and compliance teams.”
Mid-level Software Engineer specializing in AI agents, backend systems, and data engineering
“Amazon engineer who built a production AI agent platform (Python/AWS Strands on Bedrock) that lets teams create tool-using, multi-agent workflows, e.g., agents that auto-triage and resolve customer support tickets by reading internal documentation and collaborating with a research agent. Previously worked at Deloitte on IAM using Ping Identity/Ping DaVinci orchestration, and applies orchestration thinking plus structured evaluation (LLM-as-judge, surveys, automated tests) to improve agent reliability.”
Mid-level Software Engineer specializing in FinTech full-stack and AI applications
“Built and productionized an NLP-powered customer support assistant at JPMorgan Chase for digital banking, focused on reducing response time for repetitive client queries. Strong in real-world AI deployment challenges—sensitive data handling, low-latency FastAPI services, and AWS/Kubernetes operations with CI/CD—plus a metrics- and guardrails-driven approach to reliable AI workflows.”
Mid-level AI/ML Engineer specializing in LLM fine-tuning, RAG, and MLOps
“AI/ML engineer with HP experience building and productionizing an LLM-powered document intelligence platform (LangChain + Pinecone) to deliver semantic search and contextual Q&A across millions of enterprise support documents. Demonstrates strong MLOps and scaling expertise (Airflow, Kubernetes autoscaling, Triton GPU inference, monitoring with Prometheus/W&B) plus a structured approach to evaluation (A/B tests, shadow deployments, failover) and effective collaboration with non-technical stakeholders.”
Mid-Level Software Engineer specializing in cloud infrastructure and microservices
“Backend engineer who has led major platform evolution to cloud-native microservices (Spring Boot on AWS with Terraform) and built scalable, secure FastAPI APIs. Demonstrates strong production rigor with metric-driven validation, canary/phased rollouts, and incremental migrations using shadow traffic/feature flags/parallel writes, achieving faster deployments, fewer incidents, and zero downtime during traffic spikes and migrations.”
Mid-level AI Engineer specializing in Generative AI, MLOps, and NLP for finance and healthcare
“Built and deployed a secure, production LLM-based document summarization and risk-highlighting tool for financial auditors, running inside a private Azure environment to protect confidential data. Focused on reliability (hallucination mitigation via retrieval-based prompts and source citations) and validated performance through comparisons to auditor summaries plus a user pilot, cutting review time by about half.”
“ServiceNow engineer who built and launched a production LLM-powered ticket resolution/knowledge assistant using RAG (LangChain + Hugging Face embeddings + vector search) integrated into internal support dashboards via REST APIs. Optimized the system from ~6–8s to ~2–3s latency while improving usability with concise, cited answers and guardrails (grounding + similarity thresholds), delivering ~30–35% reduction in manual ticket investigation effort.”
Mid-level Full-Stack Software Engineer specializing in Java/Spring microservices and React
“Uber engineer who has owned internal products end-to-end across backend (Spring Boot microservices, MySQL) and frontend (React), including performance optimization and secure JWT-based auth. Also shipped a production internal RAG/embeddings LLM support assistant over policy docs and support tickets, with guardrails (confidence thresholds, human review) and an evaluation loop that directly reduced hallucinations.”
Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and cloud MLOps
“Built and deployed a production LLM + vector search clinical decision support system at UnitedHealth Group, retrieving medical evidence and patient context in real time for prior authorization and risk scoring. Strong in end-to-end RAG architecture (Hugging Face embeddings, Pinecone/FAISS, SageMaker, Redis) plus orchestration (Airflow/Kubeflow) and rigorous evaluation/monitoring, with demonstrated ability to align solutions with clinical operations stakeholders.”