Pre-screened and vetted.
“Built and deployed a production RAG-based LLM Q&A and summarization platform for internal documents, emphasizing grounded answers with structured prompting and citations to reduce hallucinations. Experienced orchestrating end-to-end LLM workflows with LangChain plus cloud pipelines (Azure ML Pipelines, AWS), and runs iterative evaluation using both metrics (accuracy/hallucination/latency/cost) and real user feedback to drive reliability.”
Principal AI/ML Architect specializing in GenAI, LLMs, RAG, and Agentic AI
“FinTech/AI engineer who has shipped an end-to-end discrepancy-detection product for financial managers using Next.js, FastAPI/GraphQL, Pinecone, and AWS (with dev/staging/prod, observability, A/B testing, and documentation). Also built an AI-native “AI Genesis” system with agentic cyclic workflows, routing, and tool use, and has experience modernizing legacy systems via the strangler fig pattern while coordinating with senior stakeholders on a 5G autonomous simulation platform.”
Senior AI/ML Data Scientist specializing in NLP, computer vision, and MLOps
“Applied LLMs and a graph-RAG architecture in Neo4j to automate an accounting firm's cross-checking of transactional books against tax regulations, indexing 1,000+ pages into a knowledge graph with vector search. Combines agentic LLM workflows with classical NER (Hugging Face/NLTK) and validates using expert-labeled held-out data plus precision/recall and measured accountant time savings after deployment.”
Entry-level Software Engineer specializing in AI and full-stack data systems
“Backend/AI engineer who has built an offline, citation-grounded RAG system end-to-end with hybrid retrieval, local LLM inference, and quantitative evaluation via RAGAS. Also brings real-time systems experience from an Airbnb-like booking platform and data pipeline/ML quality work from a Bilibili internship, with a strong emphasis on reliability, privacy, and measurable correctness.”
Intern software engineer specializing in AI, cloud, and full-stack systems
Junior Software Engineer specializing in full-stack development and data engineering
Mid-level AI/ML Engineer specializing in Generative AI and RAG systems
Staff Software Engineer/Manager specializing in Generative AI and enterprise platforms
Mid-level Software Engineer specializing in backend systems and LLM-powered AI applications
Senior Machine Learning Engineer specializing in Generative AI and NLP
Mid-level AI Engineer specializing in Generative AI and LLM/RAG systems
Senior Data Scientist specializing in LLMs, NLP, and anomaly detection
Principal Data Scientist specializing in LLMs, RAG, and enterprise AI products
Executive AI Architect specializing in enterprise cloud and FinTech solutions
“Candidate brings an operator-to-founder profile with leadership experience in IT and Business Systems and a strong grasp of how ideas become venture-backable products. They speak fluently about startup evaluation criteria such as TAM, technical defensibility, speed to scale, and AI differentiation, and appear especially motivated by building solutions end-to-end in startup or venture studio environments.”
Director-level AI & Data Science leader specializing in GenAI, LLMs, and MLOps
“ML/NLP engineer currently working in NYC on a system that connects complex unstructured data sources to deliver personalized insights, using embeddings + vector DB retrieval and a RAG architecture (LangChain, Pinecone/OpenSearch). Strong focus on production constraints—especially low-latency retrieval—using FAISS/ANN, PCA, index partitioning, and Redis caching, plus PEFT fine-tuning (LoRA/QLoRA) and KPI/SLA-driven promotion to production.”
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and Conversational AI
“Built a production RAG-based GenAI copilot backend at Aetna using Python/FastAPI, GPT-4, LangChain, and Azure AI Search, deployed on AKS with Prometheus/Grafana observability. Owned the system end-to-end (ingestion through deployment) and improved peak-time reliability by addressing vector search and embedding bottlenecks with Redis caching, index optimization, and async processing, plus added anti-hallucination guardrails via retrieval confidence thresholds.”
Intern Software Engineer specializing in edge AI deployment and distributed systems
“Full-stack engineer who built an enterprise search platform (Codlens) delivering natural-language Q&A over Jira/Slack using embeddings, vector DB search, re-ranking (RRF), and LLM responses with source grounding. Also designed and benchmarked a distributed IAM system with Postgres transaction-log replication and Raft-based quorum consistency, reporting ~253 TPS at ~60ms latency in a multi-node setup. Experience spans early-stage startups (Zetic AI, Sagwara Capital) and large-scale orgs (Akamai, Atlassian).”
Mid-level AI Engineer specializing in GenAI, NLP, and MLOps
“LLM/agentic-systems engineer with PayPal experience hardening an LLM-powered fraud support assistant from prototype to production, focusing on low-latency distributed architecture, rigorous evaluation/testing, and security/compliance. Comfortable in customer-facing and GTM contexts—runs technical demos/workshops, builds tailored pilots, and aligns sales/CS with engineering to close deals and drive adoption.”
Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps
“Internship experience shipping production AI systems: built an end-to-end RAG platform (Python/FastAPI + LangChain/LangGraph + vector search) to answer support questions from unstructured internal docs, with a strong focus on hallucination prevention through confidence gating and rigorous offline/online evaluation. Also delivered an AI-driven personalization/analytics feature using an unsupervised clustering pipeline, iterating with PMs to align statistically strong clusters with actionable business segmentation.”
Junior AI/ML Engineer specializing in agentic AI, RAG, and voice systems
“Full-stack AI product engineer who has owned production-grade document intelligence and agent systems at meaningful scale, including a copilot used by 10,000+ users and 1M+ queries. Particularly strong in combining React/TypeScript product work with Python/FastAPI, RAG, knowledge graphs, observability, and performance tuning—cutting latency from ~7 seconds to 0.5 milliseconds while improving trust through citations and human review.”