Pre-screened and vetted.
Mid-level AI Backend Engineer specializing in LLM applications and scalable ML services
Mid-level Software Engineer specializing in backend systems and LLM applications
Mid-level AI Backend Engineer specializing in LLM applications and scalable ML services
Mid-level Software Engineer specializing in event-driven backend and on-device ML for robotics
Mid-level AI Engineer specializing in LLM agents, RAG, and enterprise GenAI
Junior AI Product Engineer specializing in LLM workflows and analytics automation
Mid-level AI Engineer specializing in LLM orchestration and production AI systems
Mid-level AI/ML Engineer specializing in NLP, Generative AI, and fraud detection
“At PwC, built and productionized an agentic RAG enterprise search assistant over 6M internal documents (8M embeddings), deployed across AWS and GCP. Drove major retrieval gains (72%→92% precision via BM25+dense hybrid with RRF and cross-encoder re-ranking), reduced hallucinations 30%, achieved <2s latency at 50–60K queries/month, and cut support tickets 30%—boosting adoption to 2,500 users by adding source-cited answers.”
Intern AI/ML Engineer specializing in LLM applications and data infrastructure
“Hands-on LLM practitioner who built a production document-processing pipeline in Python, tackling long-document handling and latency with chunking/batching and a user-driven correction feedback loop. Experienced operationalizing AI workflows with Kubernetes (CronJobs, autoscaling, scheduled data cleaning and weekly retraining) and applying structured testing/evaluation (E2E, LLM-as-judge, HITL) while communicating solutions clearly to non-technical clients using visual diagrams.”
“Built and deployed a production RAG-based LLM Q&A and summarization platform for internal documents, emphasizing grounded answers with structured prompting and citations to reduce hallucinations. Experienced orchestrating end-to-end LLM workflows with LangChain plus cloud pipelines (Azure ML Pipelines, AWS), and runs iterative evaluation using both metrics (accuracy/hallucination/latency/cost) and real user feedback to drive reliability.”
“Built and deployed a live LLM-powered platform that takes a LinkedIn job URL + resume and generates job-specific resumes and personalized outreach at scale, with production-grade logging/monitoring/retries on Vercel + Railway. Experienced with agent orchestration (AWS Bedrock/Strands, LangGraph, CrewAI) and rigorous AI workflow testing, plus stakeholder-facing prototypes like data lineage/metadata and NL-to-SQL + dashboard generation.”
Mid-Level Software Engineer specializing in backend systems and LLM/RAG applications
“Backend/AI engineer at Intuit who built a production AI-powered case assistant for support agents (FastAPI on AWS EKS) combining Postgres case data, OpenSearch retrieval with embedding reranking, and internal LLMs. Improved peak-season reliability by diagnosing P95/P99 timeout spikes and cutting P95 latency from ~800ms to <400ms via composite indexing, keyset pagination, connection pool tuning, and caching, while adding grounded-generation guardrails (evidence packs, confidence thresholds, fallbacks, human-in-the-loop).”
Principal AI/ML Architect specializing in GenAI, LLMs, RAG, and Agentic AI
“FinTech/AI engineer who has shipped an end-to-end discrepancy-detection product for financial managers using Next.js, FastAPI/GraphQL, Pinecone, and AWS (with dev/staging/prod, observability, A/B testing, and documentation). Also built an AI-native “AI Genesis” system with agentic cyclic workflows, routing, and tool use, and has experience modernizing legacy systems via the strangler fig pattern while coordinating with senior stakeholders on a 5G autonomous simulation platform.”
Junior Software Engineer specializing in LLM systems, data engineering, and ML
“Backend/ML systems engineer with experience at SDSC, UCSD, and Media.net, building production semantic dataset/model discovery using embeddings + Solr KNN and LLM-based intent/reranking at 5M+ dataset scale. Emphasizes offline/online separation for predictable serving, has delivered measurable gains (23% retrieval accuracy, 38% latency reduction) and helped secure a $3M+ NSF grant.”
Senior AI/ML Data Scientist specializing in NLP, computer vision, and MLOps
“Applied LLMs and a graph-RAG architecture in Neo4j to automate an accounting firm's cross-checking of transactional books against tax regulations, indexing 1,000+ pages into a knowledge graph with vector search. Combines agentic LLM workflows with classical NER (Hugging Face/NLTK) and validates using expert-labeled held-out data plus precision/recall and measured accountant time savings after deployment.”
Senior Data Scientist specializing in GenAI, LLM systems, and production ML