Pre-screened and vetted.
Mid-level Generative AI Engineer specializing in LLMs, RAG, and agentic systems
“Built a production "Mini RAG Assistant" for internal document Q&A, focusing on grounded answers (anti-hallucination), retrieval quality, and latency/cost optimization. Uses LangChain/LangGraph for orchestration and applies a metrics-driven evaluation loop (including reranking and semantic chunking improvements) while collaborating closely with product stakeholders.”
Senior Full-Stack Engineer specializing in AI, cloud infrastructure, and DevOps
“Frontend engineer focused on building and scaling data-heavy, real-time dashboards with React/Next.js/TypeScript. Emphasizes performance and reliability at scale through modular architecture, centralized state (Zustand/Redux), strict API contracts, automated testing, and production monitoring (Grafana/CloudWatch), and has experience shipping quickly with feature-flagged rollouts and rapid iteration from user feedback.”
Junior Full-Stack Software Engineer specializing in React/Node, cloud, and LLM-powered automation
“Master’s program project lead who built and deployed a real-time sound recognition system (Flask + React Native + ML) that was adopted by 200+ university students. Demonstrates strong production engineering and cross-layer debugging—solving latency, unreliable uploads, and observability gaps using microservice separation, chunked/idempotent transfers, and packet-capture-driven network diagnosis—plus AWS/on-prem and IoT edge-to-cloud integration experience.”
“Built and deployed a production LLM-powered RAG knowledge system to unify operational/policy information across PDFs, wikis, and databases, emphasizing auditability and low-latency/cost performance. Improved answer relevance at scale by moving from pure vector search to hybrid retrieval with metadata filtering and reranking, and partnered closely with healthcare operations/compliance to define acceptance criteria and human-in-the-loop guardrails.”
Mid-level GenAI Engineer specializing in RAG, LLM agents, and enterprise automation
“Accenture engineer who built and shipped a production RAG-based automation/chatbot for SAP incident triage and troubleshooting, embedding thousands of runbooks/logs/tickets into a semantic search pipeline and integrating it into Teams/Slack. Reported major productivity gains (30–60% time reduction), >90% validated answer accuracy, and sub-2-second responses, with strong orchestration (Airflow/Prefect/LangGraph) and reliability practices (guardrails, testing, monitoring).”
Senior Data Scientist specializing in LLM applications, RAG systems, and production ML
“Senior Data Scientist in consulting who has built production RAG systems for insurance/annuity document search at large scale (100K+ PDF pages), emphasizing grounded answers, guardrails, and low-latency retrieval. Experienced in end-to-end MLOps for LLM apps—monitoring, evaluation sets, drift handling, and safe rollouts—and in orchestrating complex pipelines with Prefect/Airflow and deploying services on Kubernetes.”
Mid-level Software Engineer specializing in Java/Spring Boot microservices
“Full-stack AI engineer who built Skillmatch AI, an LLM/RAG-based job matching platform using FastAPI microservices, Airflow-orchestrated async pipelines, and Pinecone vector search (sub-second retrieval across 50k+ vectors) deployed on GCP with autoscaling. Also partnered directly with a cancer researcher to automate SEER + PubMed-driven report generation via an AI pipeline, emphasizing rapid prototyping and outcome-focused communication.”
Mid-level Software/AI Engineer specializing in GenAI, AWS, and microservices
“Built a production AI pipeline at EyCrowd to automatically grade shaky outdoor user-submitted brand videos using CV + CLIP/BLIP and a LangChain RAG layer per brand, with GPT-4 generating structured JSON explanations and grades. Optimized for latency and cost (batch PyTorch inference, caching), cutting review time from ~8 minutes to <2 minutes while reaching ~90% alignment with human graders and supporting thousands of videos/day.”
Junior Machine Learning Engineer specializing in LLMs, RAG, and on-device AI
“Built an "Offline Study Assistant" that runs LLM inference locally on a 5-year-old Android device using Llama.cpp and the Android NDK, achieving a 27x speedup and cutting time-to-first-token from 11 minutes to 30 seconds. Also has applied backend/API experience with FastAPI, Supabase (Auth + RLS), and production hardening of a RAG system at Hashmint using Celery and Redis to eliminate PDF-processing-related query failures.”
Senior Full-Stack Engineer specializing in cloud-native microservices and AI/ML integration
Senior Full-Stack Developer specializing in React, Node.js, and AWS
“Backend/data engineer with hands-on production experience across Python/Flask microservices and AWS serverless/data platforms (Lambda, DynamoDB, S3, Glue/PySpark). Demonstrated strong reliability and operations mindset (JWT/RBAC, retries/timeouts/circuit breakers, CloudWatch/SNS alerting) and measurable performance wins (SQL report runtime cut from 10 minutes to 30 seconds). Seeking ~$150k base and cannot travel for onsite meetings for the next 5–6 months due to family medical constraints.”
Mid-level AI Engineer specializing in NLP and production ML systems
“AI/LLM engineer who has shipped production RAG chatbots using LangChain/OpenAI with FAISS and FastAPI, focusing on real-world constraints like context windows, concurrency, and latency (reported ~40% latency reduction and <2s average response). Experienced orchestrating AI pipelines with Celery and fault-tolerant long-running workflows with Temporal, and has applied NLP model tradeoff testing (Word2Vec vs BERT) to drive measurable accuracy gains.”
Junior AI Engineer specializing in Generative AI, RAG, and NLP
“AI/LLM engineer who has shipped a production RAG platform at Ticker Inc. on GCP (Qdrant + Postgres) delivering sub-second retrieval over 550k+ items, with measurable gains in latency and answer quality (HNSW optimization, MMR re-ranking). Also built an asynchronous LangChain/LangGraph multi-agent research system (10x faster cycles) and partnered with Indiana University doctors on synthetic patient records and ML error analysis using clinician-friendly F1/loss dashboards.”
Mid-level AI/ML Engineer specializing in Generative AI and LLM-powered NLP
“LLM/AI engineer who built a production automated document-understanding pipeline on Azure using a grounded RAG layer, designed to reduce manual review time for unstructured financial documents. Demonstrates strong real-world scaling and reliability practices (Service Bus queueing, Kubernetes autoscaling, observability, retries/circuit breakers) plus rigorous evaluation (shadow testing, replaying traffic, multilingual edge-case suites) and stakeholder-friendly, evidence-based explainability.”
Senior Machine Learning Engineer specializing in LLMs, RAG, and agentic AI systems
“LLM/RAG practitioner who has taken a support-ticket triage automation system from prototype to production, building the full pipeline (fine-tuned models, FastAPI inference services, vector storage, monitoring) and delivering measurable impact (~40% reduction in triage time). Demonstrates strong operational troubleshooting of LLM/agentic workflows (observability-driven debugging, fixing agent routing/looping) and supports adoption through tailored demos and sales-aligned technical communication.”
Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and agentic workflows
“Applied AI/ML engineer with hands-on production experience building a RAG-based AI assistant for pharmaceutical maintenance troubleshooting using LangChain + FAISS/Pinecone, including a custom normalization layer to handle inconsistent terminology and duplicate document revisions. Also built Airflow-orchestrated pipelines for document ingestion/embeddings and predictive maintenance workflows (SCADA ETL, drift-based retraining), and partnered closely with production supervisors/quality engineers via Power BI dashboards and real-time alerts.”
Mid-level AI Engineer specializing in Generative AI and multimodal RAG systems
“GenAI/LLM engineer who built and productionized a 0-1 application (EMULaiTOR at Lumanity) combining qualitative + quantitative data using Postgres/pgvector RAG and prompt engineering, deployed with Azure backend and AWS-hosted frontend. Demonstrates strong production instincts (latency reduction via region alignment, autoscaling/health checks) and hands-on agent/tool-call debugging, plus experience enabling sales and winning a large pharma client.”
Mid-Level Software Engineer specializing in cloud-native microservices
“Built and shipped both a solo real-time multiplayer Spades game (TypeScript monorepo with shared client/server engine) and a production internal LLM-powered document Q&A tool for a SaaS company. Demonstrates strong RAG pipeline design (Pinecone + embeddings + reranking), rigorous eval/regression practices, and pragmatic data ingestion/observability work across Confluence, Notion, and messy PDFs/OCR—backed by clear metric improvements (P@1 61%→78%, escalations 40%→22%).”
Mid-level AI Engineer specializing in AI agents, RAG pipelines, and LLM evaluation
“Built and shipped production LLM systems at Founderbay, including a low-latency voice agent and a graph-based multi-agent research assistant. Strong focus on reliability in real workflows—hybrid SERP + full-site scraping RAG, grounding guardrails, validation checkpoints, and transcript-driven evaluation—plus performance tuning with async FastAPI, Redis caching, and containerization. Also partnered with a non-technical ops lead to automate post-call follow-ups via call summarization, field extraction, and tool-triggered actions.”
Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps
“Built and deployed a production LLM/RAG intelligent document understanding platform for healthcare clinical documents (notes, discharge summaries, diagnostic reports), integrating spaCy entity extraction, Pinecone vector search, and a Spring Boot API on AWS with monitoring and guardrails. Demonstrates strong MLOps/orchestration (LangChain, Airflow, Kubeflow/Kubernetes) and a metrics-driven evaluation approach, and partnered with a healthcare operations manager to cut manual review time by 80%.”
Mid-level AI/ML Engineer specializing in healthcare ML, MLOps, and LLM/RAG systems
“Healthcare-focused ML/LLM engineer who built a production hybrid RAG workflow to automate prior authorization by retrieving from medical guidelines/historical cases (FAISS) and generating grounded rationales for clinicians. Strong in operationalizing ML with Airflow/Kubeflow/MLflow on SageMaker, optimizing latency (ONNX/quantization/async), and reducing hallucinations via evidence-only prompting; also partnered closely with clinical ops to deploy a readmission prediction tool used in daily rounds.”
Mid-level Data Engineer specializing in AI/ML, RAG systems, and cloud data pipelines
“Built a production lead-generation system using AI agents that researches the internet for relevant leads and integrates RAG-based contact enrichment/shortlisting aligned to existing CRM data, enabling sales reps to focus more on selling. Also has hands-on AWS data orchestration experience (Glue, Step Functions) moving raw data into Redshift and evaluates agent performance with human-in-the-loop plus BLEU/perplexity metrics.”
Mid-level AI Engineer & Researcher specializing in healthcare AI and multimodal LLM systems
“Backend/ML engineer focused on clinical AI transparency who built ShifaMind, an explainability-enforced clinical ML system using UMLS/MIMIC-IV/PubMed data with RAG, GraphSAGE, and cross-attention. Demonstrated strong production engineering via FastAPI API design and safe migrations (feature flags/shadow inference), plus HIPAA-aligned auth/RLS patterns; also delivered a real-time comet detection system reaching 97.7% accuracy.”