Pre-screened and vetted.
Mid-level Machine Learning Engineer specializing in LLMs and NLP classification systems
“Internship experience building a production RAG+LLM pipeline to map messy card transaction descriptions to merchant brands, including a custom modified-ROUGE evaluation approach for weak/variant ground truth. Improved scalability and cost by moving from a managed LLM endpoint (e.g., Bedrock) to self-hosted vLLM, and orchestrated massive embedding backfills (5,000+ files, 10B+ rows) using an Airflow-triggered SQS + ECS worker architecture with robust retry/DLQ handling.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps on AWS
“LLM engineer who built a production document intelligence/RAG pipeline to extract structured data from thousands of unstructured PDFs, cutting manual review time by 60%. Experienced with LangChain and Airflow orchestration plus rigorous evaluation (labeled datasets, prompt testing, HITL review, monitoring) to improve accuracy and reduce hallucinations while partnering closely with non-technical operations stakeholders.”
Entry-Level Software Engineer specializing in ML and backend systems
“Built and deployed a production LLM-based real-time stance detection system for social media, fine-tuning LLaMA 3.1 on A100s with DeepSpeed ZeRO/FSDP and iteratively refining data to handle sarcasm and context-dependent meaning. Also has Kubernetes operations experience (Kafka/Logstash/Elasticsearch observability pipeline) and delivered an OCR automation project during a Worley India internship that saved 20+ hours/week for on-site energy safety stakeholders.”
Mid-Level Full-Stack Python Engineer specializing in cloud APIs and data/ML platforms
“Backend engineer at Goldman Sachs who deployed internal LLM-powered utilities to summarize operational logs/tickets, with a strong emphasis on data sensitivity and reliability. Built deterministic workflows with template-based prompts, confidence checks, and rule-based fallbacks, and used monitoring plus failure-rate metrics to tune performance; also has hands-on Temporal orchestration experience for resilient async backend jobs.”
Junior AI/ML Engineer specializing in LLM systems and retrieval-augmented generation
“Built and deployed a production LLM-powered market intelligence and decision-support platform for noisy, real-time financial data, using a high-throughput embedding + vector DB RAG architecture to reduce hallucinations while keeping latency and cost low. Operated it at scale with GPU-backed inference (continuous batching/quantization), FastAPI on Kubernetes, and Airflow-orchestrated ingestion/embedding/retraining workflows, with strong schema-based reliability and monitoring.”
Mid-level Machine Learning Engineer specializing in NLP and computer vision
“AI/ML engineer with production experience building an LLM-powered resume-to-job matching and feedback product using RAG, with a strong focus on latency, hallucination control, and scalable deployment. Experienced orchestrating ML inference and backend services on Kubernetes and applying rigorous evaluation/guardrail practices; also partnered with business/product stakeholders at Walmart to improve an NLP-based supplier support system.”
Mid-level Generative AI Engineer specializing in enterprise RAG and multimodal NLP
“Built and deployed a production LLM/RAG chatbot at Wells Fargo for securely querying regulated financial and compliance documents, emphasizing low hallucination rates, explainability, and strict governance. Experienced with LangChain multi-agent orchestration plus Airflow/Prefect pipelines for ingestion, embeddings, evaluation, and retraining, and partnered closely with compliance/operations to drive adoption through demos and feedback-driven retrieval rules.”
Mid-level Full-Stack & GenAI Engineer specializing in RAG and LLM applications
“Software engineer working on an e-commerce platform, currently building a RAG-based recommendation system with a team new to the technology. Has delivered an end-to-end React/TypeScript website for a local car dealer and built an internal "encryption as a service" tool to secure sensitive data across repositories and through release/UAT, with experience debugging microservices integration issues.”
Mid-level Software Engineer specializing in AI, big data, and distributed systems
“Software Developer at NYU (GEMSS) focused on scaling and optimizing a data-heavy asset management web app, including migrating/optimizing data access via the Google Sheets API and Firestore. Previously an SDE at Sainapse working on Spring Boot microservices POCs (Kafka, Hadoop at 2B+ record scale). Built an end-to-end Apple Wallet coupon generation/redemption system using PassKit + Google Apps Script with measurable ops impact (40% efficiency gain).”
Junior Machine Learning Engineer specializing in LLMs and applied data science
“Built and shipped multiple production AI systems, including Auto DocGen (LLM-generated OpenAPI docs kept in sync via AST diffs, schema-constrained generation, and CI/CD on Render) and a multimodal sign-language recognition pipeline at USC orchestrated with FastAPI, MediaPipe, and PyTorch. Also partnered with Esri’s non-technical community team to fine-tune a LLaMA-based spam classifier with a review UI, cutting moderation time by 70%.”
Senior Data Scientist / ML Engineer specializing in cloud ML pipelines and GenAI
“ML/NLP practitioner with experience building a transformer-failure prediction system that combines sensor signals with unstructured maintenance comments using LLM-based extraction and similarity validation. Strong emphasis on production readiness: data leakage controls, SQL-driven data quality tiers, and rigorous bias/fairness validation (including contract/spec evaluation across diverse company profiles).”
Junior Data Scientist and ML Researcher specializing in Transformers, multimodal AI, and autonomy
“Autonomous robotics student who built an end-to-end ROS2 semantic goal navigation system as a solo course project, integrating CLIP-based vision-language understanding with SLAM Toolbox and Nav2 to execute natural-language commands in Gazebo/RViz. Also implemented and tuned an RRT planner from scratch in Python and uses Docker plus GitHub workflows for reproducible, tested robotics codebases.”
Intern Machine Learning Engineer specializing in LLMs, MLOps, and NLP
“Built and deployed a production LLM-driven Dungeons & Dragons game where the model acts as a dungeon master, adding a structured combat system and a macro-state tree to ensure campaigns converge to a clear ending. Fine-tuned Gemini 2.5 Flash on Vertex AI and deployed on GCP with Kubernetes, using RAG over DnD rules/spells plus multi-agent orchestration (intent-based routing between narrative and combat agents) to reduce hallucinations and improve reliability.”
Mid-Level Software Engineer specializing in distributed systems and cloud-native backends
“AI/LLM engineer with production experience at Charles Schwab building a RAG-based assistant to help 5,000+ reps answer complex financial policy questions. Implemented a multi-layer anti-hallucination approach (GNN-driven ontology/graph retrieval + citation-only answers) and compliance-focused guardrails (Azure AI Content Safety) in partnership with audit/compliance stakeholders.”
Mid-level AI/ML Engineer specializing in Generative AI and Conversational AI
“GenAI Engineer at Infosys who built and deployed a production multi-agent RAG system for a top-tier bank, scaling to ~50,000 queries/day with 99.9% uptime. Drove measurable gains (45% accuracy improvement, 30% API cost reduction) through open-source LLM fine-tuning, Pinecone indexing/retrieval optimization, and AWS-based MLOps/monitoring, and has experience enabling adoption via developer workshops and customer-facing collaboration.”
Senior AI/ML Engineer specializing in Generative AI, agentic systems, and RAG
“Built and deployed an agentic RAG assistant in production to automate enterprise knowledge search and multi-step workflows with tool calling, tackling real-world issues like hallucinations, retrieval accuracy, and latency. Demonstrates strong LLMOps and orchestration depth (MLflow, Airflow, LangGraph/LangChain/LlamaIndex) plus a metrics-driven approach to agent testing/evaluation and cross-functional delivery with business stakeholders.”
Junior GenAI Software Engineer specializing in multimodal RAG and agentic workflows
“AI/LLM engineer with production experience building a multimodal RAG agent for Walmart driver support, combining hybrid retrieval (dense+BM25) and fine-tuned Llama 3 served via vLLM on Azure AKS to reach sub-second latency. Drove measurable impact (25% fewer escalations, 60% lower token costs, 33% lower storage costs) and also built Kafka-based microservices that cut batch runtime from 2 hours to 15 minutes and reduced DB load by 80%.”
Senior Full-Stack Engineer specializing in AI/LLM and cloud-native SaaS
“Software engineer with strong end-to-end ownership across frontend, backend, data, and infrastructure, including real-time systems (Kafka/Postgres) and observability (Datadog). Built and productionized an AI-native RAG support assistant (OpenAI embeddings + Pinecone) with prompt/guardrail design, achieving 48% agent adoption and 30% faster responses. Experienced in legacy modernization and reliability work using feature flags, event/transaction replay, and rapid embedded delivery.”
Principal AI/ML Engineer specializing in agentic AI and distributed systems
Mid-level Machine Learning Engineer specializing in production AI/ML systems and full-stack development
Junior AI Engineer specializing in LLM systems, RAG pipelines, and cloud microservices
Senior Data Scientist and Generative AI Engineer specializing in LLMs, RAG, and MLOps
Senior Research Scientist specializing in LLM safety, robustness, and out-of-distribution detection