Pre-screened and vetted.
Entry-level Software Engineer specializing in AI systems and GPU infrastructure
“Built a production LLM-powered diagnostic agent at Supermicro that automated triage of NVIDIA H100/H200 GPU cluster failures by parsing BMC/Redfish logs and recommending fixes from historical RMA data. Their work combined agent architecture, reliability engineering, and backend optimization, delivering a 30% reduction in resolution time and 50% lower database load.”
Mid-level AI/ML Engineer specializing in LLMs, MLOps, and healthcare-fintech AI
“Built and owned a production GPT-4 RAG assistant for clinical and enterprise query resolution, taking it from initial experiment to deployment, monitoring, and iterative improvement. Their work cut resolution time from 45 minutes to under 2 minutes, achieved roughly 95% accuracy, and scaled to thousands of additional monthly queries while emphasizing safety and trust in a sensitive clinical domain.”
Mid-level AI/ML Engineer specializing in Generative AI and agentic systems
“Backend/platform engineer who has owned a Python/FastAPI results API and deployed it on Kubernetes with Helm and GitHub Actions-driven CI/CD. Demonstrates strong production operations mindset across performance tuning, monitoring, safe rollouts/rollbacks, and phased migrations, plus hands-on Kafka streaming experience focused on ordering and idempotency.”
Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI
“LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.”
Mid-level AI/ML Engineer specializing in LLMs, NLP, and MLOps
“AI/ML engineer with healthcare domain depth who led a HIPAA-compliant, production LLM system at McKesson to automate clinical document understanding—extracting entities, summarizing provider notes, and supporting authorization decisions. Hands-on across Spark/Python ETL, Hugging Face + LoRA/QLoRA fine-tuning, RAG, and cloud-native MLOps (Airflow/Kubernetes/Step Functions, MLflow, blue-green on EKS/GKE), with explicit work on PHI handling and hallucination reduction.”
Mid-level AI/ML Engineer specializing in enterprise ML, MLOps, and Generative AI
“ML/LLM engineer who has shipped production RAG systems (LangChain + HF Transformers + FAISS) with hybrid retrieval and cross-encoder re-ranking, deployed via FastAPI/Docker/Kubernetes and monitored with MLflow. Also partnered with wealth advisors at Edward Jones to deliver a client retention model with SHAP-driven explanations and a dashboard that improved trust, adoption, and reduced high-value client churn.”
Mid-Level Software Development Engineer specializing in full-stack and LLM/AI systems
“AI engineer with hands-on production experience building an end-to-end RAG system that reduced document-answering time from hours to minutes, improving accuracy through chunk overlap and hybrid BM25+semantic retrieval. Also built a LangGraph-based agent that researches company financial news via web search (Google Serper), using Pydantic structured outputs and checkpointing for reliability; experienced collaborating with non-technical stakeholders at JPMC and communicating ROI.”
Senior ML Engineer & Data Scientist specializing in LLM agents, retrieval/ranking, and MLOps
“Machine Learning Engineer currently at Webster Bank building an enterprise-scale LLM agent for Temenos Journey Manager/Maestro, using RAG-style multi-stage retrieval with FAISS/Pinecone, hybrid dense+sparse search, and LoRA fine-tuning optimized via NDCG/MAP and A/B testing. Previously handled messy incident/telemetry data at Deuta Werke GmbH with deterministic + fuzzy entity resolution, and has strong production data engineering experience across Spark/Hadoop and Python ETL systems.”
Junior Machine Learning Engineer specializing in computer vision and LLM applications
“Built and led an autonomous driving software effort for Formula Student, owning the full autonomy stack (perception, planning, control) orchestrated in ROS. Implemented stereo depth + YOLO object detection, RRT/RRT* planning, and a robust SLAM pipeline (Kalman filter, submapping) while leveraging Gazebo simulation and modern deployment tooling (Docker/Kubernetes, AWS, GitHub Actions CI/CD).”
Mid-level Machine Learning & Data Infrastructure Engineer specializing in MLOps on AWS
“Built and deployed a fine-tuned Qwen 2.5 14B model into production at Dextr.ai as the backbone for hotel-operations agentic workflows, running on AWS EKS with Triton and TensorRT-LLM. Demonstrates strong cost-aware LLM engineering (QLoRA, FP8/BF16 on H100) plus rigorous benchmarking/observability (Prometheus, LangSmith) with reported sub-30ms TTNT. Previously handled long-running ETL orchestration with Airflow at GE Healthcare and Lowe's.”
Mid-level Machine Learning Engineer specializing in GenAI, LLMs, and real-time ML systems
“Built and deployed a production long-form article summarization system using BART/T5/PEGASUS, tackling real-world constraints like token limits, latency/quality tradeoffs, and factual drift via chunking/merge logic and constrained decoding. Uses pragmatic Python-based pipeline orchestration (scheduled jobs, modular scripts, logging/retries) and iterates with stakeholder feedback to make outputs genuinely useful for content workflows.”
Mid-level Generative AI Engineer specializing in LLM apps, RAG, and MLOps
“LLM/GenAI engineer with US Bank experience building a production financial-document intelligence platform using LangChain/LangGraph, GPT-4, and Amazon OpenSearch. Delivered a RAG-based assistant for compliance/audit teams with grounded, cited answers, focusing on reducing hallucinations and latency, and deployed securely on AWS (SageMaker/EKS) with CI/CD and evaluation tooling (LangSmith, RAGAS).”
Mid-level AI/ML & MLOps Engineer specializing in cloud AI infrastructure and GenAI
“At HPE, led and deployed an enterprise-grade LLM document intelligence platform for an insurance client, automating extraction from highly variable PDFs/scans/emails and raising field accuracy from 74% to 93%. Built a LangChain/Pinecone/OpenSearch RAG framework to cut hallucinations by 37% and operationalized LangSmith evals in CI, driving a 41% triage accuracy lift and >33% fewer incorrect resolutions while partnering closely with claims operations via HITL workflows.”
Junior Data Scientist / ML Engineer specializing in LLMs and Computer Vision
“Currently working in CoRAL Lab, built and deployed IntegrityShield—a document-layer PDF watermarking system that keeps assessments visually identical while disrupting LLM-based solving; validated in a real classroom where it helped catch 12 AI-cheating cases. Also built MALDOC, a modular red-teaming platform for document-processing AI agents using LangGraph to run reproducible, deterministic adversarial trials across OCR/text/vision routes.”
Executive Technology Leader (CTO) specializing in IoT sensing, AI/ML, and RF/embedded systems
“Currently a startup CTO who thrives on building new technology stacks and rapidly turning technical ideas into products. Interested in partnering with a CEO/business team to commercialize embedded/edge concepts such as multi-sensor drone localization (video/audio/RF with SDR), low-cost solar+battery power nodes networked via LoRa, and an Amazon Sidewalk/LoRa connectivity device with cloud management.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and enterprise AI
“Built an enterprise RAG-based document intelligence system at Freddie Mac for regulatory and financial documents, helping analysts cut search time from hours to minutes while improving retrieval accuracy by ~30%. Stands out for combining LLM product delivery with compliance-grade auditability, production monitoring, and scalable Python/FastAPI service design.”
Mid-level Software Engineer specializing in backend systems and applied AI
“Full-stack/product-minded engineer with strong React/TypeScript depth who has owned systems end-to-end, from UI architecture to backend services and data design. At Qualcomm, they built both a telemetry dashboard and an ML model drift monitoring platform for 20+ edge models, including post-launch tuning that cut false positives by 60%. They also demonstrate 0→1 startup execution by solo-building a production RAG document Q&A platform with JWT auth, Stripe gating, and sub-300ms retrieval.”
Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal generation
“Open-source JavaScript contributor focused on performance and maintainability in data visualization libraries—refactored legacy ES5 into modular ES6, added tests/docs, and delivered ~30% faster load times with positive community adoption. Also optimized a React dashboard (~40% load-time reduction) and took ownership in an ambiguous AI product initiative by setting milestones, standing up an initial ML pipeline, and shipping a prototype in ~6 weeks that became the basis for production.”
“ML/GenAI engineer with recent CVS Health experience building a production RAG system over unstructured financial/research documents using LangChain, FAISS, and Pinecone, plus LoRA/PEFT fine-tuning of GPT/LLaMA for domain-aware summarization. Demonstrates strong applied MLOps and data engineering skills (Airflow/Prefect, Docker/Kubernetes, CI/CD, MLflow) and measurable impact (sub-second retrieval, ~40% better context retrieval, ~25% entity matching improvement).”
Junior Software Engineer specializing in AI, RAG systems, and backend development
“Built an NYU software engineering capstone called “Smart Cash AI,” a multi-agent LLM-powered web app that curates offline-ready podcasts/articles/videos/news based on user preferences and commute schedules. Architected agent orchestration (discovery/downloader/summarizer), real-time progress via WebSockets, and an ETL normalization layer across RSS/YouTube and other sources with GUID-based deduplication, retries, and failure isolation to keep the system predictable.”
Mid-level Generative AI & Machine Learning Engineer specializing in agentic LLM systems
“Built and deployed a production agentic LLM knowledge assistant that answers complex questions over internal documents, APIs, and databases using a RAG architecture (FAISS/Pinecone) and LangChain/LangGraph orchestration. Emphasizes production-grade reliability and hallucination control through grounding, confidence thresholds, validation, retries/fallbacks, and full observability (logging/metrics/traces) with continuous evaluation and feedback loops.”
Mid-level AI/ML Engineer specializing in LLM agents, RAG, and ML systems
“At Inertia Systems, built a production LLM-powered ingestion pipeline that converts heterogeneous sources (PDF/JSON/IFC/SQL and financial tables) into standardized text and uses GraphRAG to construct a knowledge graph with verified dependency relationships. Also has hands-on HPC orchestration experience with SLURM, including creating a custom wrapper process manager to improve resource utilization under restrictive scheduling policies.”
Intern-level Software Engineer specializing in AI/ML systems
“Built production LLM/RAG systems during a UPS internship, including a shipment knowledge agent used across 15+ hubs worldwide and a multi-agent PDF RAG workflow. Stands out for combining hands-on enterprise integration with rigorous evaluation, hallucination reduction, and efficient fine-tuning techniques like LoRA.”
Junior Full-Stack Software Engineer specializing in AI, FinTech, and e-commerce
“Built both traditional internal tooling and LLM-powered systems during an internship, including a React/Python/AWS calculator onboarding platform and a production-style ROS2 RAG assistant over 10K+ documents. Stands out for combining full-stack delivery, stakeholder coordination, and practical AI reliability work like retrieval tuning, source-grounded answers, and low-confidence fallbacks.”