Pre-screened and vetted.
Mid-level AI Engineer specializing in ML, NLP, and Generative AI
“AI/LLM engineer with production experience building an LLM-powered investment recommendation system using RAG and chatbots, deployed via Docker/CI/CD and scaled on Kubernetes. Demonstrated measurable performance wins (sub-200ms latency) through QLoRA fine-tuning and TensorRT INT8/INT4 quantization, plus strong MLOps/orchestration background (Airflow ETL + scoring, MLflow monitoring) and stakeholder-facing delivery using demos and Tableau dashboards.”
Mid-level AI/ML Engineer specializing in Generative AI and RAG systems
“LLM/RAG engineer who has built and shipped production assistants, including a RAG-based teaching assistant (Marvel AI) using LangChain/LlamaIndex/ChromaDB with OpenAI embeddings and Redis vector search, achieving ~30% accuracy gains and ~35% latency reduction. Also deployed FastAPI services on Google Cloud Run with observability and prompt-level monitoring, and partnered with non-technical ops stakeholders to deliver an internal policy-document RAG assistant.”
Mid-level Data Scientist & Product Ops/Analytics professional specializing in AI and KPI systems
“Cross-functional operator/chief-of-staff style leader who took a product from prototype to a live pilot in 3 months, spanning public-sector data normalization, an ML matching engine, a secure API, and KPI/investor demo instrumentation. Strong focus on executive alignment and productivity via Notion-based operating systems plus automated reporting (Python/Power BI), with experience supporting fundraising and go-to-market narratives.”
Junior AI Engineer specializing in Generative AI, RAG, and NLP
“AI/LLM engineer who has shipped a production RAG platform at Ticker Inc. on GCP (Qdrant + Postgres) delivering sub-second retrieval over 550k+ items, with measurable gains in latency and answer quality (HNSW optimization, MMR re-ranking). Also built an asynchronous LangChain/LangGraph multi-agent research system (10x faster cycles) and partnered with Indiana University doctors on synthetic patient records and ML error analysis using clinician-friendly F1/loss dashboards.”
Mid-level Machine Learning Engineer specializing in computer vision and reinforcement learning
“Early-stage engineer with hands-on embedded prototyping experience (Arduino/Raspberry Pi) who helped build an award-winning smart glasses project enabling phone notifications via Bluetooth. Strong computer vision performance optimization background, including accelerating 120 FPS inference by moving from TensorFlow to PyTorch and deploying through ONNX + TensorRT quantization, plus Docker-based GPU deployment and CI/ML practices.”
Mid-level AI/ML Engineer specializing in Generative AI and LLM-powered NLP
“LLM/AI engineer who built a production automated document-understanding pipeline on Azure using a grounded RAG layer, designed to reduce manual review time for unstructured financial documents. Demonstrates strong real-world scaling and reliability practices (Service Bus queueing, Kubernetes autoscaling, observability, retries/circuit breakers) plus rigorous evaluation (shadow testing, replaying traffic, multilingual edge-case suites) and stakeholder-friendly, evidence-based explainability.”
Junior AI Engineer specializing in LLMs, RAG, and MLOps
“At ReferU.AI, designed and deployed an agentic RAG pipeline that automates multi-jurisdiction legal document drafting, emphasizing hallucination reduction through hybrid retrieval, validation agents, guardrails, and iterative regeneration. Experienced with orchestration frameworks (especially CrewAI) and rigorous testing/evaluation practices including human-in-the-loop review, adversarial testing, and production metrics/logging.”
Mid-level Software Engineer specializing in cloud data platforms and serverless ETL
“Data/ML engineer from HCLTech who modernized enterprise data by linking fragmented financial and supply-chain data across SAP/SQL Server/Snowflake using NLP entity linking and embeddings (FAISS). Delivered measurable impact including ~40% reduction in manual error-log triage and entity-linking accuracy improvements from ~86% to ~93%, with results surfaced in Power BI for real-time analytics.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“Built and shipped a production real-time content moderation platform for Zoom/WebEx-style meetings, combining Whisper speech-to-text with fast NLP classifiers and REST APIs to flag hate speech, bias, and HIPAA-related content under strict latency constraints. Demonstrates strong MLOps/infra depth (Airflow, Kubernetes, Terraform/Helm, observability) and a pragmatic approach to reducing false positives via threshold tuning, context validation, and hard-negative data—while partnering closely with compliance and product stakeholders.”
Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps
“Built and deployed a production LLM/RAG intelligent document understanding platform for healthcare clinical documents (notes, discharge summaries, diagnostic reports), integrating spaCy entity extraction, Pinecone vector search, and a Spring Boot API on AWS with monitoring and guardrails. Demonstrates strong MLOps/orchestration (LangChain, Airflow, Kubeflow/Kubernetes) and a metrics-driven evaluation approach, and partnered with a healthcare operations manager to cut manual review time by 80%.”
Mid-level AI/ML Engineer specializing in production ML, MLOps, and NLP
“Built and deployed a transformer-based clinical document classification system that processes unstructured clinical notes in a HIPAA-compliant healthcare setting, served via FastAPI on AWS and integrated into an Airflow/S3 pipeline. Demonstrates strong end-to-end MLOps skills (data quality remediation, low-latency inference optimization, monitoring with MLflow/CloudWatch) and effective collaboration with clinicians to drive adoption.”
Mid-Level Software Engineer specializing in distributed systems and cloud microservices
“Built and productionized a RAG-based semantic search system for video-derived data, focusing on measurable success metrics (p95 latency, reliability, cost/request) and strong observability (prompt versions, retrieved docs, tool calls, token usage). Experienced in diagnosing real-time issues in LLM/agentic workflows and in supporting go-to-market efforts through tailored technical demos, rapid POCs, and post-close onboarding.”
Mid-level Software Engineer specializing in full-stack and distributed systems
Mid-level Machine Learning Engineer specializing in LLMs and multilingual NLP
Senior Machine Learning Engineer specializing in Generative AI, RAG, NLP, and Computer Vision
Mid-level Machine Learning Engineer specializing in NLP, MLOps, and predictive risk modeling
Entry-level ML and analytics engineer specializing in robotics and automation
Junior AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps
Mid-level AI/ML Engineer specializing in cloud MLOps and real-time data pipelines