Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps
“Built a production RAG-based healthcare chatbot to retrieve patient medical documents spread across multiple platforms, reducing manual and error-prone searching. Implemented semantic search with custom embeddings (Hugging Face) and Pinecone, deployed via FastAPI/Docker on AWS SageMaker with MLflow tracking, and optimized fine-tuning cost using LoRA while orchestrating retraining pipelines in Airflow.”
Mid-level Generative AI Engineer specializing in LLM agents and RAG
“GenAI/LLM engineer who built and deployed a production RAG system for enterprise document search and decision support, cutting manual lookup time by 40%+. Experienced with LangChain/LangGraph agent orchestration plus Airflow/Prefect for ingestion and incremental reindexing, with a strong focus on reliability (testing, observability) and stakeholder-driven metrics.”
Mid-level AI/ML & Full-Stack Engineer specializing in LLM agents and generative AI
“LLM/agent builder who shipped a live consumer AI-agent app (kalpa.chat) that visualizes complex reasoning as interactive graphs and abstracts multi-provider model usage via a unified wallet. Professionally has applied LangChain/LangGraph to IVR parsing and to scaling a football video-generation pipeline at DAZN, including shipping a VAR-specific retrieval/order fix via SQL after iterating with a non-technical PM.”
Mid-level AI/ML Engineer specializing in GenAI, MLOps, and anomaly detection
“LLM/MLOps engineer who has shipped a production RAG-based technical documentation assistant (FastAPI) cutting manual review by 45%, with deep hands-on retrieval optimization in Pinecone/LangChain (HNSW, hybrid + multi-query search, caching). Also brings healthcare domain experience—building Airflow-orchestrated EHR pipelines and delivering FDA-auditability-friendly predictive maintenance solutions using SHAP/LIME explainability surfaced in Power BI.”
Mid-level AI/ML Engineer specializing in Generative AI and intelligent automation
“LLM engineer who built and productionized a system to classify GitHub commits (performance vs non-performance) using zero-/few-shot approaches over commit messages and diffs, working at ~5M-record scale on multi-node NVIDIA GPUs. Experienced orchestrating end-to-end LLM pipelines with Airflow and GitHub Actions, and emphasizes reliability via testing, guardrails, and observability while collaborating closely with non-technical product stakeholders.”
Mid-level Machine Learning Engineer specializing in real-time pipelines and NLP/GenAI
“ML/MLOps practitioner from Discover Financial who built and deployed a real-time AI fraud detection platform (LSTM + VAE) on AWS SageMaker with Docker/FastAPI and Jenkins-driven CI/CD. Demonstrated measurable impact (30% accuracy lift, 25% fewer false alerts) and deep expertise in class-imbalance mitigation, drift monitoring, and orchestration (Airflow/Kubeflow), plus strong stakeholder adoption via Power BI dashboards for fraud/compliance teams.”
Mid-level GenAI/ML Engineer specializing in LLM systems and RAG chatbots
“Built and shipped a production agentic LLM analytics platform that lets non-SQL business users query relational databases in plain English via a RAG + LangChain/LangGraph workflow and FastAPI service. Emphasizes safety and reliability with guardrails (validation/access control), testing/evaluation frameworks, and performance optimization (caching, monitoring, Dockerized scalable deployment), reducing dependency on data teams and speeding analytics turnaround.”
Mid-level Data Scientist / AI-ML Engineer specializing in RAG, MLOps, and real-time analytics
“Software/ML engineer who built a production automated job-finding and cold-email personalization system for Fortune 500 outreach, using JobSpy for dynamic scraping, LangChain orchestration, and LLM+vector DB semantic search with grounding/relevance metrics and guardrails. Also delivered a predictive investment analytics platform for financial advisors, communicating results via Tableau dashboards and portfolio KPIs like Sharpe ratio and drawdowns.”
Mid-Level Full-Stack Engineer specializing in cloud-native e-commerce and AI/ML systems
“Full-stack engineer with strong ownership in fast-moving environments: designed and shipped a pre-order/campaign inventory system (NestJS + Strapi + Datadog) that freed 34% warehouse space and reduced stock risk to ~5.7%. Also built rapid, high-impact logistics features (Spot Sales) that drove last-mile cost to ~0 in ~40 days, and has hands-on AWS/Terraform/CI-CD experience including deploying a global RAG system with Pinecone, Datadog, and PagerDuty.”
Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG for banking and healthcare
“Deployed a real-time LLM-driven call center summarization and agent-assist platform at Fifth Third Bank, combining transformer models (BERT/GPT) with FastAPI inference on AKS and vector storage (ChromaDB/PostgreSQL). Emphasizes production-grade reliability (autoscaling, CI/CD, monitoring) and measurable evaluation (A/B testing), and translates model outputs into business-facing Power BI insights for call center leadership.”
Senior LLM Engineer specializing in Generative AI, RAG, and multimodal assistants
“GenAI/NLP engineer with experience building classification and summarization pipelines in PyTorch and deploying multimodal GPT-4-style workflows. Has integrated LLM applications across OpenAI, Azure OpenAI, and Amazon Bedrock, and uses LangChain/LlamaIndex/Semantic Kernel to orchestrate RAG and agent workflows with production-focused evaluation metrics like task success rate and groundedness.”
Mid-level AI Builder and Data Engineer specializing in GenAI and data pipelines
“Full-stack AI product engineer who personally built ViGenAir, a multimodal system that turns long-form video into ads using FastAPI, React, and agentic scoring. Stands out for handling complex 50GB+ media pipelines, re-architecting systems to eliminate OOM failures, and making opaque AI workflows usable through interactive visual UX that improved trust, speed, and retention.”
Junior Software Engineer specializing in Applied AI and backend systems
“Full-stack/AI product engineer who has shipped both a production-style React finance app and multiple LLM-powered systems end-to-end. Particularly strong in turning early-stage AI concepts into production workflows, including a Bedrock-based multi-turn chatbot with durable session memory and a medical credentialing document parser that cut pipeline failures by 50%+ on large, messy real-world files.”
“Software engineer with healthcare domain experience (patient monitoring and provider systems) who improves reliability and performance in complex React/Flask applications. Led API standardization for shared internal React utilities using an RFC + deprecation strategy, and optimized a live WebSocket dashboard to handle 3000+ concurrent clinics while reducing client CPU usage. Strong in production debugging, data ingestion validation, and operational improvements like structured logging and alerting.”
Senior Full-Stack AI Engineer specializing in LLM/RAG agentic systems
“Built and deployed JobMatcher AI, an LLM-driven workflow automation product for job seekers that extracts requirements from job descriptions, matches to user skills, and generates tailored outreach. Demonstrated strong production engineering by cutting per-run cost ~70%, improving reliability with retries/backoff/fallbacks, and reducing hallucinations via schema validation and templating; also orchestrated the system with LangGraph plus Docker Compose across API, vector DB, and workers.”
Junior Machine Learning Engineer specializing in GenAI and LLM fine-tuning
“Robotics software engineer focused on hard real-time autonomy for legged robots, building a quadruped navigation stack that combines vision SLAM with MPC and maintains a deterministic 500Hz control loop. Deep performance optimization experience across CUDA (sub-2ms perception latency), ROS 2/DDS real-time tuning, and motion planning (cut 500ms spikes to sub-5ms). Also designed distributed ROS 2 + Zenoh communications between quadrupeds and aerial drones and validated robustness under lossy wireless conditions.”
Mid-Level Data/ML Engineer specializing in Generative AI and cloud data platforms
“Built and productionized an LLM-based financial document analysis system using a RAG pipeline, including robust ingestion/chunking/embedding workflows, vector DB retrieval, and an AWS-deployed FastAPI service containerized with Docker. Demonstrates strong applied expertise in improving retrieval quality and latency at scale, plus hands-on experience debugging agentic/LLM workflows with monitoring and trace-based analysis while supporting demos and customer-facing adoption.”
“Built and owned a production RAG-based conversational AI system at Entera for real estate analysis, taking it from experimentation through AWS deployment, monitoring, and iterative improvement. Demonstrates strong practical judgment in retrieval design, LLM safety, and scalable Python service architecture, with measurable impact including 30-40% reduction in manual analysis time and roughly 30% better response accuracy.”
Intern Software Engineer specializing in AI agents, MLOps, and data engineering
Intern Full-Stack & AI Engineer specializing in LLM applications and computer vision
Mid-level Software Engineer specializing in distributed systems, cloud, and LLM applications