Pre-screened and vetted.
“Built and owned end-to-end production systems for a healthcare platform, including a predictive task recommendation feature (React + FastAPI + ML on AWS ECS) that cut backlog 20% and saved coordinators ~10 hours/week. Also productionized an AI-native RAG system (vector DB + LLM) delivering 40% faster query resolution, and led phased modernization of a monolithic FastAPI service into async microservices using feature flags and canary releases.”
Junior AI Engineer specializing in LLM pipelines, RAG, and computer vision
“Built and deployed an on-prem, HIPAA-compliant LLM pipeline for oncology-focused clinical note generation and decision support, emphasizing grounded differential diagnosis and explainable reasoning via RAG to reduce hallucinations. Also created a LangGraph-based multi-agent academic paper search system integrating Tavily, arXiv, and Semantic Scholar with an orchestrator that routes tasks to specialized sub-agents.”
Junior AI Engineer specializing in fraud detection, credit risk, and LLMs in FinTech
“AI engineer with production experience building a high-accuracy (98%) fraud detection system operating at real-time latency (1–2s) over millions of transactions, using a multi-model pipeline approach to meet performance constraints. Also implemented Airflow-orchestrated workflows (DAGs, retries, alerts) to replace brittle cron scripts and is currently pursuing a master’s project on real-time ASL-to-text conversion.”
Intern Software Engineer specializing in ML/NLP and LLM applications
“Full-stack AI/LLM engineer who has deployed a production LLM backend (Mistral 14B) on GKE to auto-transform datasets and generate runnable ML training pipelines, addressing hallucinations, schema mismatch, latency, and burst scaling with caching/prompt compression and HPA. Also has internship experience (Splunk, BlackOffer) delivering data automation and 10+ Power BI dashboards for non-technical stakeholders with measurable efficiency gains.”
Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems
“Built and deployed a production multi-agent RAG system at JPMorgan Chase to automate regulated credit analysis and compliance clause discovery across large internal policy/document libraries. Implemented LangGraph-based supervisor orchestration with structured state management (Azure OpenAI) to support long-running, resumable workflows, plus hybrid retrieval + re-ranking and guardrails for reliability. Strong at evaluation/observability (trace logging, LLM-judge, HITL) and at communicating results to non-technical stakeholders via Power BI embeds and Streamlit prototypes.”
Senior Software Engineer specializing in backend infrastructure, cloud automation, and reliability
“End-to-end deployment owner for Oracle document delivery/print services in a hospital-like production environment, focused on reliability/performance at scale (thousands of systems). Also describes implementing event-driven RAG/agentic LLM workflows with attention to embeddings/index consistency, latency, and measurable improvements in response relevance and operational efficiency.”
Mid-level Generative AI Engineer specializing in enterprise LLM and healthcare AI solutions
“Built and owned an end-to-end LLM-powered fraud investigation assistant that automated case summaries and risk analysis, cutting analyst investigation/documentation time by 40%. Stands out for translating RAG concepts into a production-grade internal platform with strong evaluation, monitoring, and reusable Python service architecture that improved both analyst trust and engineering velocity.”
Senior Full-Stack & Mobile Software Engineer specializing in cloud-based applications
“Data/ML backend engineer with hands-on production experience spanning RAG services (LlamaIndex/OpenAI) and AWS data platforms. Has delivered Terraform-managed AWS architectures (Lambda + ECS Fargate) with secure secrets handling, built Glue-to-Redshift ETL with schema evolution controls, modernized SAS reporting into Python microservices, and achieved major Redshift query speedups (2+ hours to under 15 minutes).”
Mid-Level Software Engineer specializing in LLM agents and real-time data streaming
“Software engineer with experience at Striim and Amazon who ships end-to-end production systems across UI, backend, ML, and operations. Built a real-time PII detection capability for a streaming data platform by integrating Python ML inference into a Java monolith via gRPC sidecars, achieving ~3M events/hour throughput and ~93% accuracy, and helped drive enterprise adoption (Fiserv, CVS). Also modernized internal Amazon tooling for multi-region scale with modularization and fully automated deployments.”
Mid-level AI/ML Engineer specializing in Generative AI, Conversational AI, and RAG systems
“Built and shipped a production enterprise RAG knowledge assistant that returns grounded, cited answers and uses confidence-based fallbacks (clarifying questions/abstention) with monitoring and compliance controls for sensitive data. Implemented end-to-end agent orchestration (function calling, structured JSON, state, retries/rate limits) plus eval/feedback loops, and achieved a reported 30–40% improvement in knowledge-task completion time while reducing hallucinations via retrieval improvements.”
Mid-level GenAI Engineer specializing in RAG, LLMs, and enterprise AI
“Built and shipped production LLM agents that automate document processing and decision workflows, with a strong focus on reliability, guardrails, and measurable business impact. Stands out for combining RAG, tool calling, evals/monitoring, and ERP integration to deliver 30-35% manual effort reduction and higher throughput without additional headcount.”
Mid-level AI/ML Engineer specializing in NLP, Generative AI, and predictive analytics
“GenAI/LLM engineer who architected and deployed a production RAG “research assistant” for JPMorgan Chase’s regulatory compliance team, focused on safety-critical behavior (mandatory citations, refusal when evidence is missing). Deep hands-on experience with LlamaIndex, Pinecone, Hugging Face embeddings, LangGraph agent workflows, and metric-driven evaluation (golden sets, TruLens), including a reported 28% relevancy lift via cross-encoder re-ranking.”
Intern Full-Stack Software Engineer specializing in test analytics platforms
“Software engineer intern at Nutanix who independently shipped and maintained an internal smoke-test/failure-analysis dashboard, integrating failure data from multiple upstream systems (e.g., Jira, Jenkins, CircleCI) via REST APIs. Also has prior data-science experience building Postgres-based asset management analytics with automated reporting and indexing for faster time-series retrieval.”
Mid-level AI/ML Engineer specializing in GenAI, RAG, and enterprise data platforms
“Built and shipped a production LLM-powered RAG assistant for enterprise internal document search (PDFs, knowledge bases, structured data), addressing real-world issues like noisy documents, hallucinations, and latency with grounded prompting, retrieval-confidence fallbacks, and performance optimizations. Also partnered with compliance and business teams at JPMc to deliver a solution aligned with regulatory constraints, supported by monitoring, feedback loops, and systematic evaluation.”
Senior AI/ML Engineer specializing in LLMs, GenAI, and MLOps
“AI/ML engineer (Cognizant) who built a production, real-time credit card fraud detection platform combining deep-learning anomaly detection with an LLM-based explanation layer. Strong focus on regulated deployment: addressed class imbalance and feature drift, and added guardrails (SHAP/structured inputs, fine-tuning on analyst reports, rule-based validation) to keep explanations accurate and compliant. Orchestrated the full pipeline with Airflow + Databricks/Spark and used MLflow/Prometheus plus A/B and shadow deployments for measurable reliability.”
Mid-level Data Scientist specializing in machine learning and generative AI
“ML/LLM engineer who has shipped a production transformer-based document understanding system on AWS, owning the full pipeline from domain fine-tuning to Dockerized CI/CD deployment. Demonstrates strong production rigor—latency optimization (distillation/quantization, async batching, autoscaling), orchestration with Airflow/Step Functions/Azure Data Factory, and monitoring/drift detection—plus experience translating ops stakeholder needs into adopted AI automation via dashboards.”
Mid-Level Backend Software Engineer specializing in FinTech and scalable APIs
“Backend/microservices engineer with fintech loan-lifecycle experience operating low-latency (sub-250ms) services in production using Kafka, idempotent transaction design, and Datadog observability. Also built an end-to-end LLM chatbot (React + Flask) with a decoupled model integration layer (FLAN-T5 via Hugging Face) and has experience designing partner-facing REST APIs with OAuth2/JWT and Swagger documentation.”
Mid-level AI/ML Engineer specializing in Generative AI, NLP, and Computer Vision
“ML/AI engineer with strong end-to-end production ownership across predictive ML and Generative AI use cases. They built a churn prediction platform that cut churn 12% and preserved about $1.2M in annual revenue, and also shipped a RAG-based support assistant that reduced ticket resolution time 30% while improving agent satisfaction and onboarding speed.”
Senior AI/ML Engineer specializing in Generative AI, LLMs, and production ML systems
“ML/AI engineer with hands-on ownership of both classical ML and GenAI systems in production. They built an end-to-end churn prediction service on AWS and also shipped RAG-based document search/summarization features, with clear experience in monitoring, hallucination reduction, cost/latency optimization, and creating shared Python/LLM infrastructure used across teams.”
Mid-level MLOps/DevOps Engineer specializing in cloud automation and ML pipelines
Junior Software Engineer specializing in Python full-stack, cloud/DevOps, and AI/ML
Mid-level AI/ML Engineer specializing in NLP/LLMs and real-time data pipelines
Senior Full-Stack Python Developer specializing in FinTech and cloud-native systems