Pre-screened and vetted.
Senior AI/ML Engineer specializing in GenAI agents and LLM workflows
“LLM/AI engineer with production experience building a retrieval-based document intelligence system that extracts information from PDFs/emails, backed by Python + Spark pipelines. Focused on reliability and cost/latency optimization (caching, batch processing) and has hands-on orchestration experience with Airflow (sensors, retries, alerts). Also partnered with business stakeholders to deliver customer feedback classification/summarization for faster sentiment insights.”
Mid-level Software Engineer specializing in backend systems and cloud-native FinTech
“Amazon engineer with 5+ years of experience who built an AI-assisted log investigation and triage workflow that cut debugging time by about 30% during on-call incidents. Combines observability tooling like CloudWatch and Splunk with Python, prompt engineering, and RAG-based diagnostics, and has practical experience orchestrating agentic AI workflows with a strong human-in-the-loop reliability focus.”
Junior Software Engineer specializing in full-stack systems and distributed log analytics
“CMU candidate with hands-on experience taking LLM concepts from research prototypes toward production-ready designs (structured outputs, guardrails, failure-scenario evaluation). Also partnered with sales/customer teams at Mazecare to drive adoption with Dontia Alliance (largest dental clinic chain in Singapore) and engaged Singapore government stakeholders, bridging clinical workflow needs with IT security/integration concerns.”
Junior AI Engineer specializing in LLM pipelines, RAG, and computer vision
“Built and deployed an on-prem, HIPAA-compliant LLM pipeline for oncology-focused clinical note generation and decision support, emphasizing grounded differential diagnosis and explainable reasoning via RAG to reduce hallucinations. Also created a LangGraph-based multi-agent academic paper search system integrating Tavily, arXiv, and Semantic Scholar with an orchestrator that routes tasks to specialized sub-agents.”
Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps for healthcare and finance
“Built a production LLM-powered RAG agent for healthcare/insurance operations that retrieves and summarizes patient medical documents with grounded citations, scaling to ~4.5M records. Addressed medical shorthand and terminology by fine-tuning ~120 lightweight DistilBERT models by specialty and validating entities against SNOMED/RxNorm, while using SHAP/LIME and human-in-the-loop review to make decisions explainable to stakeholders.”
Intern Data Scientist specializing in generative AI and forecasting
“ML/NLP practitioner working across healthcare and business/finance use cases: currently fine-tuning a domain-specific Llama 3.1 model for safe reasoning over EHRs/clinical notes using RAG + RL/DPO and RAGAS-based evaluation. Has built UMLS-driven entity normalization pipelines with quantified quality gains and developed embedding/vector-DB systems (FAISS) for semantic matching and forecasting/recommendation applications at Aurora AI and Banxico.”
Intern Software Engineer specializing in ML/NLP and LLM applications
“Full-stack AI/LLM engineer who has deployed a production LLM backend (Mistral 14B) on GKE to auto-transform datasets and generate runnable ML training pipelines, addressing hallucinations, schema mismatch, latency, and burst scaling with caching/prompt compression and HPA. Also has internship experience (Splunk, BlackOffer) delivering data automation and 10+ Power BI dashboards for non-technical stakeholders with measurable efficiency gains.”
Mid-Level Software Engineer specializing in full-stack systems and developer tooling
“Built and productionized an AI extension for JetBrains IDEs providing coding assistance, testing, security sweeps, and documentation generation using both an internal LLM and third-party models (e.g., Gemini, Claude). Experienced in diagnosing customer issues in real time (Slack) with structured follow-through (GitHub Issues) and driving adoption through developer-oriented walkthroughs and video demos.”
Mid-level Full-Stack Engineer specializing in AI and FinTech platforms
“Full-stack engineer building real-time internal banking operations dashboards (Java/Spring Boot microservices + React/TypeScript) with Kafka-based streaming and post-launch performance optimizations. Also shipped a production internal AI support assistant using RAG (Confluence/PDF/support docs ingestion, embeddings + vector DB retrieval) with guardrails, evaluation loops, and observability to reduce hallucinations and prevent regressions.”
Intern Full-Stack Software Engineer specializing in AI/LLM platforms and data systems
“Backend/LLM engineer with experience productionizing RAG systems (legal-case natural language querying) and optimizing for latency/cost, including a reported ~40% reduction via Redis caching and batching. Built monitoring and real-time debugging workflows (FastAPI, structured logging, correlation IDs, sandbox repro) and regularly delivered technical demos/workshops. Also partners with BD/sales to translate LLM capabilities into business value, including ESG-metric extraction from corporate filings.”
Senior AI/ML Engineer specializing in Generative AI and agentic multi-agent systems
“Built and shipped a production LLM-powered multi-agent RAG system to automate complex internal support workflows, integrating tool execution (SQL/APIs) with validation guardrails to reduce hallucinations. Optimized for real-world latency and cost via model routing, caching, and async parallel tool calls, and enforced reliability with CI-gated golden test sets derived from anonymized production queries.”
Mid-level Full-Stack Developer specializing in cloud microservices and GenAI systems
“Built and owned an end-to-end AI-driven decisioning platform at Uber, combining LLM orchestration with typed tool contracts and a Snowflake-based RAG pipeline to make decisions fully auditable. Delivered large-scale measurable impact (120k requests/day, 18k cases auto-resolved/month) while improving ops SLA from 3 days to 6 hours and cutting incident response time nearly in half. Previously led a high-risk strangler-fig modernization of a legacy insurance platform across 120+ microsites at Accenture, coordinating across multiple squads with feature-flagged parallel cutovers.”
Intern AI/ML Engineer specializing in Generative AI and applied machine learning
“New graduate with hands-on LLM work building a RAG pipeline (HNSW, lexical reranking/boosting, ReAct) and optimizing it through ablation to dramatically reduce latency. Also building a modular personal assistant with a custom wake word model, router-driven agent selection, and integrations like Spotify with secrets managed via .env.”
Mid-level Software Engineer specializing in backend, distributed systems, and AI infrastructure
“Built Baioniq, an enterprise LLM platform for automating extraction from massive unstructured documents like contracts and insurance claims. They demonstrate unusually strong production depth in agentic AI—scaling to 100k+ requests/day, processing 1M+ claim documents, and improving extraction accuracy through rigorous RAG architecture, evaluation, and fallback design.”
Senior Full-Stack & Mobile Software Engineer specializing in cloud-based applications
“Data/ML backend engineer with hands-on production experience spanning RAG services (LlamaIndex/OpenAI) and AWS data platforms. Has delivered Terraform-managed AWS architectures (Lambda + ECS Fargate) with secure secrets handling, built Glue-to-Redshift ETL with schema evolution controls, modernized SAS reporting into Python microservices, and achieved major Redshift query speedups (2+ hours to under 15 minutes).”
Mid-level AI/Analytics Product & Data Professional specializing in LLM and dashboard automation
“Built and shipped open-source LLM/RAG systems, including a generative AI assistant grounded on ~30,000 scraped university web pages, improving response accuracy ~30% by moving from TF-IDF-only retrieval to a hybrid sentence-transformer approach with fallback controls. Also partnered with non-technical leadership at Securi.ai to deliver real-time predictive analytics dashboards (Elasticsearch + Jira/ServiceNow) that reduced project overhead by 18%.”
Mid-level Full-Stack Engineer specializing in scalable APIs, cloud infrastructure, and GenAI apps
“Backend/platform engineer with experience across edtech, logistics, and AWS internal systems—owned a production course recommender end-to-end (model serving + APIs + caching/observability), delivering +30% CTR and -20% latency. Has scaled real-time delivery visibility/rerouting on Kubernetes/EKS to sub-200ms P95 during demand spikes and built billion-events/day telemetry pipelines on AWS (Kinesis Firehose, Lambda, S3, Redshift) with schema evolution, dedupe, and replay support.”
Mid-Level Full-Stack Software Engineer specializing in AWS cloud and microservices
“Backend/LLM engineer who built a production-critical Amazon Bedrock + RAG correction and compliance layer for employee communications, integrating tightly with existing Spring Boot/AWS microservices to reduce manual review while keeping outputs explainable and auditable. Also designed an event-driven system processing 10M+ events/day (SQS/Lambda/DynamoDB/Elasticsearch) and handled on-call incidents with strong observability and reliability patterns (idempotency, retries, hotspot mitigation).”
Mid-level AI/ML Engineer specializing in Generative AI, Conversational AI, and RAG systems
“Built and shipped a production enterprise RAG knowledge assistant that returns grounded, cited answers and uses confidence-based fallbacks (clarifying questions/abstention) with monitoring and compliance controls for sensitive data. Implemented end-to-end agent orchestration (function calling, structured JSON, state, retries/rate limits) plus eval/feedback loops, and achieved a reported 30–40% improvement in knowledge-task completion time while reducing hallucinations via retrieval improvements.”
Senior Software Engineer specializing in AI/ML, computer vision, and cloud-native systems
“Independently built a production-grade, containerized enterprise agentic AI platform (stateful orchestration + RAG) focused on real-world reliability—guardrails, citation-based outputs, reranking, query rewriting, and evaluation harnesses to reduce hallucinations. Hands-on with OpenAI SDK, CrewAI, and LangGraph, and has delivered AI solutions for non-technical NGO stakeholders via demos and practical POCs.”
Senior Technical Support Engineer specializing in Azure Cloud & Generative AI
“Microsoft cloud/infra engineer with 5+ years supporting enterprise Azure environments, specializing in security-focused networking (private endpoints, DNS) and production troubleshooting across Azure Front Door/App Gateway WAF/AKS. Has implemented posture improvements via Defender for Cloud, Azure Policy, and RBAC tightening, and also designs secure AWS agent/scanner integrations and modern EKS/GitHub Actions/Secrets Manager observability-enabled SDK rollouts.”
Mid-level Machine Learning Engineer specializing in computer vision and LLM pipelines
“ML/LLM engineer who built production systems to speed up artist content-creation workflows, including a fine-tuned image captioning model paired with a RAG layer over image embeddings/captions to improve consistency across changing domains. Experienced orchestrating multi-tool agents with LangChain/LangGraph (planning + critic/reflection) and setting up practical monitoring (caption rejection rate) plus evaluation sets for tool-calling accuracy, output quality, and latency.”