Pre-screened and vetted.
Mid-level Generative AI Engineer specializing in LLMs, RAG, and agentic AI
Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps
Mid-Level Software Engineer specializing in full-stack, data engineering, and ML
Senior Full-Stack AI/ML Engineer specializing in personalization, NLP, and GenAI platforms
Senior Java Full-Stack Developer specializing in cloud-native microservices and FinTech
Executive Engineering Leader specializing in data platforms and SaaS
Senior Data Engineer specializing in cloud data platforms and real-time streaming pipelines
Senior Machine Learning Engineer specializing in MLOps and Generative AI
Senior Data Engineer specializing in multi-cloud data platforms and generative AI
Mid-level AI/ML Engineer specializing in MLOps and production ML systems
“Backend/ML engineer who has shipped high-scale real-time systems across e-commerce and healthcare: built a PharmEasy real-time recommendation engine for ~2M monthly users (cut feature latency 5 min→30 sec; +15% cross-sell) and architected a HIPAA-compliant multimodal clinical diagnostic workflow (DICOM+EHR) with XAI, MLOps (MLflow/Airflow/K8s), and drift/monitoring guardrails supporting 10k+ daily predictions.”
Mid-level AI/ML Engineer specializing in fraud detection, NLP, and MLOps
“Built a production real-time fraud detection and customer-support automation platform at Citibank, tackling extreme class imbalance (reported ~1:5000) and strict latency constraints. Combines hands-on MLOps (Airflow, Kubernetes, MLflow; Snowflake/Spark/S3 integrations; CI/CD model promotion) with cross-functional delivery to Risk & Compliance focused on interpretability and reducing false positives.”
Senior Software Engineer specializing in backend microservices, cloud, and full-stack systems
“Backend/platform engineer who has built and scaled production Java/Spring Boot + Kafka services on AWS/Kubernetes (1M+ msgs/day) and led reliability/performance fixes that restored SLAs (25–30% latency improvement; 99.9% uptime). Also shipped an AI customer-support chatbot end-to-end using retrieval + guardrails and rigorous evaluation/observability, improving resolution time 40% and satisfaction 25%, with a strong plan/execute/verify approach to agentic workflow reliability.”
Mid-level Machine Learning & Data Infrastructure Engineer specializing in MLOps on AWS
“Built and deployed a fine-tuned Qwen 2.5 14B model into production at Dextr.ai as the backbone for hotel-operations agentic workflows, running on AWS EKS with Triton and TensorRT-LLM. Demonstrates strong cost-aware LLM engineering (QLoRA, FP8/BF16 on H100) plus rigorous benchmarking/observability (Prometheus, LangSmith) with reported sub-30ms TTNT. Previously handled long-running ETL orchestration with Airflow at GE Healthcare and Lowe's.”
Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps
“Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.”
Mid-level Data Engineer specializing in real-time pipelines and cloud data platforms
“Backend engineer with hands-on experience building secure Python/Flask services (sessions, JWT, RBAC) and optimizing PostgreSQL/SQLAlchemy performance, including custom SQL using CTEs/window functions profiled via EXPLAIN ANALYZE. Also integrates LLM features via OpenAI/Azure into backend systems and improves scalability with RabbitMQ-driven async processing, caching, and multi-tenant data isolation patterns.”
Mid-level AI/ML Engineer specializing in LLMs, NLP, and MLOps
“AI/ML engineer with healthcare domain depth who led a HIPAA-compliant, production LLM system at McKesson to automate clinical document understanding—extracting entities, summarizing provider notes, and supporting authorization decisions. Hands-on across Spark/Python ETL, Hugging Face + LoRA/QLoRA fine-tuning, RAG, and cloud-native MLOps (Airflow/Kubernetes/Step Functions, MLflow, blue-green on EKS/GKE), with explicit work on PHI handling and hallucination reduction.”
Senior ML Engineer & Data Scientist specializing in LLM agents, retrieval/ranking, and MLOps
“Machine Learning Engineer currently at Webster Bank building an enterprise-scale LLM agent for Temenos Journey Manager/Maestro, using RAG-style multi-stage retrieval with FAISS/Pinecone, hybrid dense+sparse search, and LoRA fine-tuning optimized via NDCG/MAP and A/B testing. Previously handled messy incident/telemetry data at Deuta Werke GmbH with deterministic + fuzzy entity resolution, and has strong production data engineering experience across Spark/Hadoop and Python ETL systems.”
Mid-level Data Scientist specializing in MLOps, LLM/RAG applications, and deep learning
“Built and deployed a production compliance automation RAG system (at Citi) that generates citation-backed, schema-validated risk summaries for regulatory document review. Emphasizes regulated-environment reliability with retrieval-only grounding, abstention, confidence thresholds, and immutable audit logging, plus orchestration using LangChain/LangGraph and Airflow. Reported ~60% reduction in compliance review effort while maintaining high precision and traceability.”
Senior Full-Stack Java Developer specializing in cloud-native microservices
“Backend/platform engineer with production ownership of high-volume transaction analytics and fraud monitoring services built in Java/Spring Boot. Has scaled data processing platforms (including healthcare datasets) and operated Kafka-based event pipelines with schema versioning, deduplication, and replay/backfill workflows, using strong observability via CloudWatch/Grafana and CI/CD with Jenkins.”
Mid-level Data & GenAI Engineer specializing in lakehouse, streaming, and RAG platforms
“Built a production internal LLM-powered knowledge assistant using a RAG architecture (Python, LLM APIs, cloud services) that answers employee questions with sourced, grounded responses from internal documents. Demonstrates strong practical depth in retrieval tuning (chunking/metadata filters), orchestration with LangChain, and production reliability practices (latency optimization, automated embedding refresh, evaluation metrics, logging/monitoring) while partnering closely with non-technical operations teams.”
Intern Software Engineer specializing in cloud, big data, and test automation
“Internship experience at Qualitest building and deploying an LLM-powered test automation system that reduced manual test creation and improved efficiency (~40%). Demonstrates strong production engineering for LLM systems (timeouts/retries/monitoring/caching, prompt optimization, batching) and has scaled workflows to 100+ concurrent jobs; also has orchestration experience with AWS Step Functions and Kubernetes.”