Pre-screened and vetted.
Senior Full-Stack & AI Engineer specializing in LLM integrations and cloud-native systems
“Backend/data engineer with hands-on production experience building FastAPI Python APIs and AWS-native platforms (Lambda/API Gateway, SQS, ECS Fargate) with Terraform + GitHub Actions CI/CD and strong reliability practices (JWT/RBAC, retries/timeouts, structured errors/logging). Also built AWS Glue ETL pipelines (S3/RDS to curated S3/Athena) with schema evolution and data quality controls, modernized legacy processing via parallel-run validation and phased cutovers, and has demonstrated SQL tuning impact (seconds to <200ms) plus incident ownership for batch pipeline SLAs.”
Mid-level AI Engineer specializing in GenAI agents and RAG for IT operations
“Built and operates a production LLM agent for enterprise IT operations that triages and drafts resolutions for high-volume ServiceNow tickets using LangChain + RAG (Pinecone/pgvector) and AWS Bedrock/OpenAI. Emphasizes reliability with schema-validated stages, offline eval datasets from real tickets, and CloudWatch-driven monitoring/guardrails; system scales to 40K+ tickets/month and cut resolution time ~28%.”
Mid-Level Software Development Engineer specializing in distributed systems and event-driven architectures
“Built and maintained an internal JavaScript/React real-time event monitoring UI used by multiple Goldman Sachs teams (e.g., Private Wealth Management and Bulk Trading Systems). Focused on scaling performance under hundreds of events/sec—using profiling, memoization, batching, and debouncing—and paired it with strong internal documentation and disciplined incident diagnosis via synthetic load testing and logs/metrics.”
Mid-level AI/ML Engineer specializing in conversational AI, NLP, and LLM-powered RAG systems
Senior Machine Learning Engineer specializing in MLOps and Generative AI
Senior Data Engineer specializing in multi-cloud data platforms and generative AI
Mid-level AI/ML Engineer specializing in LLM systems and cloud MLOps
“Built a production LLM-powered fraud detection platform at Wells Fargo, combining OpenAI/Hugging Face models with RAG-based explanations to make flagged transactions interpretable for risk and compliance teams. Delivered low-latency, real-time inference at high scale on AWS (SageMaker + EKS), with strong observability and security controls, reducing manual reviews and false positives in a regulated environment.”
Mid-level Software Engineer specializing in Agentic AI and RAG systems
“Built and shipped a production AI-powered Q&A/RAG onboarding assistant at One Community Global that unified knowledge across Notion, Google Docs, and Slack, cutting volunteer onboarding time by 45%. Demonstrates strong end-to-end ownership: LangChain agent orchestration integrated into a FastAPI backend, rigorous evaluation (200-query dataset, ~85% accuracy), and production feedback/monitoring with source-attributed answers to build user trust.”
Mid-level AI Engineer specializing in GenAI and RAG systems
“AI engineer who built a production e-commerce system that analyzes product images alongside sales and demographic data to generate actionable creative recommendations, now used by 20+ clients. Also built orchestrated document/agent pipelines (Airflow, LangGraph) including a compliance drift detector auditing 401 compliance documents, with an emphasis on traceability, logging, and production integration.”
Intern LLM/GenAI Engineer specializing in RAG, agentic systems, and low-latency inference
“Interned at Larsen & Toubro where they built and deployed an agentic RAG document question-answering system to reduce time spent searching documents and improve trustworthiness. Implemented ReAct-style multi-step orchestration with LangChain/LlamaIndex plus evidence-bounded generation, grounding/citations, and rigorous evaluation—cutting latency ~40%, hallucinations ~35%, and unsafe outputs ~40% while collaborating closely with non-technical business/ops stakeholders.”
Mid-level Machine Learning Engineer specializing in computer vision and generative AI
“Built and deployed an LLM/RAG system that uses differential privacy and distributional similarity checks to transform private data into a non-sensitive knowledge base while preserving utility. Also has experience demonstrating adversarial ML concepts (FGSM) to non-technical audiences by focusing on observable model behavior rather than implementation details.”
Junior AI/ML Engineer specializing in anomaly detection and LLM/RAG systems
“Built and productionized a tool-first, multi-agent framework that augments an anomaly detection model with domain context to generate trustworthy, evidence-backed anomaly explanations (including false-positive likelihood). Architected the platform to be model/orchestration/vectorDB agnostic (e.g., GPT + CrewAI + ChromaDB vs Claude + LangGraph + other vector DB) with strong performance, reliability, and OpenTelemetry-based observability. Also built a personal LangGraph-based "mock interviewer" agent that asynchronously fuses voice + live code input using state reducers, stop conditions, and fallback routing.”
Mid-level Data Scientist specializing in AI/ML, MLOps, and LLM-powered analytics
“Built and deployed a production LLM-powered document Q&A system enabling natural-language querying of large PDFs, focusing on retrieval quality (overlapped chunking) and low-latency performance (optimized embeddings + vector search). Experienced with scaling ML/LLM workflows using async/batch processing, caching, cloud storage, and orchestration via Apache Airflow with robust testing, monitoring, and failure handling.”
Mid-level AI/ML Engineer specializing in real-time anomaly detection and AI agents
“Built a production real-time anomaly detection platform for high-frequency trading at HSBC, using a streaming stack (Pulsar + Spark Structured Streaming + AWS Lambda) and a transformer-based model combining time-series and numerical signals. Experienced in MLOps and safe deployment (Kubernetes, canary releases, MLflow/Grafana monitoring) and in aligning model performance with risk/compliance expectations through SLA-driven tuning and stakeholder-friendly dashboards.”
Mid-level AI/ML Engineer specializing in healthcare, fraud detection, and recommender systems
“Healthcare-focused applied ML/LLM engineer who has deployed production systems including an LLM medical documentation assistant that summarizes unstructured EHR notes into physician-ready structured outputs. Experienced building secure, compliant pipelines (PHI minimization, RBAC, encryption) and scaling via Docker/Kubernetes/Azure ML, plus orchestrating ETL/ML workflows with Airflow and Kubeflow; also built an LLM-driven clinical coding assistant at Centene with measurable performance metrics.”
Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps
“Built a production RAG-based “unified assistant” that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.”
Mid-level AI/ML Engineer specializing in Generative AI and healthcare data
“Built and deployed a production RAG-based document Q&A system on Azure OpenAI to help business teams search thousands of PDFs/Word files, using Qdrant vector search, MongoDB, and a Flask API. Demonstrates strong production engineering (streaming large-file ingestion, parallel preprocessing, monitoring/retries) plus systematic prompt/embedding/chunking experimentation to improve accuracy and reduce hallucinations, and has hands-on orchestration experience with ADF/Airflow/Databricks/Synapse.”
Mid-level Data Scientist specializing in ML, MLOps, and Generative AI
“ML/NLP engineer who built a RAG-based technical assistant for Caterpillar field engineers, transforming PDF keyword search into intent-based semantic retrieval across manuals, logs, sensor reports, and technician notes. Strong in productionizing data/ML systems (Airflow, PySpark) with rigorous preprocessing, entity resolution, and evaluation—delivering measurable gains in accuracy, relevance, and duplicate reduction.”
Senior AI/ML Engineer specializing in Generative AI and RAG
“ML/NLP practitioner at Morf Health focused on unifying fragmented healthcare data by linking structured patient/encounter records with unstructured clinical notes. Has hands-on experience with transformer embeddings, vector databases, and domain fine-tuning, plus rigorous evaluation (precision/recall) and human-in-the-loop validation with clinical SMEs to make pipelines production-grade.”
Mid-level Data Scientist specializing in fraud detection and healthcare ML
“Applied NLP/ML in healthcare and financial services, including fine-tuning BERT on unstructured EHR text and building embedding-based similarity search for clinical concepts. Also redesigned a Wells Fargo fraud detection data pipeline using modular Python + AWS Glue/Step Functions, cutting runtime ~40% with improved monitoring and reliability.”
Mid-level Generative AI Engineer specializing in LLM systems and RAG
“Currently at Huntington Bank, built a production-grade RAG system that helps business/operations teams get grounded answers from large volumes of internal enterprise documents. Owns ingestion and FastAPI backend, tuned hybrid BM25+vector retrieval and chunking for relevance, and evaluates reliability with metrics and observability (LangSmith, CloudWatch, Prometheus/Grafana) while partnering closely with non-technical stakeholders.”
Junior Software Engineer specializing in Full-Stack and ML for FinTech
“Full-stack engineer with fintech trading-platform experience who shipped and operated a real-time portfolio P&L/performance feature end-to-end (React + Node/WebSockets + MongoDB) on AWS, including significant performance tuning under peak trading load. Also built a Spark-based trading analytics pipeline with idempotency and reconciliation for auditability, and has a personal React/TS + Node/Express project (Artsy) with JWT auth and schema-evolution practices.”
Mid-level Full-Stack Software Engineer specializing in Generative AI
“Full-stack engineer who shipped an end-to-end speech capability for an LLM chatbot UI, integrating OpenAI APIs and publishing via Google Apigee with client documentation. Has experience operating deployments with Jenkins/Kubernetes/Docker and monitoring with Datadog, and has worked in an innovation-center environment building rapid prototypes under ambiguity with tight stakeholder feedback loops.”
Mid-level Machine Learning Engineer specializing in LLM-powered products
“Verizon engineer who productionized an LLM-based personalization capability for a customer-facing digital platform, owning the path from success metrics through scalable APIs, A/B validation, and post-launch monitoring (latency/accuracy/drift). Experienced in diagnosing and fixing real-time LLM/RAG workflow issues under peak load, and in enabling adoption via tailored technical demos/workshops and sales support materials.”