Pre-screened and vetted.
Senior Full-Stack & AI Engineer specializing in LLM integrations and cloud-native systems
“Backend/data engineer with hands-on production experience building FastAPI Python APIs and AWS-native platforms (Lambda/API Gateway, SQS, ECS Fargate) with Terraform + GitHub Actions CI/CD and strong reliability practices (JWT/RBAC, retries/timeouts, structured errors/logging). Also built AWS Glue ETL pipelines (S3/RDS to curated S3/Athena) with schema evolution and data quality controls, modernized legacy processing via parallel-run validation and phased cutovers, and has demonstrated SQL tuning impact (seconds to <200ms) plus incident ownership for batch pipeline SLAs.”
Mid-level AI Engineer specializing in GenAI agents and RAG for IT operations
“Built and operates a production LLM agent for enterprise IT operations that triages and drafts resolutions for high-volume ServiceNow tickets using LangChain + RAG (Pinecone/pgvector) and AWS Bedrock/OpenAI. Emphasizes reliability with schema-validated stages, offline eval datasets from real tickets, and CloudWatch-driven monitoring/guardrails; system scales to 40K+ tickets/month and cut resolution time ~28%.”
Mid-level AI/ML Engineer specializing in conversational AI, NLP, and LLM-powered RAG systems
Senior Machine Learning Engineer specializing in MLOps and Generative AI
Junior Robotics & Machine Learning Engineer specializing in perception, SLAM, and edge AI
“Built and deployed an Azure-based, fine-tuned CLIP visual retrieval system at Staples for a ~300k-item product catalog, improving edge-case recall by 12% by engineering a custom delta-similarity/dynamic-margin loss. Also has robotics experience using ROS2 for sensor/compute orchestration, including GPS-time-synchronized sensor triggering for robot swarms and latency-bounded optical-flow benchmarking for edge deployment.”
Mid-level AI/ML Engineer specializing in LLM systems and cloud MLOps
“Built a production LLM-powered fraud detection platform at Wells Fargo, combining OpenAI/Hugging Face models with RAG-based explanations to make flagged transactions interpretable for risk and compliance teams. Delivered low-latency, real-time inference at high scale on AWS (SageMaker + EKS), with strong observability and security controls, reducing manual reviews and false positives in a regulated environment.”
Mid-level AI Engineer specializing in GenAI and RAG systems
“AI engineer who built a production e-commerce system that analyzes product images alongside sales and demographic data to generate actionable creative recommendations, now used by 20+ clients. Also built orchestrated document/agent pipelines (Airflow, LangGraph) including a compliance drift detector auditing 401 compliance documents, with an emphasis on traceability, logging, and production integration.”
Intern Software Engineer specializing in full-stack development and applied AI
“Internship experience building an end-to-end medical AI pipeline that extracts and normalizes messy medical PDFs, fine-tunes BioBERT to classify tumor-related statements (including negation/ambiguity handling), and integrates image-model outputs (MedSAM/GroundingDINO) for tumor localization and classification. Also worked on an LLM/RAG system to draft IPO prospectuses using retrieved regulatory/financial sources (including SEC EDGAR) with structured prompts to reduce hallucinations.”
Mid-level Data Scientist specializing in LLM development and scalable ML pipelines
“Built and deployed production LLM pipelines for evidence-based scoring in two domains: biomedical literature mining (scoring ~2700 drug compounds vs gene targets/mechanisms) and long-horizon news analytics (35 years of Chinese articles). Emphasizes reliability at scale (retries/checkpointing/validation), rigorous empirical model benchmarking (GPT-4o/mini/5), and translating results into stakeholder-friendly visual narratives.”
Intern Software Engineer specializing in AI/LLMs and full-stack development
“AI/ML infrastructure-focused engineer who has built production RAG systems from scratch (Supabase/pgvector + OpenAI embeddings) and iterated using formal eval metrics to improve retrieval quality. Also debugged real-time audio issues in a LiveKit-based pipeline by correlating packet loss with VAD behavior, and has deep experience building brittle, customer-specific financial platform integrations in Python/Playwright (2FA, redirects, token refresh, rate limits).”
Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems
“LLM/ML engineer who has shipped an enterprise RAG-based Q&A system (LangChain/LlamaIndex, FAISS + Azure Cognitive Search, GPT-3.5/4 via OpenAI/Azure OpenAI) to production on Docker + Kubernetes/OpenShift, tackling hallucinations, retrieval quality, latency/cost, and RBAC/IAM security. Also partnered with operations leaders to turn manual reporting into an LLM-powered summarization and forecasting dashboard driven by real KPIs and iterative stakeholder feedback.”
Mid-level Software Engineer specializing in backend systems, GraphQL, and real-time telemetry
“Software engineer with experience spanning an SEL internship and Walmart, combining backend/data pipeline work (Python, Kafka, relational DBs) with DevOps practices (Docker, Grafana, GitHub/Jenkins CI/CD, GitOps). Notably contributed to a REST-to-GraphQL migration aimed at reducing cloud utilization and implemented testing strategies to validate the transition.”
Mid-level AI/ML Engineer specializing in healthcare, fraud detection, and recommender systems
“Healthcare-focused applied ML/LLM engineer who has deployed production systems including an LLM medical documentation assistant that summarizes unstructured EHR notes into physician-ready structured outputs. Experienced building secure, compliant pipelines (PHI minimization, RBAC, encryption) and scaling via Docker/Kubernetes/Azure ML, plus orchestrating ETL/ML workflows with Airflow and Kubeflow; also built an LLM-driven clinical coding assistant at Centene with measurable performance metrics.”
Senior Software Engineer specializing in cloud-native microservices and healthcare integrations
“Backend engineer at Cerebrone.ai building cloud-native Flask microservices for an AI-driven automation platform on GCP (Cloud Run/App Engine), including dedicated inference services integrating OpenAI and internal ML pipelines. Demonstrated strong performance and scalability wins across Postgres/SQLAlchemy optimization, multi-tenant (healthcare/HIPAA-grade) data isolation, and high-throughput background processing with Celery/Redis/RabbitMQ, with multiple quantified latency/CPU/throughput improvements.”
Mid-level Data & Machine Learning Engineer specializing in production ML and data platforms
“Built and deployed a production LLM system that scraped Google Maps menu photos, extracted structured prices via OpenAI, and cross-validated them against website-scraped data to automate data-quality verification at scale (replacing costly manual contractor checks). Demonstrates strong reliability instincts—precision-first prompting, output gating with image-quality metadata, and fuzzy matching/RAG techniques—plus solid orchestration (Dagster/Airflow) and observability (Sentry, Prometheus/Grafana).”
Mid-level Full-Stack Software Engineer specializing in Generative AI
“Full-stack engineer who shipped an end-to-end speech capability for an LLM chatbot UI, integrating OpenAI APIs and publishing via Google Apigee with client documentation. Has experience operating deployments with Jenkins/Kubernetes/Docker and monitoring with Datadog, and has worked in an innovation-center environment building rapid prototypes under ambiguity with tight stakeholder feedback loops.”
Mid-level Machine Learning Engineer specializing in LLM-powered products
“Verizon engineer who productionized an LLM-based personalization capability for a customer-facing digital platform, owning the path from success metrics through scalable APIs, A/B validation, and post-launch monitoring (latency/accuracy/drift). Experienced in diagnosing and fixing real-time LLM/RAG workflow issues under peak load, and in enabling adoption via tailored technical demos/workshops and sales support materials.”
“Backend engineer focused on productionizing LLM systems: built a FastAPI-based RAG and multi-agent automation platform deployed with Docker/Kubernetes, prioritizing safe execution and reduced hallucinations. Experienced in refactoring monolithic ML services with feature-flagged incremental rollouts, and implementing JWT/RBAC plus row-level security (e.g., Supabase) for secure, scalable APIs.”
Mid-level Machine Learning & Data Infrastructure Engineer specializing in MLOps on AWS
“Built and deployed a fine-tuned Qwen 2.5 14B model into production at Dextr.ai as the backbone for hotel-operations agentic workflows, running on AWS EKS with Triton and TensorRT-LLM. Demonstrates strong cost-aware LLM engineering (QLoRA, FP8/BF16 on H100) plus rigorous benchmarking/observability (Prometheus, LangSmith) with reported sub-30ms TTNT. Previously handled long-running ETL orchestration with Airflow at GE Healthcare and Lowe's.”
“GenAI/ML engineer from Deloitte who built and shipped a production RAG-based internal search assistant for support teams, delivering quantified operational gains (20% effort reduction, 35% faster manual lookup). Experienced in enterprise-grade LLM reliability (grounding/hallucination control), compliance/security constraints, and rapid release cycles using CI/CD, MLflow, and orchestration tools (Airflow, Databricks Jobs, LangChain).”
Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI
“LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.”
Mid-level Full-Stack Java Developer specializing in cloud-native microservices
“Full-stack engineer focused on enterprise, cloud-native microservices—building Spring Boot backends and React/Angular front ends with strong security (OAuth/JWT), AWS infrastructure (RDS/S3), and containerized deployments (Docker/Kubernetes). Has delivered data-heavy order/account/transaction platforms and healthcare solutions including EHR integrations for secure patient data exchange, with emphasis on testing, performance tuning, and reliability (load testing).”
Mid-level Data Engineer specializing in real-time pipelines and cloud data platforms
“Backend engineer with hands-on experience building secure Python/Flask services (sessions, JWT, RBAC) and optimizing PostgreSQL/SQLAlchemy performance, including custom SQL using CTEs/window functions profiled via EXPLAIN ANALYZE. Also integrates LLM features via OpenAI/Azure into backend systems and improves scalability with RabbitMQ-driven async processing, caching, and multi-tenant data isolation patterns.”
Mid-level Full-Stack Software Engineer specializing in cloud-native microservices and AI integrations
“Backend engineer who has delivered large, measurable performance wins (10x throughput, 67% latency reduction) by combining Flask microservices, Redis caching, and AWS autoscaling/observability. Has hands-on depth in SQLAlchemy/Postgres optimization and production scaling pitfalls (cache consistency, connection exhaustion), plus experience deploying real-time ML inference (XGBoost) on AWS Lambda and building secure multi-tenant Kubernetes isolation.”