Pre-screened and vetted.
Executive Technology Leader specializing in digital transformation, headless e-commerce, and cloud architecture
“Technology leader focused on business-aligned roadmaps and integration-heavy ecommerce platforms. Recently delivered an on-time launch for lutusooking.com (a premium Hamilton Beach brand) by coordinating UX/UI, component-based middleware, BigCommerce, Algolia search, personalization/recommendations, payments, and supply chain integrations, and later improved scalability via a Jitterbit iPaaS approach proven during Black Friday/Cyber Monday traffic.”
Mid-level Machine Learning Engineer specializing in Generative AI and LLM applications
“GenAI engineer who has deployed production LLM/RAG chatbots for internal document search, focusing on reliability (hallucination reduction via prompt guardrails + retrieval filtering) and performance (latency improvements via caching). Experienced with LangChain/LangGraph orchestration for multi-step agent workflows and iterates using monitoring/logs and benchmark-driven evaluation while partnering closely with product and business teams.”
Junior Software Engineer specializing in cloud-native microservices and ML/LLM pipelines
“Backend-leaning full-stack engineer who ships AI-enabled products end-to-end: built CodeChat, a production internal codebase Q&A tool using RAG with Pinecone and a model-agnostic wrapper across OpenAI/Anthropic/AWS Bedrock, cutting AWS costs ~50% and latency ~45%. Also built and operated RealityStream, a Flask-based real-time forecasting API with JWT/RBAC, MLflow model versioning, and Prometheus/Grafana observability, including handling a real production latency incident via rollback, preloading, and caching.”
Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms
“Built and shipped a production RAG-based assistant that lets parents ask natural-language questions about their child’s learning progress, using pgvector retrieval (child-id filtered) and Redis caching to hit ~180ms latency. Implemented real-world guardrails and compliance (Llama Guard, COPPA, retrieval thresholds, fallbacks) with 99.5% uptime, and ran human-in-the-loop eval loops that improved satisfaction from 3.8 to 4.2 while serving 60k+ monthly users and reducing costs significantly.”
Mid-level AI/ML Engineer specializing in generative AI, RAG platforms, and LLM agents
“AI/LLM engineer who has shipped 10+ production applications, including InvestIQ on GCP—a production-grade RAG due-diligence engine that ethically scrapes web/PDF sources, builds a ChromaDB knowledge base, and delivers analyst-style dashboards plus a citation-backed chat copilot. Deep focus on reliability (evidence-only answers, hard citations, refusal gating), retrieval tuning, and orchestration (Airflow/Cloud Composer), plus multi-agent systems (CrewAI with 7 specialized finance agents).”
Mid-Level Software Engineer specializing in backend microservices and distributed systems
“Built and productionized an internal LLM-powered search tool that helps engineers find the right SolidWorks macros using plain-English queries, using OpenAI embeddings and ChromaDB with strong logging/fallback safeguards. Experienced in diagnosing RAG/agentic workflow issues in real time and in hands-on API support, including fixing customer macros after SolidWorks version updates and driving adoption through reusable solutions and best practices.”
Mid-level Software Engineer specializing in cloud, data engineering, and AI/ML
“Backend/platform engineer who owned an AI-powered resume optimization service end-to-end (FastAPI + Celery + Redis/Postgres) and optimized it for unpredictable LLM task latency. Strong Kubernetes/GitOps practitioner (Helm, autoscaling, probes, ArgoCD rollbacks) with experience in on-prem-to-cloud migrations using Terraform and CDC-based replication, plus real-time Kafka pipelines monitored via Prometheus/Grafana.”
Junior Software Engineer specializing in AI systems and robotics infrastructure
“Robotics software engineer with hands-on ROS 2 experience building real-time perception/control infrastructure and multi-sensor fusion (radar/ultrasonic + GNSS/IMU) with deterministic latency and safety fallbacks. Debugged rover navigation drift via rosbag replay and timing analysis, improving state estimation by gating GNSS and switching to SLAM when GPS degraded. Also brings strong distributed-systems and build/CI tooling experience (gRPC/Protobuf, Docker, Bazel cross-compilation for ARM/RISC-V, GitHub Actions).”
Mid-level Backend & Applied ML Engineer specializing in LLM systems and scalable APIs
“Backend engineer who significantly evolved an internal analytics/reporting platform (Python API + Postgres) powering self-service dashboards for product/business teams, focusing on reliability under heavy concurrent load and fast query performance. Demonstrates strong production engineering practices across API design (FastAPI), observability, incremental rollouts with feature flags, and data security using JWT/RBAC plus Postgres row-level security.”
Mid-level Software Engineer specializing in backend, cloud-native microservices, and LLM apps
“LLM/agentic systems practitioner who repeatedly takes customer-facing LLM prototypes into production by operationalizing prompts, hardening RAG pipelines, and adding monitoring/guardrails. Has hands-on experience debugging intermittent production failures under high traffic (vector store timeouts/empty retrieval) and implementing fail-safe behavior plus alerting. Also partners closely with sales in pilots/POCs, customizing demos with customer data and running side-by-side comparisons to drive adoption.”
Mid-level GenAI Engineer specializing in LLM fine-tuning, RAG, and MLOps
“Healthcare-focused LLM engineer who deployed a production triage and clinical knowledge retrieval assistant using RAG and LangGraph-orchestrated multi-agent workflows. Emphasizes clinical safety and compliance with robust hallucination controls, HIPAA/PHI protections (tokenization, encryption, audit logging, zero-retention), and human-in-the-loop escalation; reports a 75% latency reduction in a healthcare agent system.”
Intern Data Scientist specializing in ML engineering and LLM agentic workflows
“Built an agentic, multi-step LLM system that generates full-stack code for API integrations using LangChain orchestration, Pinecone/SentenceBERT RAG, and a human-in-the-loop feedback loop for iterative code refinement. Also collaborated with non-technical content writers and PMs during a Contentstack internship to deliver a Slack-based AI workflow that generates and brand-checks articles with one-click approvals.”
Intern Machine Learning Engineer specializing in forecasting, NLP, and RAG systems
“Intern who built and deployed a production LLM-powered contract analysis system for finance teams: Azure Document Intelligence for text/table extraction plus Gemini prompting to surface key terms and risks via an async API and simple UI. Emphasizes reliability in production with fallbacks, guardrails against hallucinations, and operational concerns like latency/cost/versioning, delivering summaries in under 30 seconds instead of hours.”
Mid-level AI/ML Engineer specializing in GenAI and financial risk & compliance analytics
“Built and deployed a production LLM-powered financial risk and compliance platform to reduce manual trade exception handling and speed up insights from regulatory documents. Implemented a LangChain multi-agent workflow with structured/unstructured data integration (Redshift + vector DB) and emphasized hallucination reduction for regulatory safety using Amazon Bedrock. Strong MLOps/orchestration background across Kubernetes, Airflow, Jenkins, and monitoring/testing with MLflow, Evidently AI, and PyTest.”
Mid-level AI/ML Engineer specializing in Generative AI and production ML systems
“At CVS Health, the candidate productionized a RAG-based LLM solution in a regulated healthcare setting, emphasizing reliable data pipelines, LoRA fine-tuning, monitoring, safety guardrails, and A/B testing. They have hands-on experience troubleshooting real-time RAG failures (e.g., chunking/embedding issues) and regularly lead developer-focused demos/workshops while translating technical architecture into business value for stakeholders.”
Mid-level AI/ML & Full-Stack Engineer specializing in LLM agents and medical RAG systems
“Full-stack engineer at an early-stage startup building an agentic AI application for enterprise systems, combining customer-facing Next.js/React UI work (30% faster load times) with backend/workflow orchestration using FastAPI + n8n, Redis, and RabbitMQ. Previously at Deloitte USI, built BDD Selenium/Java automation and managed 200+ defects end-to-end using JIRA/JAMA to support on-time production releases.”
Mid-level Full-Stack Engineer specializing in enterprise AI systems
“Built and productionized an AI NL-to-SQL capability inside legacy accounts receivable software (React + Spring Boot + Postgres/pgvector RAG), adding semantic caching and a SELECT-only validation layer to satisfy infosec. Achieved measurable impact (3 days to seconds turnaround, 60% token cost reduction, 50% latency reduction) with strong adoption (40 analysts, 50+ queries/week) and documented/monitored via Confluence + logging and user feedback loops.”
Staff RPA & Automation Engineer specializing in Financial Services
“Blue Prism RPA developer in a small FinTech-aligned team who owned ~20 production bots and drove both delivery and reliability. Built a shared VDI/locking design that cut infrastructure cost ~20–30% and routinely handled ServiceNow-driven production incidents end-to-end, including hotfixes and longer-term SDLC fixes. Also acted as a player-coach, training junior hires and maintaining high bot success rates (up to 99% within SLA).”
Mid-level AI Engineer specializing in LLM orchestration, RAG, and multi-agent systems
“Research Assistant at the University of Houston who built and live-deployed a production RAG system for 1000+ research documents, using hybrid retrieval (dense+BM25+RRF) with cross-encoder reranking and RAGAS-based evaluation; reported 66% MRR, 0.85+ faithfulness, and 68% lower LLM inference costs. Also built a deployed LangGraph multi-agent research system (Researcher/Critic/Writer) with tool integrations (Tavily, arXiv) and dual memory (ChromaDB + Neo4j), plus freelance automation work delivering a WhatsApp chatbot and n8n workflows for a wholesale clothing business.”
Junior AI Software Engineer specializing in LLM applications and real-time retrieval
“Founding engineer at Novum AI building a real-time call analytics/suggestion backend (transcription + sentiment/tone + context retrieval) using a serverless architecture. Drove major latency improvements (about 4s down to sub-1.5s) and has practical experience hardening production APIs (FastAPI/Pydantic, auth with Cognito/Redis) and payment systems (Stripe) by surfacing overlooked subscription and multi-tenant billing edge cases.”
“Designed and deployed a production LLM agent platform at the National Institutes of Health to reduce time spent searching fragmented internal documentation, combining RAG grounding with multi-step tool-calling workflows and integration into legacy services via inference APIs. Emphasizes production-grade reliability through automated evaluation on real queries, guardrails/safe-failure behaviors, and ongoing A/B testing and monitoring, and has experience translating non-technical stakeholder goals into measurable success metrics.”
Junior AI/ML Engineer specializing in cloud-native LLM systems and RAG
“AI/LLM engineer who has shipped production RAG copilots and multi-agent workflows, including a real-time Llama3 (Ollama) copilot backend handling 12k+ concurrent queries at 99.9% uptime. Deep on orchestration (Langflow/Airflow/Kubernetes), reliability evaluation (hallucination detection, p95 latency, token cost), and monitoring (Prometheus/Grafana), with demonstrated stakeholder-facing analytics delivery via Tableau.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“LLM/agentic systems engineer who built a production "Agentic AI Diagnostic Assistant" for network engineers, using a multi-agent Llama 2 + LangChain architecture with RAG over telemetry/incident data in DynamoDB and confidence-based deferrals to reduce hallucinations. Also has strong MLOps/orchestration experience (Airflow, EventBridge, Spark, Docker, SageMaker/ECS) at multi-terabyte/day scale and delivered multilingual NLP analytics (fine-tuned BERT/spaCy) for support operations through hands-on stakeholder workshops.”
Mid-level Data Scientist specializing in predictive modeling, NLP/LLMs, and RAG search systems
“Built production LLM/RAG platforms for financial services to enable natural-language Q&A over large policy/compliance document sets stored in Snowflake and SharePoint. Strong in MLOps and orchestration (Airflow, ADF, Step Functions, MLflow) and in solving real production issues like stale embeddings and model performance, including an incremental Snowflake Streams sync that cut processing time from hours to minutes.”