Pre-screened and vetted.
Senior Java Full-Stack & DevOps Engineer specializing in cloud-native microservices
“Software engineer with a CS/Computer Engineering background who has worked on ML/NLP (Hugging Face, clinical NLP, text generation and structured extraction) and has a school robotics project integrating a trained ML model with microprocessor-controlled hardware to drive motor movement and writing. Currently focused on building and deploying applications and ML models to AWS/Azure using Docker, Kubernetes, and CI/CD; targeting ~$150K compensation.”
“Backend engineer focused on productionizing LLM systems: built a FastAPI-based RAG and multi-agent automation platform deployed with Docker/Kubernetes, prioritizing safe execution and reduced hallucinations. Experienced in refactoring monolithic ML services with feature-flagged incremental rollouts, and implementing JWT/RBAC plus row-level security (e.g., Supabase) for secure, scalable APIs.”
Junior Machine Learning Engineer specializing in computer vision and LLM applications
“Built and led an autonomous driving software effort for Formula Student, owning the full autonomy stack (perception, planning, control) orchestrated in ROS. Implemented stereo depth + YOLO object detection, RRT/RRT* planning, and a robust SLAM pipeline (Kalman filter, submapping) while leveraging Gazebo simulation and modern deployment tooling (Docker/Kubernetes, AWS, GitHub Actions CI/CD).”
Mid-level Robotics Software Engineer specializing in autonomous perception and sensor fusion
“Robotics engineer with Honeywell and Tata Motors experience deploying ROS/ROS2 autonomous mobile robot fleets into live factory environments, integrating sensors, safety PLCs, and on-prem services. Known for solving end-to-end latency and stability issues (including network spikes under load) using gRPC, Docker, and improved diagnostics—cutting diagnosis time from hours to minutes and achieving sub-150 ms control response.”
Mid-level Machine Learning & Data Infrastructure Engineer specializing in MLOps on AWS
“Built and deployed a fine-tuned Qwen 2.5 14B model into production at Dextr.ai as the backbone for hotel-operations agentic workflows, running on AWS EKS with Triton and TensorRT-LLM. Demonstrates strong cost-aware LLM engineering (QLoRA, FP8/BF16 on H100) plus rigorous benchmarking/observability (Prometheus, LangSmith) with reported sub-30ms TTNT. Previously handled long-running ETL orchestration with Airflow at GE Healthcare and Lowe's.”
Senior Research Scientist specializing in AI for autonomous driving and semiconductors
“Robotics perception engineer focused on autonomous driving 3D detection, integrating PETR embeddings into BEVFormer and tackling hard orientation/temporal alignment issues in multi-camera BEV pipelines. Uses Gazebo with custom sensor plugins to validate calibration, timing, and transforms, and blends synthetic labels with real imagery for scalable 3D box generation.”
Mid-level AI/ML Engineer specializing in healthcare analytics and MLOps
“Built and deployed a production LLM-powered lesson adaptation platform for K–12 educators that personalizes content for multilingual and neurodiverse students using RAG and content transformation. Owned the full stack from FastAPI backend and OpenAI integration through reliability/safety controls, latency/cost optimization, and weekly shippable modular APIs, iterating directly with curriculum stakeholders to reduce hallucinations and improve educator trust.”
Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI
“LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.”
Mid-Level Full-Stack Engineer specializing in LLM and RAG applications
“LLM/RAG engineer who took a PDF-heavy agent from prototype to production for an Africa-based client, combining Pinecone retrieval with robust PDF parsing (unstructured.io, OCR, structured table extraction). Demonstrates strong production mindset (eval metrics, prompt hardening, security/scalability) and measurable optimization impact (30% efficiency gain, 2x faster responses), and has helped close deals by building security-focused POCs for skeptical IT stakeholders.”
Mid-level Full-Stack Developer specializing in FinTech web applications
“Backend engineer who built an e-commerce order processing service in Python/Flask with PostgreSQL, focusing on correctness and reliability (idempotency, Redis locks, async payment processing with circuit breakers). Also integrated an ML recommendation system as a separate FastAPI inference service with caching and async embedding updates, reporting ~25% CTR lift, and has experience with multi-tenant isolation using PostgreSQL row-level security.”
Mid-Level Backend Software Engineer specializing in enterprise systems and applied AI/ML
“Support engineer with IBM DFSMS OAM experience who restored a production TS7770 environment during a TS7760→TS7700 migration by using logs, SLIP traps, and dump analysis to pinpoint an SMS configuration (SCDS) issue, then partnering with the customer to redo the migration successfully. Also built a personal agentic news selector system and emphasizes documentation improvements and customer education to prevent recurring incidents.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and Clinical AI
“Built and productionized a HIPAA-compliant LLM+RAG Clinical AI assistant at Optum, fine-tuning GPT/LLaMA on de-identified patient notes and integrating FAISS/Pinecone for sub-second retrieval; reported to cut diagnosis time by ~20 minutes per case. Experienced in orchestrating ML pipelines (Airflow, AWS Step Functions, Azure Data Factory) and in reliability techniques for LLM systems (grounding, citations, confidence filters, monitoring) while partnering closely with clinicians and compliance teams.”
Junior Machine Learning Engineer specializing in generative AI and computer vision
“AI engineer who deployed a production LLM-powered safety system for an education platform, combining rule-based checks, multi-LLM verification, and selective context (prompt+image vs image-only) to prevent explicit prompts/images from getting through. Strong focus on reliability via benchmarking, trace-based failure analysis, and continuous improvement driven by stakeholder feedback and manual review.”
Junior Full-Stack Developer specializing in web apps and reinforcement learning
“Built an AI basketball shooting coach that analyzes player form against NBA players and recruited 30+ beta users via Reddit to drive iterative UI/workflow improvements. Also has internship experience building an administrative server and coordinating API/database compatibility with another client server, emphasizing communication and integration quality.”
Senior Machine Learning Engineer specializing in MLOps and NLP/GenAI
“Built a production LLM-agent framework for a startup that performs daily financial/trading analysis by combining live market data with internal tools, including a centralized memory module to prevent context drift and reduce hallucinations. Also implemented an Airflow-orchestrated retail price forecasting pipeline deployed to AWS endpoints, scaling parallel workloads via Kubernetes Executor and validating systems with rigorous functional + LLM-specific metrics and cross-team collaboration.”
Mid-level AI/ML Engineer specializing in LLMs, NLP, and MLOps
“AI/ML engineer with healthcare domain depth who led a HIPAA-compliant, production LLM system at McKesson to automate clinical document understanding—extracting entities, summarizing provider notes, and supporting authorization decisions. Hands-on across Spark/Python ETL, Hugging Face + LoRA/QLoRA fine-tuning, RAG, and cloud-native MLOps (Airflow/Kubernetes/Step Functions, MLflow, blue-green on EKS/GKE), with explicit work on PHI handling and hallucination reduction.”
Mid-level AI/ML Engineer specializing in Generative AI and LLMOps
“Built and deployed a GPT-based RAG enterprise search system for healthcare clinicians, emphasizing low-latency performance and reduced hallucinations while maintaining end-to-end HIPAA compliance. Demonstrates deep applied experience with PHI-safe data governance (detection/redaction/de-identification), secure Azure ML deployment patterns, and orchestration of production LLM workflows using LangChain and Airflow.”
Mid-level Full-Stack Software Engineer specializing in cloud-native microservices and AI integrations
“Backend engineer who has delivered large, measurable performance wins (10x throughput, 67% latency reduction) by combining Flask microservices, Redis caching, and AWS autoscaling/observability. Has hands-on depth in SQLAlchemy/Postgres optimization and production scaling pitfalls (cache consistency, connection exhaustion), plus experience deploying real-time ML inference (XGBoost) on AWS Lambda and building secure multi-tenant Kubernetes isolation.”
Mid-level Applied AI Engineer specializing in knowledge graphs, GraphRAG, and urban mobility
“ML/NLP practitioner focused on knowledge-graph-based retrieval for LLM question answering, including an urban/autonomous-vehicle decision-making use case. Built a hierarchical GraphRAG + vector database system and an entity-resolution pipeline that blends spatial and semantic similarity, validated using LLM-generated synthetic datasets; uses Python tooling like RDFLib, GraphDB, OpenAI APIs, and LangChain.”
Senior Software Engineer specializing in AI, cloud infrastructure, and full-stack development
“ML/NLP engineer who built a production system that converts large-scale unstructured text into a connected, searchable knowledge base using spaCy + Sentence Transformers/FAISS and a Neo4j knowledge graph, with BERTopic and XGBoost for organization/labeling. Strong focus on production-grade Python workflows (FastAPI/Celery, Pydantic validation, Docker, AWS ECS/Lambda) and robust entity resolution with measurable precision/recall and human review for low-confidence matches.”
Senior ML Engineer & Data Scientist specializing in LLM agents, retrieval/ranking, and MLOps
“Machine Learning Engineer currently at Webster Bank building an enterprise-scale LLM agent for Temenos Journey Manager/Maestro, using RAG-style multi-stage retrieval with FAISS/Pinecone, hybrid dense+sparse search, and LoRA fine-tuning optimized via NDCG/MAP and A/B testing. Previously handled messy incident/telemetry data at Deuta Werke GmbH with deterministic + fuzzy entity resolution, and has strong production data engineering experience across Spark/Hadoop and Python ETL systems.”
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and real-time fraud detection
“GenAI/ML engineer who has shipped production agentic systems in highly regulated and high-throughput environments, including an AWS Bedrock-based fraud/compliance workflow at U.S. Bank with PII redaction and hallucination detection that cut investigation time by 50%+. Also built and evaluated RAG and recommendation systems at Target, using RAGAS-driven testing, hybrid retrieval with re-ranking, and SHAP explainability dashboards to align model behavior with merchandising business KPIs.”
Mid-level Data Scientist specializing in MLOps, LLM/RAG applications, and deep learning
“Built and deployed a production compliance automation RAG system (at Citi) that generates citation-backed, schema-validated risk summaries for regulatory document review. Emphasizes regulated-environment reliability with retrieval-only grounding, abstention, confidence thresholds, and immutable audit logging, plus orchestration using LangChain/LangGraph and Airflow. Reported ~60% reduction in compliance review effort while maintaining high precision and traceability.”
Mid-level GenAI Engineer specializing in production AI agents and evaluation pipelines
“Built and shipped a production LLM-powered internal operations automation platform using LangChain RAG (Pinecone) and FastAPI microservices, deployed on AWS EKS, serving 10k+ daily interactions. Implemented a rigorous evaluation/observability stack (golden datasets, prompt regression tests, MLflow, retrieval metrics, hallucination monitoring) that drove hallucinations below 2% and improved reliability, and partnered closely with non-technical ops leaders to cut manual lookup work by 60%+.”