Pre-screened and vetted in the Austin Metro.
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
Mid-level Machine Learning Engineer specializing in MLOps and cloud-native ML systems
Mid-level Generative AI & Machine Learning Engineer specializing in LLMs and RAG
Senior Data & ML Engineer specializing in big data platforms and marketing/ads ML
Senior AI/ML Engineer specializing in conversational and generative AI
“Built and productionized an LLM-based support assistant end-to-end, including RAG, APIs, monitoring, guardrails, and agent feedback loops. Stands out for translating GenAI prototypes into reliable production systems with structured evaluation, safety controls, and reusable Python infrastructure that improved both support quality and engineering velocity.”
Junior Machine Learning Engineer specializing in NLP and computer vision
Senior AI Engineer specializing in LLMs, RAG, and multimodal NLP
“Built a production LLM/RAG assistant for insurance/health claims agents that ingests 100–200-page patient PDFs via OCR (migrated from local Tesseract to Azure Document Intelligence) and delivers grounded claim detail retrieval plus summaries with PII/PHI guardrails. Experienced in orchestrating large workflows with Celery worker pipelines and AWS Step Functions (S3-triggered, Fargate-based batch inference/accuracy aggregation), and collaborates closely with non-technical SMEs (claims agents/nurses) through shadowing, iterative demos, and SME-defined evaluation.”
Mid-level AI Engineer specializing in Ambient AI and full-stack applications
Mid-level AI/ML Engineer specializing in recommender systems, MLOps, and Generative AI
Junior Software Engineer specializing in AI/ML and cloud platforms
“LLM/agent engineer who shipped a production ‘Memory Assistant’ at HydroX AI, building a LangChain/LlamaIndex RAG memory pipeline on ChromaDB/FAISS with robust fallbacks (BERT/BART), prompt-injection mitigation, and 99.9% uptime monitoring. Also built a multi-step customer support agent using Rasa + OpenAI Assistants API with structured tool calling, guardrails, and human-in-the-loop escalation, and has experience hardening agents against messy ERP data via Pydantic validation, idempotency, and transactional outbox patterns.”
Staff Data Scientist specializing in AI/ML engineering and MLOps
“ML/NLP engineer with experience at Flatiron Health building a production NLP platform that processed millions of clinical notes, using BERT/BiLSTM-CRF and spaCy to extract and normalize entities from noisy EMR text with oncologist-in-the-loop validation. Also built scalable retail ML workflows (Spark + Kubernetes + feature store caching) and applied vector databases plus contrastive-learning fine-tuning to improve retrieval relevance and recommendations.”
Senior Machine Learning Engineer specializing in LLMs and scalable AI platforms
Mid-level Generative AI Developer specializing in LLM apps and RAG for FinTech and payments
Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps in Financial Services
“ML/LLM engineer at Charles Schwab who built a production loan-advisor chatbot integrated with internal knowledge and loan-calculator APIs, adding strict numeric validation to prevent rate hallucinations and optimizing context to control costs. Also runs ~40 Airflow DAGs orchestrating retraining/ETL/drift monitoring with an automated Snowflake→SageMaker→auto-deploy pipeline, and uses rigorous testing plus canary rollouts tied to business metrics and compliance constraints.”
Mid-level Backend/Data Engineer specializing in cloud APIs and data pipelines
Senior AI/ML Engineer specializing in Generative AI, Agentic AI, and RAG systems
Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps
“Built and deployed an LLM-powered financial/regulatory document analysis platform at State Street, combining fine-tuned transformer models with a RAG pipeline over internal knowledge bases. Owned the productionization stack (FastAPI, Docker, SageMaker, Terraform, CI/CD) plus monitoring for drift/latency/hallucinations, delivering ~40% faster analyst review and improved reliability through chunking/embeddings and grounding.”