Pre-screened and vetted in Texas.
Senior AI/ML Software Engineer specializing in LLMs, NLP, and scalable ML platforms
Senior Data & ML Engineer specializing in big data platforms and marketing/ads ML
Senior Data/GenAI Engineer specializing in cloud-native ML, RAG, and real-time data platforms
Senior Python AI/ML Engineer specializing in MLOps, data engineering, and LLM applications
Senior Data Engineer specializing in cloud data platforms and scalable ETL pipelines
Senior Data Engineer specializing in cloud-scale data pipelines and legal data systems
Senior Data Engineer specializing in cloud-scale pipelines and legal data utilities
Senior Data Engineer specializing in cloud data platforms and big data pipelines
Senior Data Engineer specializing in cloud ETL and real-time streaming pipelines
“Data engineer with eBay experience owning end-to-end pipelines for real-time order and user behavior analytics at 10M+ records/day. Strong in PySpark/SQL transformations, Airflow reliability patterns, and production observability (CloudWatch), with measurable outcomes including improved data quality and 30–40% query performance gains. Also built Python data APIs for analytics/ML consumers with versioning and backward compatibility.”
Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP
“ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).”
Senior Data Engineer specializing in cloud lakehouse and real-time streaming pipelines
“Senior data engineer with experience in both healthcare (CVS Health) and financial services (Bank of America), building large-scale Azure lakehouse pipelines (30+ EHR sources, ~5TB) and real-time streaming services (Event Hubs/Kafka) for patient vitals. Strong focus on reliability and data quality (Great Expectations, monitoring/alerting, schema drift automation), with measurable outcomes like 50% runtime reduction and 99%+ uptime for regulatory reporting pipelines.”
Senior Data Engineer specializing in data pipelines, APIs, and machine learning
“Data engineer with experience at Expedia building SQL Server and Azure Data Factory pipelines for business reporting and analytics. Stands out for pragmatic end-to-end pipeline ownership in ambiguous environments, with a strong emphasis on data quality, rerunnability, query performance, and making downstream datasets reliable for other teams.”
Mid-level AI Engineer specializing in Generative AI, LLMs, and RAG systems
Senior Data Engineer specializing in AI/ML platforms and legal data pipelines
Senior Data Engineer specializing in cloud data platforms and streaming pipelines
Senior Data Engineer specializing in cloud analytics and real-time streaming
Mid-level Data Engineer specializing in cloud ETL and real-time analytics
Mid-level AI/ML Data Engineer specializing in analytics, ML pipelines, and LLM applications
Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines
Mid-level Data Engineer specializing in streaming and cloud lakehouse platforms
“GenAI/data engineering practitioner with production experience across Equinix, Optum, and Citibank—built an Azure OpenAI (GPT-4) + LangChain document intelligence platform processing 1.5M+ docs/month and a HIPAA-compliant Airflow healthcare pipeline handling 5M+ claims/day. Also delivered a real-time fraud detection + explainability system using LightGBM and a fine-tuned T5 NLG component, improving fraud accuracy by 15%+ while partnering closely with compliance stakeholders.”
Senior Data Engineer specializing in Azure Lakehouse, Databricks/Spark, and Snowflake
“Data engineer/platform builder with experience across PwC and Liberty Mutual delivering high-volume, production-grade pipelines and real-time data services. Has owned end-to-end streaming + batch architectures on AWS and Azure, including web scraping systems, with quantified reliability gains (99.9% availability, 90%+ error reduction, 30% latency reduction) and strong observability/CI-CD practices.”