Pre-screened and vetted in Texas.
Senior Data Scientist specializing in machine learning, NLP, and MLOps
“ML/NLP engineer with experience building production-grade legal-tech and data platforms, including a GPT-4/LangChain contract review system using ElasticSearch embeddings (RAG) deployed on AWS EKS. Strong in entity resolution and scalable batch/streaming pipelines (Kafka/Spark), with measurable impact (70%+ reduction in contract review time) and a focus on monitoring and CI/CD for reliable delivery.”
Senior Data Scientist specializing in AI/Deep Learning and applied machine learning
Mid-level Data Scientist specializing in LLMs, RAG, and personalization
Senior Digital Analyst specializing in marketing analytics, personalization, and MarTech
Senior Data Engineer specializing in cloud-scale data pipelines and legal data systems
Senior Business Analyst specializing in data analytics and business intelligence
Senior Data Engineer specializing in cloud-scale pipelines and legal data utilities
Senior Data Engineer specializing in cloud data platforms and big data pipelines
Senior Data Engineer specializing in cloud ETL and real-time streaming pipelines
“Data engineer with eBay experience owning end-to-end pipelines for real-time order and user behavior analytics at 10M+ records/day. Strong in PySpark/SQL transformations, Airflow reliability patterns, and production observability (CloudWatch), with measurable outcomes including improved data quality and 30–40% query performance gains. Also built Python data APIs for analytics/ML consumers with versioning and backward compatibility.”
Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP
“ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).”
Senior Data Engineer specializing in cloud lakehouse and real-time streaming pipelines
“Senior data engineer with experience in both healthcare (CVS Health) and financial services (Bank of America), building large-scale Azure lakehouse pipelines (30+ EHR sources, ~5TB) and real-time streaming services (Event Hubs/Kafka) for patient vitals. Strong focus on reliability and data quality (Great Expectations, monitoring/alerting, schema drift automation), with measurable outcomes like 50% runtime reduction and 99%+ uptime for regulatory reporting pipelines.”
Senior Data Engineer specializing in data pipelines, APIs, and machine learning
“Data engineer with experience at Expedia building SQL Server and Azure Data Factory pipelines for business reporting and analytics. Stands out for pragmatic end-to-end pipeline ownership in ambiguous environments, with a strong emphasis on data quality, rerunnability, query performance, and making downstream datasets reliable for other teams.”
Senior Data Engineer specializing in AI/ML platforms and legal data pipelines
Senior Data Engineer specializing in cloud data platforms and streaming pipelines
Senior Data Scientist specializing in AI agents and LLM production systems
Senior Data Engineer specializing in cloud analytics and real-time streaming
Director of Data Science specializing in ML, NLP/LLMs, and MLOps
Mid-level Data Engineer specializing in cloud ETL and real-time analytics
Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines
Mid-level Data Engineer specializing in streaming and cloud lakehouse platforms
Senior Data Scientist specializing in analytics, experimentation, and BI on AWS
“Data/ML practitioner focused on healthcare data quality and record linkage: analyzed 10M+ records, built anomaly detection and NLP-driven entity resolution, and automated AWS ETL/validation pipelines (Glue/Redshift/Lambda), cutting data errors by 40% and generating $500k in annual savings. Has hands-on experience with embeddings (Sentence Transformers/spaCy), FAISS vector search, and fine-tuning for domain-specific matching.”