Pre-screened and vetted.
Mid-level Data Engineer specializing in real-time pipelines across FinTech and Healthcare
“Data engineer at Plaid who built greenfield, end-to-end real-time transaction pipelines and FastAPI data services for fraud detection and analytics, handling millions of events per day. Strong focus on reliability and data integrity via Great Expectations validation, Airflow-based monitoring/SLAs, quarantine/staging patterns, and robust external data ingestion with schema versioning and backfills (reported 50% fewer anomalies and ~40% fewer failures).”
Mid-level Machine Learning Engineer specializing in Generative AI and real-time ML systems
“ML/GenAI engineer with hands-on experience shipping LLM-powered support systems at Uber, including real-time feedback analysis, ticket summarization, and retrieval-grounded knowledge systems. Stands out for combining fine-tuning, RAG, safety evaluation, and production optimization to drive measurable support outcomes like faster handling times, better resolution rates, and lower latency/cost.”
Senior Software Engineer specializing in .NET microservices for Healthcare IT and FinTech
Senior Data Engineer specializing in cloud data platforms and scalable ETL pipelines
Mid-level Data Engineer specializing in real-time streaming and ML feature pipelines
Mid-level AI/ML Engineer specializing in generative AI and data engineering
Mid-level AI/ML Engineer specializing in production ML, NLP, and computer vision
Mid-level Data Engineer specializing in cloud lakehouse and streaming analytics
Senior Data Engineer specializing in cloud lakehouse platforms and healthcare data
Mid-level AI Data Engineer specializing in real-time streaming and LLM-powered fraud analytics
Senior Software Engineer specializing in Healthcare IT and cloud-native microservices
Senior Data Analyst specializing in healthcare and financial analytics
Staff Full-Stack Software Engineer specializing in cloud-native microservices
Senior Data Engineer specializing in cloud data platforms and big data pipelines
Mid-level Software Engineer specializing in ML platforms and cloud-native backend systems
“Software engineer with experience at Google and the City and County of San Francisco building production AI systems, including a RAG-based internal support chatbot and ML-driven ticket priority tagging. Has scaled data/ML platforms with Airflow on GCP (1M+ records/day, 99.9% SLA) and deployed multi-component systems with Docker and Kubernetes (GKE), using modern LLM tooling (LangChain/CrewAI, Claude/OpenAI, Pinecone/ChromaDB, Bedrock/Ollama).”
Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines
“Data engineer with experience at Moderna and Block owning high-volume (≈10TB/day) production pipelines on AWS, using Kafka/S3/Glue/dbt/Snowflake with strong data quality and observability practices (schema validation, anomaly detection, CloudWatch monitoring). Also built external financial API ingestion with Airflow retries, throttling/token rotation, and schema versioning, and helped stand up an early-stage biomedical data platform with CI/CD and incident debugging.”
Senior Data Engineer specializing in cloud ETL and real-time streaming pipelines
“Data engineer with eBay experience owning end-to-end pipelines for real-time order and user behavior analytics at 10M+ records/day. Strong in PySpark/SQL transformations, Airflow reliability patterns, and production observability (CloudWatch), with measurable outcomes including improved data quality and 30–40% query performance gains. Also built Python data APIs for analytics/ML consumers with versioning and backward compatibility.”
Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP
“ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).”
Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps
“Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.”
Mid-Level Data Engineer specializing in cloud data platforms and streaming analytics
“Data engineer (Intuit) who owned an end-to-end telemetry and subscription analytics platform processing ~22M events/day, built on Kinesis/S3/Glue/Spark/Airflow/Redshift. Strong focus on reliability and data quality (schema drift controls, quarantine layers, idempotent reruns) and performance tuning, achieving a reporting latency reduction from ~15 minutes to under 4 minutes while enabling revenue and churn analytics for business teams.”