Pre-screened and vetted in the NYC Metro.
Mid-level Data Engineer specializing in cloud ETL/ELT, Spark, and streaming pipelines
Mid-level Data Scientist specializing in LLMs, NLP, and predictive modeling in healthcare and finance
Mid-level Data Scientist specializing in GenAI, RAG, and multi-agent orchestration
Mid-Level Data Engineer specializing in cloud data platforms (AWS & GCP)
Mid-level Data Scientist specializing in ML, NLP, and fraud detection
Mid-level Data Scientist specializing in NLP, deep learning, and Generative AI
Senior Data Analyst specializing in healthcare, biopharma, and financial services analytics
Mid-level Data Scientist specializing in ML, NLP, and forecasting across finance and retail
Mid-level Data Scientist / ML Engineer specializing in healthcare predictive analytics and NLP
“Built and deployed a real-time hospital readmission risk prediction system at NYU Langone Health, combining structured EHR data with BERT-based NLP on clinical notes and serving predictions to clinicians via Azure ML and FHIR APIs. Emphasizes production reliability and clinical trust through SHAP-based explainability and robust healthcare data preprocessing, and reports a 22% reduction in 30-day readmissions.”
Senior Data Scientist specializing in healthcare ML, LLMs, and responsible AI
“Clinical data scientist who has built an agentic LLM-powered literature review assistant (with RAG-style storage/retrieval) to identify predictors for downstream predictive modeling. Also delivered a patient-focused progression analysis model using Databricks + Airflow orchestration, partnering closely with clinicians to define targets and validate that model insights aligned with clinical expectations.”
Principal Data Scientist specializing in cybersecurity ML and MLOps
“ML/NLP engineer (Beyond Identity) who built production semantic search and entity-resolution systems over internal security documentation, using LDA + BERT embeddings with FAISS/Pinecone to cut search time by 30%. Also scaled a real-time anomaly detection pipeline to millions of events/day with Spark and AWS Lambda, with strong emphasis on measurable validation (Precision@k, MRR, F1, ARI).”
Senior Data Engineer specializing in ETL/ELT pipelines and data integration platforms
“Data engineer/software engineer who led an end-to-end ETL/ELT pipeline at Pearson processing millions of rows of student data nightly, including client-side data prep/validation, SFTP/API ingestion, staging-based SQL validation/transforms, and production loading. Built reliability features like configurable per-client validation thresholds, detailed reporting, concurrency throttling via a custom queue, and multi-source merge/backfill logic to keep nightly loads running even when sources fail.”
Senior Insights Manager specializing in audience research and ad effectiveness
Junior Systems & Software Engineer specializing in distributed systems and industrial automation
Mid-level Data Scientist specializing in machine learning, computer vision, and generative AI
Senior Lead Data Engineer specializing in cloud data platforms and real-time ML pipelines
Mid-level Data Analyst/Data Engineer specializing in machine learning and NLP
Mid-Level Data Engineer specializing in cloud data pipelines and big data platforms
“Data engineer with ~4 years of experience building Python-based data ingestion/processing services and real-time streaming pipelines (Kafka/PubSub + Spark Structured Streaming). Has deployed containerized data applications on Kubernetes with GitLab CI/Jenkins pipelines and applied GitOps to cut deployment time ~40% while reducing config drift. Also supported a legacy on-prem data warehouse/backend migration to GCP using phased migration and parallel validation to meet strict reliability/SLA needs.”
Junior Data Engineer specializing in cloud ETL/ELT and lakehouse platforms
Senior Data Scientist specializing in LLM applications, RAG systems, and production ML
“Senior Data Scientist in consulting who has built production RAG systems for insurance/annuity document search at large scale (100K+ PDF pages), emphasizing grounded answers, guardrails, and low-latency retrieval. Experienced in end-to-end MLOps for LLM apps—monitoring, evaluation sets, drift handling, and safe rollouts—and in orchestrating complex pipelines with Prefect/Airflow and deploying services on Kubernetes.”
Intern Data Analyst and Software Engineer specializing in AI/ML and data platforms
Mid-level Research Analyst specializing in equity research and AI-exposed public companies
Director-level Data & Analytics Manager specializing in ML, Snowflake ELT, and RAG
Senior Data Scientist / AI Engineer specializing in LLMs, RAG, and production ML
“Data science professional who has built a production RAG-based LLM question-answering system ("Flash Query") to deliver fast, accurate answers over large document collections, focusing on retrieval quality and grounded responses. Also collaborates with non-technical retail/jewelry stakeholders to turn business questions into predictive models and dashboards for decision-making.”