Pre-screened and vetted in the NYC Metro.
Director-level Data Science Manager specializing in ML forecasting, experimentation, and MLOps
“Data/ML engineer with experience at American Express and Amazon, owning an end-to-end rewards redemption/liability ML pipeline (~200GB) with rigorous regulatory/audit validation and quarterly executive reporting. Also built web-scraped product datasets with anti-bot protections at a startup and helped modernize an authn/authz service using AWS, plus led early-stage migration work from an internal warehouse to GCP with CI/CD and cloud observability.”
Mid-level Data Analyst specializing in growth, product, and healthcare analytics
Senior Data Scientist specializing in LLM products, voice agents, and FinTech risk modeling
Mid-level analytics professional specializing in pricing and survey analytics
Junior Data Scientist specializing in customer and growth analytics
“Candidate combines fraud analytics experience at Citi with a clinical AI capstone involving reproducible ML pipelines for imaging and notes data. They stand out for turning messy, high-volume data into decision-ready reporting, automating evaluation workflows, and translating analytics into operational impact—from fraud rule changes to retention metric adoption.”
Junior business technology analyst specializing in supply chain and SAP operations
“Candidate brings a blend of consulting and brand-side marketing sourcing experience from Deloitte Consulting and L’Oreal. They stand out for using deep research, cost analysis, and one-on-one stakeholder engagement to drive agency negotiations, support a $4M annual savings target, and win adoption of new project tools like Smartsheet in change-resistant client environments.”
Senior Data Engineer specializing in cloud data platforms and big data pipelines
“Data engineer with healthcare (CVS Health) experience who migrated production PySpark workloads to native BigQuery SQL and built a Great Expectations-based validation microservice on GKE (Flask + REST) integrated into Cloud Composer. Has operated high-volume pipelines (~300–400GB/day) and designed external vendor ingestion on AWS (Lambda/Step Functions/Glue) with schema-drift detection, alerting, and backfill-safe controls to protect downstream Snowflake/BigQuery tables.”
Senior Data Engineer specializing in cloud data platforms and real-time pipelines
“Data engineer focused on reliability and observability, building end-to-end pipelines processing millions of records/day from sources like S3 and Kafka. Has hands-on experience with Airflow-based data quality automation, PySpark/Databricks transformations, and shipping versioned Python REST APIs deployed via Docker/Kubernetes with CI/CD (Jenkins) and monitoring (CloudWatch/Azure Logs).”
Mid-level Data Scientist specializing in financial risk, fraud detection, and GenAI NLP
Mid-level Data Scientist specializing in NLP, GenAI, and time-series modeling
Mid-level Data Engineer specializing in LLM agents, RAG pipelines, and LLMOps
Junior Machine Learning Engineer specializing in computer vision, LLMs, and geospatial AI
Senior Data Scientist specializing in LLMs, Agentic AI, and MLOps
Mid-level Data Scientist specializing in insurance, finance, and healthcare analytics
“Built and productionized LLM-driven sentiment scoring for earnings call transcripts at Goldman Sachs, replacing legacy NLP to deliver a cleaner trading signal while managing latency/cost via batching, caching, and distilled models. Also implemented an Airflow-orchestrated fraud modeling pipeline at MetLife with drift-based retraining and SageMaker deployment, and has a disciplined evaluation/rollout framework for reliable AI workflows.”
Senior AI/ML Data Scientist specializing in NLP, computer vision, and MLOps
“Applied LLMs and a graph-RAG architecture in Neo4j to automate an accounting firm's cross-checking of transactional books against tax regulations, indexing 1,000+ pages into a knowledge graph with vector search. Combines agentic LLM workflows with classical NER (Hugging Face/NLTK) and validates using expert-labeled held-out data plus precision/recall and measured accountant time savings after deployment.”
Junior Data Analyst specializing in finance, supply chain, and GTM analytics
Mid-level Data Engineer specializing in GCP, Spark, and healthcare analytics
Mid-level Business Analyst specializing in BI, reporting, and data analytics
“Finance data and reporting professional with PwC experience who bridges accounting and technology, especially around GL-related reconciliations, reporting accuracy, and close support. While not a direct PeopleSoft GL owner, they bring strong SQL-driven troubleshooting, ETL/data mapping remediation, and process automation experience that helped shorten close cycles and improve audit readiness.”
Mid-level Data Engineer specializing in streaming and cloud data platforms for financial services
“Data engineering-focused candidate (internship/project experience) who built end-to-end pipelines processing a few million transactional records/day for fraud detection and reporting, using Airflow, Python/SQL, and PySpark with strong emphasis on data quality gates, idempotency, and monitoring. Also implemented an external web/API data collection system with anti-bot tactics and schema-change quarantine, and shipped a versioned Flask API to serve curated warehouse data.”
Mid-level Data Scientist specializing in NLP/LLMs, time series forecasting, and MLOps
“Data/ML practitioner with hands-on experience building NLP systems from prototype to production: delivered a Twitter sentiment classifier with robust preprocessing, SVM modeling, and Power BI reporting, and built entity-resolution pipelines for messy multi-source customer data (reporting ~95% improvement in unique entity identification). Also implemented semantic linking/search using SBERT embeddings with FAISS vector retrieval and domain fine-tuning (reported ~15% precision lift), and applies production workflow best practices (Airflow/Prefect, Docker, Azure ML/Databricks, Great Expectations).”
Senior Big Data Engineer specializing in AML/KYC compliance and cloud data platforms
“Data engineer with experience delivering an end-to-end pipeline handling ~3.5TB in a star-schema setup (fact + dimensions) and producing business-facing tables in Hive/Spark. Identified and resolved UAT-reported duplicate issues caused by joins through root-cause analysis, and also built automation to run Spark SQL metrics on weekly/monthly/quarterly cadences and distribute results to users.”
Junior Machine Learning & Quant Research Engineer specializing in low-latency data and trading systems
“Applied ML to physical EV fleet systems at ST Labs, building a real-time CNN-LSTM fault prediction pipeline from streaming vehicle telemetry and addressing live data alignment issues via resampling/interpolation and buffered inference. Also developed a V2G/G2V energy transfer algorithm to automate charging/discharging for profit optimization, and made high-impact low-latency pipeline decisions at Astera Holdings using profiling, replay testing, and live A/B validation.”
Junior Data Engineer specializing in cloud ETL and big data platforms
“Data engineer focused on transit/transportation datasets, building Spark-based pipelines that ingest from Oracle/APIs, apply PySpark data-quality fixes, and publish star-schema fact tables to Azure Data Lake. Experienced troubleshooting complex Spark failures (using checkpointing to manage long lineage) and operating Airflow-driven backfills and GitLab CI deployments for production DAGs.”