Pre-screened and vetted in New York.
Junior business technology analyst specializing in supply chain and SAP operations
“Candidate brings a blend of consulting and brand-side marketing sourcing experience from Deloitte Consulting and L’Oreal. They stand out for using deep research, cost analysis, and one-on-one stakeholder engagement to drive agency negotiations, support a $4M annual savings target, and win adoption of new project tools like Smartsheet in change-resistant client environments.”
Senior Data Engineer specializing in cloud data platforms and big data pipelines
“Data engineer with healthcare (CVS Health) experience who migrated production PySpark workloads to native BigQuery SQL and built a Great Expectations-based validation microservice on GKE (Flask + REST) integrated into Cloud Composer. Has operated high-volume pipelines (~300–400GB/day) and designed external vendor ingestion on AWS (Lambda/Step Functions/Glue) with schema-drift detection, alerting, and backfill-safe controls to protect downstream Snowflake/BigQuery tables.”
Senior Data Engineer specializing in cloud data platforms and real-time pipelines
“Data engineer focused on reliability and observability, building end-to-end pipelines processing millions of records/day from sources like S3 and Kafka. Has hands-on experience with Airflow-based data quality automation, PySpark/Databricks transformations, and shipping versioned Python REST APIs deployed via Docker/Kubernetes with CI/CD (Jenkins) and monitoring (CloudWatch/Azure Logs).”
Mid-level Data Scientist specializing in NLP, GenAI, and time-series modeling
Mid-level Data Engineer specializing in LLM agents, RAG pipelines, and LLMOps
Junior Machine Learning Engineer specializing in computer vision, LLMs, and geospatial AI
Senior Data Scientist specializing in LLMs, Agentic AI, and MLOps
Mid-level Data Scientist specializing in insurance, finance, and healthcare analytics
“Built and productionized LLM-driven sentiment scoring for earnings call transcripts at Goldman Sachs, replacing legacy NLP to deliver a cleaner trading signal while managing latency/cost via batching, caching, and distilled models. Also implemented an Airflow-orchestrated fraud modeling pipeline at MetLife with drift-based retraining and SageMaker deployment, and has a disciplined evaluation/rollout framework for reliable AI workflows.”
Senior AI/ML Data Scientist specializing in NLP, computer vision, and MLOps
“Applied LLMs and a graph-RAG architecture in Neo4j to automate an accounting firm's cross-checking of transactional books against tax regulations, indexing 1,000+ pages into a knowledge graph with vector search. Combines agentic LLM workflows with classical NER (Hugging Face/NLTK) and validates using expert-labeled held-out data plus precision/recall and measured accountant time savings after deployment.”
Junior Data Analyst specializing in finance, supply chain, and GTM analytics
Mid-level Data Engineer specializing in GCP, Spark, and healthcare analytics
Mid-level Data Scientist specializing in NLP/LLMs, time series forecasting, and MLOps
“Data/ML practitioner with hands-on experience building NLP systems from prototype to production: delivered a Twitter sentiment classifier with robust preprocessing, SVM modeling, and Power BI reporting, and built entity-resolution pipelines for messy multi-source customer data (reporting ~95% improvement in unique entity identification). Also implemented semantic linking/search using SBERT embeddings with FAISS vector retrieval and domain fine-tuning (reported ~15% precision lift), and applies production workflow best practices (Airflow/Prefect, Docker, Azure ML/Databricks, Great Expectations).”
Senior Big Data Engineer specializing in AML/KYC compliance and cloud data platforms
“Data engineer with experience delivering an end-to-end pipeline handling ~3.5TB in a star-schema setup (fact + dimensions) and producing business-facing tables in Hive/Spark. Identified and resolved UAT-reported duplicate issues caused by joins through root-cause analysis, and also built automation to run Spark SQL metrics on weekly/monthly/quarterly cadences and distribute results to users.”
“Built and productionized an LLM-powered PDF document Q&A system to eliminate manual searching through long documents, focusing on scalability and answer reliability. Implemented semantic chunking (using headings/paragraphs/tables), overlap, and preprocessing/quality checks to reduce hallucinations, and orchestrated the end-to-end pipeline with Airflow using retries, alerts, and parallel tasks.”
Junior Machine Learning & Quant Research Engineer specializing in low-latency data and trading systems
“Applied ML to physical EV fleet systems at ST Labs, building a real-time CNN-LSTM fault prediction pipeline from streaming vehicle telemetry and addressing live data alignment issues via resampling/interpolation and buffered inference. Also developed a V2G/G2V energy transfer algorithm to automate charging/discharging for profit optimization, and made high-impact low-latency pipeline decisions at Astera Holdings using profiling, replay testing, and live A/B validation.”
Mid-level AI/ML Data Scientist specializing in NLP, computer vision, and risk analytics
“ML/AI engineer with Capital One experience building production-grade customer segmentation and fraud detection systems combining NLP (transformers) and anomaly detection. Strong MLOps and orchestration background (PySpark ETL, MLflow, Airflow, Docker/Kubernetes, Azure ML) with real-time monitoring/alerting and performance optimizations like quantization and caching, plus proven ability to deliver business-facing insights through Power BI/Tableau for marketing stakeholders.”
Junior Data Engineer specializing in cloud ETL and big data platforms
“Data engineer focused on transit/transportation datasets, building Spark-based pipelines that ingest from Oracle/APIs, apply PySpark data-quality fixes, and publish star-schema fact tables to Azure Data Lake. Experienced troubleshooting complex Spark failures (using checkpointing to manage long lineage) and operating Airflow-driven backfills and GitLab CI deployments for production DAGs.”
Intern-level Sales and Outreach Professional specializing in sustainability and client engagement
“Sales/business development candidate with hands-on outbound experience across donor/partner and community stakeholder outreach, including multi-channel campaigns (email, word-of-mouth, door-to-door) managed through Salesforce. Applied data-driven targeting (NY census) to focus outreach on high-poverty areas and has experience operating in an early-stage/ambiguous environment at NBC during the Paris Olympics, using proactive coordination and meetings to create structure.”
Mid-level Data Scientist specializing in fraud detection and ML pipelines
Senior Data Engineer specializing in cloud data platforms and Generative AI