Pre-screened and vetted in the NYC Metro.
Mid-level Data Engineer specializing in streaming and cloud data platforms for financial services
“Data engineering-focused candidate (internship/project experience) who built end-to-end pipelines processing a few million transactional records/day for fraud detection and reporting, using Airflow, Python/SQL, and PySpark with strong emphasis on data quality gates, idempotency, and monitoring. Also implemented an external web/API data collection system with anti-bot tactics and schema-change quarantine, and shipped a versioned Flask API to serve curated warehouse data.”
Mid-level Data Scientist specializing in NLP/LLMs, time series forecasting, and MLOps
“Data/ML practitioner with hands-on experience building NLP systems from prototype to production: delivered a Twitter sentiment classifier with robust preprocessing, SVM modeling, and Power BI reporting, and built entity-resolution pipelines for messy multi-source customer data (reporting ~95% improvement in unique entity identification). Also implemented semantic linking/search using SBERT embeddings with FAISS vector retrieval and domain fine-tuning (reported ~15% precision lift), and applies production workflow best practices (Airflow/Prefect, Docker, Azure ML/Databricks, Great Expectations).”
Senior Big Data Engineer specializing in AML/KYC compliance and cloud data platforms
“Data engineer with experience delivering an end-to-end pipeline handling ~3.5TB in a star-schema setup (fact + dimensions) and producing business-facing tables in Hive/Spark. Identified and resolved UAT-reported duplicate issues caused by joins through root-cause analysis, and also built automation to run Spark SQL metrics on weekly/monthly/quarterly cadences and distribute results to users.”
Junior Machine Learning & Quant Research Engineer specializing in low-latency data and trading systems
“Applied ML to physical EV fleet systems at ST Labs, building a real-time CNN-LSTM fault prediction pipeline from streaming vehicle telemetry and addressing live data alignment issues via resampling/interpolation and buffered inference. Also developed a V2G/G2V energy transfer algorithm to automate charging/discharging for profit optimization, and made high-impact low-latency pipeline decisions at Astera Holdings using profiling, replay testing, and live A/B validation.”
Junior Data Engineer specializing in cloud ETL and big data platforms
“Data engineer focused on transit/transportation datasets, building Spark-based pipelines that ingest from Oracle/APIs, apply PySpark data-quality fixes, and publish star-schema fact tables to Azure Data Lake. Experienced troubleshooting complex Spark failures (using checkpointing to manage long lineage) and operating Airflow-driven backfills and GitLab CI deployments for production DAGs.”
Intern-level Sales and Outreach Professional specializing in sustainability and client engagement
“Sales/business development candidate with hands-on outbound experience across donor/partner and community stakeholder outreach, including multi-channel campaigns (email, word-of-mouth, door-to-door) managed through Salesforce. Applied data-driven targeting (NY census) to focus outreach on high-poverty areas and has experience operating in an early-stage/ambiguous environment at NBC during the Paris Olympics, using proactive coordination and meetings to create structure.”
Junior Data & AI professional specializing in analytics, ML, and LLM systems
“Full-stack product builder with strong GTM and applied AI experience, including end-to-end ownership of a production lead intelligence platform that combined React/TypeScript, Python services, external data enrichment, and LLM orchestration. Notably reduced SDR research time from 15-20 minutes to under 2 minutes per account and also drove an 8% revenue increase at Finding Pi by building a customer segmentation framework from analysis of 45k+ users.”
Mid-level Data Scientist specializing in fraud detection and ML pipelines
Senior Data Engineer specializing in cloud ELT/ETL and data warehousing
Junior Product & Business Analyst specializing in analytics, dashboards, and go-to-market strategy
Mid-level Data Scientist specializing in experimentation, personalization, and decision intelligence
Mid-level Data Engineer specializing in lakehouse and cloud data platforms
Principal/Lead Data Engineer specializing in large-scale pipelines, NLP, and graph databases
Junior Data Scientist specializing in analytics automation and BI dashboards
Mid-level Data Analyst specializing in healthcare and financial analytics
Mid-level Data Analyst specializing in predictive analytics and BI for financial services
Junior Data Scientist specializing in applied machine learning and analytics
Junior Data Analyst specializing in automation, BI dashboards, and applied machine learning