Pre-screened and vetted in the NYC Metro.
Staff-level Software Engineer specializing in AI, data platforms, and cloud infrastructure
Senior Data & AI/ML Engineer specializing in LLM/NLP platforms and cloud data engineering
Mid-level Data Engineer specializing in LLM agents, RAG pipelines, and LLMOps
Junior Data Scientist specializing in analytics automation and BI dashboards
Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps
“Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.”
Mid-level Azure Data Engineer specializing in Databricks lakehouse and Spark pipelines
Senior AI/ML Engineer specializing in Python, LLMs, and agentic AI on cloud platforms
Mid-level Data Engineer specializing in cloud ETL, big data, and analytics
Senior Backend/Cloud Developer specializing in Python and AWS-native data workflows
Mid-level Data Engineer specializing in cloud ETL/ELT, Spark, and streaming pipelines
Mid-Level Data Engineer specializing in cloud data platforms (AWS & GCP)
Mid-level AI/Data Engineer specializing in LLM agents, RAG, and cloud data pipelines
Senior Lead Data Engineer specializing in cloud data platforms and real-time ML pipelines
Mid-level Data Analyst/Data Engineer specializing in machine learning and NLP
Mid-Level Data Engineer specializing in cloud data pipelines and big data platforms
“Data engineer with ~4 years of experience building Python-based data ingestion/processing services and real-time streaming pipelines (Kafka/PubSub + Spark Structured Streaming). Has deployed containerized data applications on Kubernetes with GitLab CI/Jenkins pipelines and applied GitOps to cut deployment time ~40% while reducing config drift. Also supported a legacy on-prem data warehouse/backend migration to GCP using phased migration and parallel validation to meet strict reliability/SLA needs.”
Junior Data Engineer specializing in cloud ETL/ELT and lakehouse platforms
Junior Backend Software Engineer specializing in search, data systems, and LLM applications
“Built a contract and customer documentation retrieval solution for Urban Studio, designing a RAG + Elasticsearch hybrid search stack (RRF + cross-encoder reranking) with a strong emphasis on chunking/data quality and hallucination reduction. Experienced in diagnosing LLM workflow issues via observability traces and tailoring technical demos to developer concerns like reliability and high concurrency.”