Pre-screened and vetted in the Bay Area.
Mid-level Data Engineer specializing in large-scale analytics platforms
“Data/Backend engineer with experience at Naukri building large-scale analytics products over a 130M+ user base, including Spark/Airflow pipelines and Kafka-based clickstream validation with Confluent Schema Registry. Also built an audience segmentation backend (Athena/S3 + Spring Boot APIs) for non-technical internal teams and recently shipped a GenAI customer data audit system (FastAPI/Postgres/Llama) that cut sales-planning validation from ~3 months to ~1 week.”
Intern Data Scientist specializing in generative AI and forecasting
“ML/NLP practitioner working across healthcare and business/finance use cases: currently fine-tuning a domain-specific Llama 3.1 model for safe reasoning over EHRs/clinical notes using RAG + RL/DPO and RAGAS-based evaluation. Has built UMLS-driven entity normalization pipelines with quantified quality gains and developed embedding/vector-DB systems (FAISS) for semantic matching and forecasting/recommendation applications at Aurora AI and Banxico.”
Intern Data Analyst specializing in geospatial ML and business analytics
Principal Technology Architect specializing in Salesforce Data Cloud, integrations, and agentic AI
Senior Data Scientist specializing in ML search, recommendations, and generative AI
Senior Data Scientist specializing in Generative AI, NLP, and MLOps
Mid-level GenAI & Analytics Engineer specializing in LLM and cloud cost/finance analytics
Senior Data Scientist specializing in LLMs, NLP, and anomaly detection
Junior Data Engineer specializing in BI, governed metrics, and workflow automation
“Built and shipped LLM/OCR/NLP-driven document-intelligence workflows in operational environments (EnvoyX and UPS), emphasizing production readiness via explicit state-machine orchestration, confidence gates, and human-in-the-loop review. Demonstrated strong business impact in customs brokerage/document ingestion: 50% fewer customs rejects, 30% higher throughput, SLA adherence improved from 71% to 96%, and platform reliability reaching 99.6% with 78% fewer bad-data incidents.”
Junior Full-Stack & Data Scientist specializing in ML/NLP and analytics products
“Built and deployed profitprops.io, a sports betting player-props prediction product using ML/AI. Implemented backend APIs with FastAPI/Express.js and Supabase, trained models on AWS GPU (P3) using Docker + RAPIDS, and set up CI/CD with GitHub Actions while working around cost constraints and data-collection hurdles (EC2 proxy rotation/rate limits).”
Intern Data Scientist specializing in computer vision and LLM agents
“Software engineering candidate with hands-on experience building and shipping LLM agents: created a production AI enrichment/coding agent at Covalent Metrology using Apollo.io + OpenAI, and built a Mistral hackathon router that dynamically selects among models to reduce token cost while maintaining quality. Also developed a real-time financial margin analysis agent that emails actionable insights and iterated on reliability issues (e.g., fixing misrouted emails, improving news relevance filtering).”
Intern Data Analyst specializing in business intelligence and financial analytics
“Analytics candidate with hands-on experience in both fraud and churn use cases, including SQL-based preparation of 6.5M transaction records and reproducible Python modeling workflows. Stands out for combining technical rigor in data quality, feature engineering, and imbalance handling with strong stakeholder alignment, metric definition, and dashboard adoption.”
Mid-level Data Engineer specializing in cloud data platforms and AI/ML pipelines
“Data-engineering-oriented candidate with hands-on experience building an agentic AI product and operational automation workflows. They described automating inventory-to-ERP discrepancy reconciliation with anomaly detection and daily reporting, and also have practical scraping/automation experience dealing with Cloudflare-protected sites using Selenium and Puppeteer.”
Junior AI & ML Engineer specializing in agentic systems and full-stack AI products
“Won a machine learning contest and was placed on a Kaiser data science team, where they built ML models for hospital bottleneck prediction and resource allocation. They later built and deployed a full-stack LLM-based ‘data analyst agent’ (with custom orchestration plus LangChain/OpenAI Agents experience) that generates analysis code, answers questions, and produces dashboards from uploaded datasets, emphasizing rigorous evaluation sets, robustness, and healthcare security/compliance integration.”
Mid-level Data Analyst specializing in AI/ML data quality and NLP
Mid-level Data Analyst specializing in analytics, machine learning, and financial services
Mid-level Data Engineer specializing in ML-driven pipelines and cloud microservices
Senior Data Scientist specializing in ML, NLP, and fraud analytics for regulated industries
Principal Data Engineer specializing in petabyte-scale Spark pipelines on GCP
Senior growth and digital acquisition leader specializing in retail media measurement
Junior Data Engineer specializing in cloud data pipelines and LLM/RAG systems
Junior Research Analyst specializing in deeptech venture sourcing and technical diligence