Pre-screened and vetted.
Mid-level Data Engineer specializing in AI/ML data platforms and real-time streaming
Mid-level Data Engineer specializing in cloud lakehouse and streaming pipelines
Mid-level Data Engineer specializing in streaming and cloud lakehouse platforms
Mid-level Data Engineer specializing in AWS, Spark, and streaming data pipelines
Mid-level Data Engineer specializing in cloud-native ETL and data warehousing
Senior AI Platform Engineer specializing in agentic AI and RAG systems
Senior AI/ML Engineer specializing in GenAI, LLMs, NLP, and MLOps
Executive Product & Technology Leader specializing in AI and healthcare platforms
Senior Data Scientist specializing in analytics, experimentation, and BI on AWS
“Data/ML practitioner focused on healthcare data quality and record linkage: analyzed 10M+ records, built anomaly detection and NLP-driven entity resolution, and automated AWS ETL/validation pipelines (Glue/Redshift/Lambda), cutting data errors by 40% and generating $500k in annual savings. Has hands-on experience with embeddings (Sentence Transformers/spaCy), FAISS vector search, and fine-tuning for domain-specific matching.”
Mid-level Data Engineer specializing in cloud data platforms and real-time streaming
“Worked on onboarding a Middle East logistics client processing thousands of invoices/month, building a production-ready pipeline that routes known vendor PDFs to deterministic regex parsers via Tax ID matching and falls back to LlamaParse for unknown layouts. Added financial consistency validation plus human-in-the-loop review and logging/metrics to continuously reduce LLM usage and improve template coverage.”
“Built and deployed a production RAG-based LLM Q&A and summarization platform for internal documents, emphasizing grounded answers with structured prompting and citations to reduce hallucinations. Experienced orchestrating end-to-end LLM workflows with LangChain plus cloud pipelines (Azure ML Pipelines, AWS), and runs iterative evaluation using both metrics (accuracy/hallucination/latency/cost) and real user feedback to drive reliability.”
Mid-level Data Analyst specializing in retention, churn, and customer analytics
“Analytics professional with experience across healthcare and fintech, including building SQL/Python data pipelines at Optum and owning a fraud detection initiative at Razorpay. Stands out for combining messy-data cleanup, reproducible analytics workflows, and stakeholder-driven metric design, with a reported 25% improvement in fraud detection while keeping false positives under control.”
Senior Full-Stack Java Engineer specializing in cloud-native microservices
“Backend/platform engineer who owned high-volume Java/Spring Boot microservices on AWS (Kafka + RDS/DynamoDB) and has hands-on experience debugging complex production latency incidents across DB, JVM/GC, and async consumers. Also shipped applied AI features for ops, including an LLM-powered log analysis assistant and an incident-response agent with strong safety guardrails (schema-validated tool use, retries/backoff, and human-in-the-loop escalation).”
Principal AI/ML Architect specializing in GenAI, LLMs, RAG, and Agentic AI
“FinTech/AI engineer who has shipped an end-to-end discrepancy-detection product for financial managers using Next.js, FastAPI/GraphQL, Pinecone, and AWS (with dev/staging/prod, observability, A/B testing, and documentation). Also built an AI-native “AI Genesis” system with agentic cyclic workflows, routing, and tool use, and has experience modernizing legacy systems via the strangler fig pattern while coordinating with senior stakeholders on a 5G autonomous simulation platform.”
Mid-level Data Scientist specializing in insurance, finance, and healthcare analytics
“Built and productionized LLM-driven sentiment scoring for earnings call transcripts at Goldman Sachs, replacing legacy NLP to deliver a cleaner trading signal while managing latency/cost via batching, caching, and distilled models. Also implemented an Airflow-orchestrated fraud modeling pipeline at MetLife with drift-based retraining and SageMaker deployment, and has a disciplined evaluation/rollout framework for reliable AI workflows.”
Junior Software Engineer specializing in LLM systems, data engineering, and ML
“Backend/ML systems engineer with experience at SDSC, UCSD, and Media.net, building production semantic dataset/model discovery using embeddings + Solr KNN and LLM-based intent/reranking at 5M+ dataset scale. Emphasizes offline/online separation for predictable serving, has delivered measurable gains (23% retrieval accuracy, 38% latency reduction) and helped secure a $3M+ NSF grant.”
Director-level Software Engineering Leader specializing in AI platforms and full-stack cloud systems
“Engineering leader with BCG consulting background who has built roadmaps and scaled AI and data platforms for pharma and manufacturing clients. Led architecture shifts (Django monolith to event-driven microservices) for high-volume IoT SaaS products, improving deployment speed and enabling zero-downtime releases. Also established a near-shore engineering team in São Paulo and has managed distributed teams across multiple countries, leveraging strong stakeholder communication and a prior professional acting background for storytelling.”
Senior AI/ML Data Scientist specializing in NLP, computer vision, and MLOps
“Applied LLMs and a graph-RAG architecture in Neo4j to automate an accounting firm's cross-checking of transactional books against tax regulations, indexing 1,000+ pages into a knowledge graph with vector search. Combines agentic LLM workflows with classical NER (Hugging Face/NLTK) and validates using expert-labeled held-out data plus precision/recall and measured accountant time savings after deployment.”
Senior Data Engineer specializing in multi-cloud data platforms and streaming pipelines
“Data platform engineer with hands-on ownership of high-volume financial data pipelines (millions of transactions/day) on Azure (ADF, Databricks, Delta Lake, Synapse), emphasizing schema-drift protection and automated data-quality gates. Also built resilient web scraping pipelines with anti-bot and backfill strategies, and shipped a versioned FastAPI + Redis data API with autoscaling, testing, and CI/CD via GitHub Actions.”
Mid-level Data Analyst specializing in SaaS product and business analytics
“Analytics professional with hands-on experience building SQL and Python workflows for support operations and product reporting. They stand out for turning messy CRM, ticket, and activity data into validated, performance-optimized reporting tables and dashboards, while partnering closely with stakeholders to standardize KPI definitions around SLA performance and retention.”
Mid-level Software Engineer specializing in cloud-native microservices and real-time data pipelines
Mid-Level Full-Stack Software Engineer specializing in microservices and fraud detection
Senior Data Scientist specializing in GenAI, LLM systems, and production ML