Pre-screened and vetted.
Senior Business Analyst specializing in regulated systems and data-driven process optimization
“Analytics candidate with hands-on experience building SQL-driven reporting tables and end-to-end KPI dashboards in Power BI and Excel. They emphasize stakeholder alignment, metric clarity, and data trust, including defining retention metrics, documenting dashboards, and creating centralized reporting that reduced manual work and improved decision-making.”
Junior Analytics Engineer specializing in modern data platforms
“Analytics engineer/data professional with strong healthcare and membership analytics experience, combining SQL, dbt, BigQuery, Python, and Tableau to turn messy source data into trusted executive reporting. Stands out for metric governance and stakeholder alignment work, including unifying conflicting business definitions and delivering a CMS market-risk model that identified $792M in excess payer costs.”
Entry-level Data Scientist specializing in LLMs and analytics
“Built a zero-to-one AI contract/policy QA agent for compliance and data teams, with a strong emphasis on trust, traceability, and clause-level citations rather than just fluent answers. They combine full-stack product ownership with practical LLM systems design, including hybrid retrieval, structured outputs, and evaluation pipelines to improve reliability, latency, and cost.”
Mid-level Data Engineer specializing in cloud data platforms and AI/ML analytics
“Backend/data engineer in healthcare who built an AWS-based clinical analytics platform from scratch (DynamoDB/S3/Airflow/dbt) with sub-second clinician query goals, 99.9% uptime, and HIPAA-grade controls (KMS encryption, IAM RBAC, audit trails). Also modernized ML delivery by replacing a manual 4-hour deployment with a 30-minute Docker/GitHub Actions CI/CD pipeline using parallel runs, parity testing, and rollback, and caught critical EHR data edge cases (date formats/timezones) that could have impacted patient care.”
Intern Data Scientist specializing in robotics localization and SLAM
“Robotics/embodied-AI practitioner who built a TurtleBot3 LiDAR-fingerprint localization pipeline end-to-end (autonomous data collection + multi-head NN) achieving ~30 cm error in a 10 × 10 m space. Also has industry experience at Infineon building large-scale production data/AI pipelines and rapidly fixing a deployed recommendation system by correcting upstream data normalization, improving accuracy by 20%+.”
Intern AI/ML Software Engineer specializing in RAG and medical AI
“ML/LLM engineer with production experience building medical RAG systems to automate chart review, including retrieval + re-ranking and rigorous evaluation. Notably uncovered errors/bias in physician-curated ground truth by tracing answers back to source note chunks and presented evidence to an academic partner, accelerating deployment. Also built a RAG-based FAQ chatbot for a health insurance company and delivered it to non-technical stakeholders via demos.”
Senior Data Scientist specializing in healthcare ML, LLMs, and responsible AI
“Clinical data scientist who has built an agentic LLM-powered literature review assistant (with RAG-style storage/retrieval) to identify predictors for downstream predictive modeling. Also delivered a patient-focused progression analysis model using Databricks + Airflow orchestration, partnering closely with clinicians to define targets and validate that model insights aligned with clinical expectations.”
Principal Data Scientist specializing in cybersecurity ML and MLOps
“ML/NLP engineer (Beyond Identity) who built production semantic search and entity-resolution systems over internal security documentation, using LDA + BERT embeddings with FAISS/Pinecone to cut search time by 30%. Also scaled a real-time anomaly detection pipeline to millions of events/day with Spark and AWS Lambda, with strong emphasis on measurable validation (Precision@k, MRR, F1, ARI).”
Mid-level AI & Data Scientist specializing in LLMs, RAG, and healthcare NLP
“Built a production LLM/RAG solution for healthcare operations teams to query large policy and care-guideline repositories in natural language. Improved domain alignment using vector retrieval plus parameter-efficient fine-tuning and prompt optimization, validated through internal user testing and metrics, cutting manual lookup time by ~40%. Also has hands-on experience orchestrating automated ML pipelines with Apache Airflow.”
Intern Data Analyst specializing in data pipelines and LLM/RAG applications
“Built and deployed LLM-powered analytics and reporting systems, including a RAG-based assistant over Snowflake that let business users ask questions in plain English instead of writing SQL. Experienced orchestrating LLM agents (LangChain) and serverless reporting pipelines (AWS Lambda/S3/RDS), with a strong focus on grounded outputs, monitoring/evaluation, and data quality—used daily by non-technical finance and operations teams at Cigna.”
Mid-level AI/ML Engineer specializing in GenAI and cloud MLOps
“Applied LLMs to high-stakes domains (wildfire risk for emergency teams and loan approval via a fine-tuned IBM Granite model), with a strong focus on reliability—using RAG-based cross-validation to reduce hallucinations and continuous ingestion pipelines (MODIS satellite imagery via AWS Lambda) to keep data current. Experienced in production orchestration and MLOps-style workflows using Airflow, AWS Step Functions, and SageMaker Pipelines, and collaborates closely with analysts on KPI-driven evaluation.”
Intern Data Engineer specializing in Snowflake pipelines and AI/ML analytics
“Built and operated an end-to-end TypeScript/Node AI agent platform for high-volume financial data that generates explainable investment signals and automates execution via resilient Playwright browser automation. Uses Postgres + pgvector/Prisma for RAG retrieval, Redis for async orchestration, Zod-based boundary validation as a circuit breaker, and OpenTelemetry for tracing/latency monitoring; also designed a TypeScript SDK with semver, scoped bearer-token auth, CLI key rotation, and interactive Swagger docs.”
Junior Data Analyst specializing in BI, ETL, and reporting
“Analytics professional with hands-on experience building SQL and Python workflows across SAP, Oracle, and internal operational systems, processing roughly 5 million records per month. They combine strong data quality rigor with stakeholder-friendly Power BI reporting, and cite a concrete impact of cutting reporting turnaround time from four days to two while surfacing cost anomalies for business teams.”
Mid-level Business Analyst specializing in analytics, operations, and supply chain
“Analytics candidate with hands-on experience improving enterprise reporting and operational decision-making at Reliance and Wendy’s. They combine SQL optimization, Python automation, sentiment analysis, and dashboarding to deliver measurable impact, including cutting report runtimes from 3 minutes to 1 minute, improving model accuracy from 70% to 80%, and reducing supplier past dues by 30%.”
Mid-level Data Scientist specializing in machine learning, NLP, and healthcare AI
“Senior data scientist with hands-on ownership of production ML and GenAI systems across enterprise churn, clinical Q&A, and real-time fraud detection. Stands out for combining strong MLOps discipline with measurable business impact, including $2M+ retained revenue, 10K TPS low-latency fraud infrastructure, and a clinician-reviewed RAG system that improved retrieval accuracy by ~38%.”
Junior Data Scientist specializing in ML, geospatial analytics, and LLM applications
“Built and deployed a production AI ‘term explainer’ agent that adapts explanations to beginner/intermediate/expert users by combining multi-step LLM reasoning with grounded Wikipedia retrieval. Owns end-to-end agent orchestration (smolagents/Python), reliability patterns (fallback across LLM providers, retries, guardrails), and observability/metrics-driven evaluation; also partnered with a non-technical researcher to deliver a plain-language research assistant agent.”
“Backend/data engineer who builds Python (FastAPI) data-processing API services for internal analytics/reporting, emphasizing modular architecture, async performance tuning, and reliability patterns (health checks, retries, observability). Also migrated legacy on-prem ETL pipelines to Azure using ADF/Data Lake/Functions and implemented a near-real-time ingestion flow with Event Hubs plus watermarking to handle late events and deduplication.”
Junior Data Scientist and Robotics Perception Engineer specializing in GenAI and autonomous systems
“Robotics software architect who built an automated pick-and-place palletizing prototype at BLACK-I-ROBOTICS, spanning perception (multi-RealSense fusion, segmentation, 6D pose, ICP), GPU-accelerated motion planning (MoveIt 2 + NVIDIA cuRobo), grasp generation, and safety (human detection + safe mode). Also brings cloud/CI/CD depth from VERIDIX AI (AWS Cognito/Lambda/ECS and CodePipeline stack) and demonstrated strong debugging chops by reducing outdoor rover EKF drift to ~5 cm via Allan variance-based IMU tuning.”
Mid-level AI Engineer & Data Scientist specializing in LLMs, RAG, and multimodal systems
“LLM/GenAI engineer who built a production AI-powered credit risk policy summarization and compliance alerting platform at HCL Tech, focused on factual accuracy and auditability for a financial client. Implemented a multi-retriever LangChain RAG architecture with citations-only prompting, fallback agents, and human-in-the-loop legal review—cutting manual review time by 35% and scaling to 12 teams.”
Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions
“AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail MAVEN dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.”
Mid-level Data Scientist specializing in GenAI, LLM-to-SQL, and analytics platforms
“LLM/agentic AI builder who led end-to-end integration of an LLM system into a business intelligence product, creating a scalable, metadata-driven RAG/agent pipeline with an orchestrator that routes queries to specialized agents (including DB-backed quantitative querying). Also built an LLM-to-SQL chatbot and partnered with non-technical stakeholders to capture domain context and improve SQL generation, using automated LLM-based testing to evaluate reliability.”
Mid-level Data Scientist specializing in NLP and predictive modeling
“AI/ML practitioner in healthcare/insurance (Blue Cross Blue Shield) who built and deployed a production NLP system to classify patient risk from unstructured clinical notes. Experienced in end-to-end pipeline orchestration (Airflow, AWS Step Functions/Lambda/SageMaker) and real-time optimization (BERT to DistilBERT on AWS GPUs), with strong clinician collaboration to drive adoption.”
Junior Data & Insights Analyst specializing in BI, dashboards, and automation
“Worked on taking an LLM-based system at Soundmakr from prototype to production by adding prompt constraints, validation/guardrails, deterministic ranking, and robust logging/monitoring with feedback loops. Also partnered with product/marketing during an internship on Thea: Study Smart to analyze onboarding drop-offs and run A/B tests on AI-driven flows, translating results into actions that improved retention and conversion.”
Mid-level Data Engineer specializing in cloud data platforms and AI agents
“Data/backend engineer who has owned end-to-end merchant analytics systems on AWS: orchestrated multi-source ingestion (Fiserv/Shopify/Clover) with Step Functions/Lambda, enforced strong data quality gates, and served curated datasets via Redshift and a FastAPI layer. Also built an early-stage Merchant Insights AI agent that converts natural language questions into SQL using OpenAI models, with full CI/CD and observability.”