Vetted Databricks Professionals

Pre-screened and vetted.

YN

Mid-level Data Scientist specializing in ML, NLP, and Generative AI

Michigan, USA | 3y exp
Ally Financial | University of Michigan-Dearborn

GenAI/ML engineer with production experience at Cognizant and Ally Financial, building end-to-end LLM/RAG systems and ML pipelines. Delivered a domain chatbot trained on 90k tickets and 45k docs, improving intent accuracy (65%→83%), scaling to 800+ concurrent users with 99.2% uptime and sub-150ms latency, and driving +14% customer satisfaction. Strong in Azure ML + DevOps CI/CD, Dockerized deployments, and explainable/PII-safe modeling using SHAP/LIME to meet stakeholder-trust and GDPR requirements.

AB

Senior Data & Platform Engineer specializing in cloud-native streaming and distributed systems

USA | 10y exp
JPMorgan Chase | New York Institute of Technology

Financial data engineer who has built and operated high-volume batch + streaming pipelines (200–300 GB/day; 5–10k events/sec) using AWS, Spark/Delta, Airflow, Kafka, and Snowflake, with strong emphasis on data quality and reliability. Demonstrated measurable impact via 99.9% SLA adherence, major reductions in bad records/nulls, MTTR improvements, and significant latency/runtime/query performance gains; also built a distributed web-scraping system processing 5–10M records/day with anti-bot and schema-drift defenses.

AG

Mid-level Data Engineer specializing in cloud ETL and real-time streaming

New York, NY | 6y exp
PNC | Rochester Institute of Technology

Data engineer focused on AWS + Spark/Databricks pipelines, including an end-to-end nightly loan-data ingestion flow (~2.2M records) from Postgres/S3 through Glue and Databricks into a DWH with layered validation and alerting. Also built real-time streaming with Kafka + Spark Structured Streaming and a master’s project streaming Reddit data for sentiment analysis under ambiguous requirements and tight budget constraints.

Keerthana S

Screened

Mid-level Backend Software Engineer specializing in Python/FastAPI on AWS

Los Angeles, California | 4y exp
McKesson | University of North Texas

Backend engineer with healthcare domain experience building AI-driven radiology workflow systems. Evolved tightly coupled APIs into secure, reliable FastAPI-based services by moving heavy imaging/data processing into idempotent asynchronous pipelines with retries, feature-flagged incremental rollout, and strong data-integrity controls (constraints, backfills, validation). Strong focus on defense-in-depth security for sensitive patient data (OAuth2/JWT, RBAC, and database-level protections).

Axel Paredes

Screened

Mid-level Business Analyst specializing in operations data and reporting

Miami, FL | 6y exp
Cole, Scott & Kissane, P.A. | Miami Dade College

Candidate has hands-on project experience in healthcare analytics, using SQL, Python, and Power BI to analyze CMS hospital readmissions and HRRP penalty risk in Florida. Their work centers on turning messy CMS flat files into reporting-ready datasets, benchmarking hospitals against national references, and surfacing financial risk through dashboards.

Alekya Battu

Screened

Mid-level Data Scientist specializing in machine learning, MLOps, and cloud analytics

USA | 5y exp
Wells Fargo | Wilmington University

Senior data scientist with ~5 years’ experience building production ML/NLP systems in finance (Wells Fargo) and deep learning for sensor analytics in connected vehicles (Medtronic). Has delivered end-to-end platforms combining time-series forecasting with transformer-based NLP, including automated drift monitoring/retraining (MLflow + Airflow) and standardized Docker/CI/CD deployments; achieved a reported 22% precision improvement after domain fine-tuning.

TT

Mid-level AI/ML Engineer specializing in MLOps and LLM applications

New York, NY | 4y exp
BNY Mellon | University at Albany

BNY Mellon engineer who has built and operated production AI systems end-to-end: a LangChain/Pinecone RAG platform scaled via FastAPI + Kubernetes to 1000 RPM with 99.9% uptime, supported by monitoring and data-drift detection. Also deep in data/infra orchestration (Airflow, Dagster, Terraform on AWS/EMR/EC2), processing 500GB+ daily and delivering measurable reliability and performance gains, plus strong compliance-facing model explainability using SHAP and Tableau.

Lingyi Wu

Screened

Mid-level Financial/Data Analyst specializing in analytics, forecasting, and healthcare/MarTech data

Los Angeles, CA | 4y exp
MINISO | Westcliff University

Growth/creative marketer from Esleydunn Games who uses Google Analytics to integrate cross-channel performance data (TikTok, YouTube, LinkedIn, Facebook) and run structured A/B tests on video ad length and layout. Reported reducing CPA by 20 per customer when leveraging YouTube and TikTok, and improved CTR through CTA/button placement testing and ongoing user-feedback loops (forum/WeChat topics).

KG

Senior AI Engineer specializing in Agentic AI and distributed systems

Charlotte, NC | 4y exp
UnitedHealth Group | University of North Carolina at Charlotte

LLM/agentic workflow engineer with healthcare domain experience who built a HIPAA-compliant multi-agent RAG system for clinical review automation at UnitedHealth Group, achieving 92% precision and cutting latency 40% through async orchestration and Redis semantic caching. Also has strong data engineering orchestration background (Airflow on AWS EMR with Great Expectations) and a proven clinician-in-the-loop feedback process that improved model faithfulness by 18%.

HE

Mid-level AI/ML Engineer specializing in cloud data engineering and GenAI

Florida, USA | 6y exp
LexisNexis | University of South Florida

AI/LLM engineer with production experience in legal tech: built a GPT-4 + LangChain RAG summarization system at Govpanel that reduced legal case-file review time by 50%+. Previously at LexisNexis, orchestrated end-to-end Airflow data/AI pipelines processing 5M+ legal documents daily, improving ETL runtime by 35% with robust validation, monitoring, and SLAs.

SK

Mid-level Data Engineer specializing in cloud data platforms and real-time analytics

Saint Louis, MO | 5y exp
Cigna | Saint Louis University

Customer-facing data engineering professional who builds and deploys real-time reporting/dashboard solutions, gathering reporting and compliance requirements through direct stakeholder engagement. Experienced with Google Cloud IAM governance, secure integrations (encryption, audit logging), and fast production troubleshooting of ETL/pipeline failures with follow-on monitoring and automated recovery improvements; motivated by hands-on, travel-oriented customer work.

Archit Gangal

Screened

Senior Full-Stack Developer specializing in cloud-native microservices and AI/ML analytics

7y exp
Allstate | Colorado State University

Full-stack/backend engineer with deep insurance claims domain experience who built and operated a microservices + ETL platform (Java/Spring Boot + Python + Kafka/Databricks) processing 1M+ daily transactions. Combines production-grade reliability (99.7% uptime, zero-downtime blue/green releases, strong observability) with customer-facing UI delivery (AngularJS/React+TS dashboards and a hackathon-winning research chatbot).

Yash De

Screened

Intern Full-Stack Developer specializing in AI/LLM applications

San Jose, CA | 3y exp
Kingship AI | Stevens Institute of Technology

Backend-focused intern who built and refactored the backend for an LLM-driven gifting mobile app using FastAPI, tackling high-latency LLM + product-API workflows. Implemented async worker-pool/queue processing with Redis caching plus retries/fallbacks, cutting end-to-end suggestion latency from ~4–5 seconds to ~1 second while improving reliability and rollout safety via staged migrations and testing.

Brian Mar

Screened

Senior Data Engineer specializing in data infrastructure and marketing/CRM analytics

San Mateo, CA | 8y exp
Full Circle Insights | UC Davis

Salesforce-focused implementation/solutions engineer from Full Circle Insights who owned end-to-end campaign attribution and reporting deployments for multiple customers at once (3–5 concurrently), including sandbox testing, KPI monitoring, and rollback-safe migrations from legacy reporting. Also builds personal multi-agent workflows and uses Claude Code to rapidly scaffold data/analytics scripts like an advertising optimization parser over CSV/XLSX inputs.

CR

Senior Analytics and Business Intelligence professional specializing in e-commerce and digital analytics

8y exp
Nutrisystem | Campbellsville University

Analytics professional with hands-on experience unifying marketing-platform data through Fivetran and Snowflake, building reporting views, and catching source-to-report issues like timezone-driven spend discrepancies. They also owned subscription LTV/cohort analysis and engagement tracking initiatives, partnering with e-commerce, product, and senior leadership to turn behavioral and demographic data into dashboards, lead-qualification metrics, and lifecycle marketing insights.

DI

Mid-level Data Analyst specializing in financial risk and data automation

McLean, VA | 5y exp
Capital One | Florida International University

Analytics professional from Capital One with strong experience automating risk, reconciliation, and regulatory reporting workflows in financial services. They combine deep SQL/Python pipeline skills with stakeholder-facing dashboard and KPI design, delivering measurable impact like 30% performance gains, sub-24-hour anomaly detection, and 100% data integrity for regulatory filings.

CT

Mid-level AI Engineer specializing in LLMs, MLOps, and healthcare NLP

4y exp
HCA Healthcare | University of South Florida

Built a production, real-time clinical documentation system at HCA that converts doctor–patient conversations into structured clinical summaries using speech-to-text, LLM summarization, and RAG. Demonstrated measurable gains from medical-domain fine-tuning (clinical concept recall +18%, ROUGE-L 0.62 to 0.74) while meeting HIPAA constraints via PHI anonymization and encryption, and deployed via Docker/FastAPI with CI/CD and monitoring.

RK

Senior AI/ML Engineer specializing in LLMs, generative AI, and applied research

Boca Raton, FL | 10y exp
ModMed | Florida Atlantic University

Research-heavy ML/AI candidate with a PhD/publications background who translated LLM evaluation and clinical summarization techniques into production at ModMed. They owned an end-to-end healthcare GenAI pipeline that cut clinician documentation time from ~22 minutes to ~7-8 minutes, reduced token costs by ~30%, and built an internal evaluation framework later adopted by multiple teams.

Vimala Devi

Screened

Mid-level AI & Machine Learning Engineer specializing in FinTech

Texas, USA | 4y exp
The Hartford | University of Houston

ML/AI engineer with hands-on experience building production systems in financial services, including a real-time underwriting analytics platform at Hartford Financial Services. Stands out for combining classic ML, low-latency API deployment, monitoring, and emerging LLM/RAG design patterns, with measurable impact including 20% better decision accuracy, sub-200ms latency, and 5M+ records processed daily.

MY

Mid-level AI/ML Engineer specializing in Generative AI and RAG systems

6y exp
Elevance Health | MLR Institute of Technology

Built a production multi-agent orchestration platform to automate healthcare claims and HR workflows, combining LangChain/CrewAI/AutoGPT with RAG (FAISS/Pinecone) and fine-tuned open-source LLMs (LLaMA/Mistral/Falcon) in private Azure ML environments to meet HIPAA requirements. Emphasizes rigorous agent evaluation/observability (trajectory eval, adversarial testing, LLM-as-judge, drift monitoring) and reports measurable outcomes including 35% faster claims processing and 40% fewer chatbot errors.

VS

Senior AI/ML Engineer specializing in Generative AI, LLMs, and MLOps

Tampa, FL | 9y exp
Verizon | Jawaharlal Nehru Technological University

Telecom (Verizon) AI/ML practitioner who built a production multimodal system that ingests messy customer issue reports (calls, chats, emails, screenshots, videos) and turns them into confidence-scored incident summaries with reproducible steps and evidence links. Also built KPI/alarm-to-ticket correlation to rank likely root-cause domains (RAN/Core/Transport), cutting triage from hours to minutes and improving MTTR.

Dyuti Vartak

Screened

Junior Data Scientist/Data Engineer specializing in ML pipelines and analytics

Seattle, WA | 1y exp
Docsumo | University of Washington

Machine Learning Intern at Docsumo who delivered a customer-facing fraud-detection solution end-to-end: rebuilt the pipeline, deployed a Random Forest model, and shipped a Python/Flask microservice on AWS SageMaker. Drove measurable production impact (precision +30%, processing time cut in half, manual review -60%, customer satisfaction +15%) and demonstrated strong customer integration and live-incident response skills.

Harsha Sikha

Screened

Mid-level AI/ML Engineer specializing in Generative AI and data engineering

Armonk, New York | 4y exp
IBM | Saint Peter's University

IBM engineer who built and deployed a production RAG-based LLM assistant using LangChain/FAISS with a fine-tuned LLaMA model, served via FastAPI microservices on Kubernetes, achieving 99%+ uptime. Demonstrates strong practical expertise in reducing hallucinations (semantic chunking + metadata-driven retrieval) and managing latency, plus mature MLOps practices (Airflow/dbt pipelines, MLflow tracking, monitoring, A/B and shadow deployments) and effective collaboration with non-technical stakeholders.

Yun-Hao Lee

Screened

Junior Machine Learning Engineer specializing in LLM deployment and computer vision

Dallas, TX | 2y exp
Lab for Intelligent Storage and Computing | University of Texas at Dallas

Robotics/AI candidate who built an AI-driven landmark location tool during a summer internship at Mobile Drive, combining YOLOv5 object detection with OpenStreetMap-based geolocation to handle dense, cluttered urban environments. Also researched deploying LLM-based agents on constrained hardware using quantization plus LoRA/continuous learning, improving accuracy from ~80% to ~92%, with an emphasis on production logging for reliability.
