Vetted Databricks Professionals

Pre-screened and vetted.

HK

Mid-level Data/ML Engineer specializing in NLP, GenAI, and scalable data pipelines

5y exp
AbbottClarkson University

AI/ML engineer with production experience building LLM-powered document intelligence and customer support systems in healthcare/insurance, emphasizing high-accuracy RAG, long-document processing, and robust monitoring/fallback mechanisms. Also automates and scales ML lifecycle workflows using Apache Airflow and Kubeflow, and partners closely with non-technical operations stakeholders to drive adoption.

View profile
HG

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

NJ, USA4y exp
Red HatOklahoma Christian University

Red Hat ML/LLM engineer who designed and deployed a production LLM-powered customer support automation system using RAG, improving latency by 30% via PEFT and vector search optimization. Built security and governance into retrieval (access-level filtering, encrypted Pinecone/ChromaDB) and delivered SHAP-based explainability via a dashboard for non-technical stakeholders. Experienced orchestrating distributed ML/RAG pipelines across AWS SageMaker and OpenShift with Airflow/Prefect, plus multi-agent workflows using CrewAI and LangGraph.

View profile
SM

Mid-level Data Scientist specializing in NLP/LLMs, time series forecasting, and MLOps

New York, NY6y exp
CitigroupKent State University

Data/ML practitioner with hands-on experience building NLP systems from prototype to production: delivered a Twitter sentiment classifier with robust preprocessing, SVM modeling, and Power BI reporting, and built entity-resolution pipelines for messy multi-source customer data (reporting ~95% improvement in unique entity identification). Also implemented semantic linking/search using SBERT embeddings with FAISS vector retrieval and domain fine-tuning (reported ~15% precision lift), and applies production workflow best practices (Airflow/Prefect, Docker, Azure ML/Databricks, Great Expectations).

View profile
NM

Mid-level Data Engineer specializing in Analytics & AI/ML

Virginia, USA6y exp
SonyFitchburg State University

Data engineer with experience at Sony and Walmart building high-volume, near-real-time analytics and ingestion systems. Has owned end-to-end pipelines from Kafka/Spark streaming through S3/Parquet and Redshift/Looker, emphasizing data quality (Great Expectations), observability (CloudWatch/Azure Monitor), and reliability (Airflow SLAs, retries, checkpointing), including measurable performance and latency improvements.

View profile
BS

Senior Data Engineer specializing in cloud lakehouse platforms and streaming analytics

Pittsburgh, PA8y exp
First National BankTexas A&M University-Corpus Christi

Data engineer focused on fraud and banking analytics who has owned end-to-end batch + streaming pipelines at very large scale (hundreds of millions of records/day). Built robust data quality/observability layers (schema validation, anomaly detection, alerting) and delivered low-latency serving via AWS Lambda/API Gateway with DynamoDB + Redis, plus external data ingestion/scraping pipelines orchestrated in Airflow with anti-bot protections.

View profile
Ajay Madhusudhan Thumala - Junior Software Engineer specializing in data engineering and LLM applications in Irvine, CA

Junior Software Engineer specializing in data engineering and LLM applications

Irvine, CA1y exp
GeisingerUC Irvine

Computer science engineer and master’s graduate who independently built a mechatronics-heavy capstone prototype: a smartphone concept for deafblind users using micro-actuator arrays for braille reading. Also has platform engineering experience at Quantiphi, deploying webhooks to Kubernetes and implementing GitOps-based CI/CD using AWS CodeCommit/CodeBuild and ECR.

View profile
Ruthvik Bacha - Mid-level Data Engineer specializing in financial data pipelines and reliability in North Carolina, USA

Ruthvik Bacha

Screened

Mid-level Data Engineer specializing in financial data pipelines and reliability

North Carolina, USA7y exp
Wells FargoUniversity of South Florida

Systems/robotics-oriented software engineer focused on real-time orchestration and reliability: built a central control layer coordinating multiple concurrent agents with safe state machines, failure isolation, and recovery. Has hands-on ROS/ROS 2 integration experience in simulation (DDS/QoS, lifecycle, nodes in Python/C++) and emphasizes observability (structured JSON logs, correlation IDs) and low-latency control-loop performance under load.

View profile
Sanjana Duvva - Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

Sanjana Duvva

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

5y exp
Wells FargoUniversity of North Texas

Built and deployed an AWS-based LLM/RAG ticket triage and knowledge retrieval system (Pinecone/FAISS + Step Functions + MLflow) that cut support resolution time by 20%. Demonstrates strong production focus on hallucination reduction, PII security, and low-latency orchestration, with measurable evaluation improvements (e.g., ~25% grounding accuracy gain via re-ranking) and proven collaboration with support operations stakeholders.

View profile
Jhansi Bendi - Senior Software Engineer specializing in cloud-native microservices and event-driven systems in Antioch, CA

Jhansi Bendi

Screened

Senior Software Engineer specializing in cloud-native microservices and event-driven systems

Antioch, CA18y exp
SephoraRashtriya Sanskrit Sansthan

Senior engineer/tech lead with 18+ years building large-scale distributed applications, specializing in performance and reliability improvements. Recently owned multiple apps on an email personalization team, shipping major optimizations (including a push-update feature and audience-count architecture redesign) that reportedly lifted system performance from ~50% to ~99% while also leading code standards, reviews, and mentoring.

View profile
DM

Mid-level Data Engineer specializing in real-time analytics and regulated domains

NC, USA5y exp
JPMorgan ChaseSaint Louis University

Data platform engineer focused on large-scale, real-time fraud systems, with hands-on ownership of streaming architectures using Kafka, Spark, Snowflake, and Databricks. Stands out for combining performance tuning and platform automation with LLM/RAG-based enrichment, delivering measurable gains in latency, fraud accuracy, false positives, and analyst decision speed.

View profile
AJ

Mid-level AI/ML Engineer specializing in generative AI, NLP, and MLOps

San Jose, CA4y exp
ServiceNowUniversity of North Carolina at Charlotte

ML/AI engineer with hands-on ownership of production GenAI and computer vision systems, spanning experimentation, deployment, monitoring, and iterative optimization. Stands out for shipping an enterprise RAG platform that cut manual review by 50% and a defect detection pipeline that reduced report generation from 15 minutes to under 1 second while maintaining high uptime and strong operational discipline.

View profile
LJ

Senior Full-Stack Engineer specializing in FinTech and cloud platforms

New York Metropolitan Area, NY8y exp
State StreetVanderbilt University

State Street engineer who identifies operational pain points and turns them into high-impact internal platforms, including a service-health monitoring system and a Databricks log standardization pipeline used by 200+ users. Also experiments with practical LLM workflows, having built a Claude-based AI host that dramatically reduced facilitation time for a growing book club.

View profile
VJ

Vedant Jagtap

Screened

Junior AI/NLP Engineer specializing in LLM systems and applied research

New York, NY2y exp
NYU’s Center for Social Media, AI, and PoliticsNYU

LLM/agent engineer who shipped a two-stage AI recruitment screening platform at Foursquare that automated resume ingestion through behavioral assessment, delivering an 85% reduction in screening time across 5,000+ applications with auditability and confidence-gated decisions. Also built a multi-agent benchmarking framework using MCP tool interfaces and a RAGAS + LangSmith evaluation/observability stack, including async re-architecture that cut production latency by 50%.

View profile
BR

Mid-Level Software Engineer specializing in cloud-native distributed systems

Sunnyvale, CA5y exp
WalmartArizona State University

Backend/platform engineer who has built and run production Python/Flask + Kafka microservices processing RFID and camera/RFID fusion streams for near-real-time retail cart updates at ~4–5M events/day. Strong in reliability/performance debugging (p99 latency, Kafka lag, Cosmos DB RU hot partitions) with measurable impact including ~30% database cost reduction, and has also shipped an end-to-end vulnerability scanning workflow with DynamoDB-backed state, idempotency, and robust retry/verification guardrails.

View profile
SB

Mid-level Data Engineer specializing in scalable pipelines, Spark, and cloud data warehousing

Boston, USA3y exp
Fidelity InvestmentsNortheastern University

Backend/data platform engineer who recently owned an end-to-end large-scale financial data platform delivering real-time decision support for finance and operations. Has hands-on experience modernizing legacy batch pipelines into AWS cloud-native ELT with parallel-run cutovers, strong data quality controls (dbt-style tests, reconciliation), and measurable improvements in runtime, cost, and SLA compliance. Also builds scalable, secure FastAPI microservices using Docker, ALB-based horizontal scaling, Redis caching, and managed auth with Cognito/Supabase plus Postgres RLS.

View profile
JS

Jash Shah

Screened

Mid-level Data Scientist specializing in LLMs, MLOps, and predictive analytics in healthcare and finance

New Jersey, USA4y exp
Johnson & JohnsonStevens Institute of Technology

Built and deployed a production LLM/RAG clinical decision support system that enables real-time semantic search over unstructured EHR notes and delivers patient risk insights. Strong in healthcare-grade MLOps and compliance (HIPAA, PHI handling, encryption, RBAC, audit logs) and scaled embedding/retrieval pipelines using Spark/Databricks and Airflow. Partnered with clinicians via Power BI dashboards and explainability, contributing to an 18% reduction in patient readmissions.

View profile
PE

Mid-Level Software Engineer specializing in distributed systems and cloud-native backends

Dallas, USA5y exp
T-MobilePurdue University

AI/LLM engineer with production experience at Charles Schwab building a RAG-based assistant to help 5,000+ reps answer complex financial policy questions. Implemented a multi-layer anti-hallucination approach (GNN-driven ontology/graph retrieval + citation-only answers) and compliance-focused guardrails (Azure AI Content Safety) in partnership with audit/compliance stakeholders.

View profile
KF

Kevin Fang

Screened

Intern Software Engineer specializing in full-stack and data systems

Beverly Hills, CA1y exp
Alo YogaUC Irvine

Software developer with healthcare operations experience at Epic Systems (Referrals & Authorizations), delivering customer-facing tooling to speed manual insurance authorization/denial documentation and support future automation. Also supported an HRIS migration to Workday at Aloe Yoga, solving legacy ID interoperability via scripting and mapping, and demonstrates strong production debugging and test-driven maintainability practices.

View profile
SS

Mid-level Data Engineer specializing in real-time pipelines and cloud analytics

Chicago, IL5y exp
JPMorgan ChaseUniversity of South Dakota

Researcher from the University of South Dakota who built a production medical RAG system to help interpret model predictions by retrieving relevant clinical notes and medical literature, overcoming retrieval accuracy and imaging-dataset challenges through semantic chunking and metadata-driven indexing. Also has hands-on orchestration experience with Airflow and Azure Data Factory, plus a pragmatic approach to LLM evaluation and stakeholder-driven iteration.

View profile
GJ

Mid-level Machine Learning Engineer specializing in MLOps, NLP, and Computer Vision

USA5y exp
WalmartUniversity of New Haven

ML/AI engineer with production experience across retail and healthcare: built a real-time computer-vision shelf monitoring system at Walmart and optimized edge inference latency by ~30% using TensorRT/ONNX and pruning. Also partnered with CVS Health clinical/pharmacy teams to deliver a medication-adherence predictive model, using Streamlit explainability dashboards and achieving an 18% adherence improvement.

View profile
DM

Mid-level Generative AI Engineer specializing in decision intelligence and RAG for regulated enterprises

5y exp
JPMorgan ChaseSaint Louis University

Healthcare GenAI engineer who built a HIPAA-compliant, auditable RAG-based claims decision support system at Molina Healthcare, processing 3M claims and delivering major impact (48% faster manual reviews, 43% higher decision accuracy). Deep hands-on experience with LangChain orchestration, vector search (ChromaDB/FAISS), embedding fine-tuning, and safety controls (confidence scoring, rule validation, human-in-the-loop escalation) for clinical workflows.

View profile
DK

Senior Data Engineer specializing in Azure Lakehouse, Databricks/Spark, and Snowflake

Richardson, TX6y exp
PwCUniversity of Central Missouri

Data engineer/platform builder with experience across PwC and Liberty Mutual delivering high-volume, production-grade pipelines and real-time data services. Has owned end-to-end streaming + batch architectures on AWS and Azure, including web scraping systems, with quantified reliability gains (99.9% availability, 90%+ error reduction, 30% latency reduction) and strong observability/CI-CD practices.

View profile
Vaibhav Sharma - Mid-level Software Engineer specializing in AI/ML and data platforms in Remote, USA

Mid-level Software Engineer specializing in AI/ML and data platforms

Remote, USA5y exp
GoogleIndiana University Bloomington

AI/ML engineer who built a production agentic system to automate computational research experiments (simulation execution, parameter exploration, and numerical analysis) and mitigated context-window failures using constrained tool-calling/prompt-chaining patterns in LangChain with OpenAI tool-enabled models. Also has adtech/big-data pipeline experience at InMobi, orchestrating Spark jobs in Airflow to filter bot-like user IDs and publish clean IDs to an online NoSQL store for live serving, plus Apache open-source collaboration experience.

View profile
Harshavardhan Reddy - Mid-level AI/ML Data Scientist specializing in NLP, computer vision, and risk analytics in Albany, NY

Mid-level AI/ML Data Scientist specializing in NLP, computer vision, and risk analytics

Albany, NY5y exp
Capital OnePace University

ML/AI engineer with Capital One experience building production-grade customer segmentation and fraud detection systems combining NLP (transformers) and anomaly detection. Strong MLOps and orchestration background (PySpark ETL, MLflow, Airflow, Docker/Kubernetes, Azure ML) with real-time monitoring/alerting and performance optimizations like quantization and caching, plus proven ability to deliver business-facing insights through Power BI/Tableau for marketing stakeholders.

View profile

Need someone specific?

AI Search