Reval Logo
Home Browse Talent Skilled in Databricks

Vetted Databricks Professionals

Pre-screened and vetted.

DatabricksPythonSQLDockerAWSCI/CD
TK

Tejaswi Kothapalli

Screened

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and Conversational AI

3y exp
AetnaIndiana Tech

“Built a production RAG-based GenAI copilot backend at Aetna using Python/FastAPI, GPT-4, LangChain, and Azure AI Search, deployed on AKS with Prometheus/Grafana observability. Owned the system end-to-end (ingestion through deployment) and improved peak-time reliability by addressing vector search and embedding bottlenecks with Redis caching, index optimization, and async processing, plus added anti-hallucination guardrails via retrieval confidence thresholds.”

AgileAmazon SageMakerApache SparkAWSAWS LambdaAzure DevOps+165
View profile
CM

Chris Marcus

Screened

Executive CTO & AI Architect specializing in regulated SaaS (InsurTech/Healthcare/FinTech)

Remote15y exp
agentCanvas.aiUniversity of Texas at Austin

“Insurance-tech CTO and repeat founder with 10+ years in insurance startups; was employee #4/CTO at Polly (formerly DealerPolicy) and helped scale it from a PowerPoint to 250 employees while raising $180M+. Currently building and selling AgentCanvas.ai—an extensible AI accelerator platform for large insurance agencies—after coding the product end-to-end and now running demos/POCs with prospective buyers.”

Generative AILangChainLangGraphMLOpsMachine LearningNeural Networks+99
View profile
ML

Maurice Lange

Screened

Executive Technology & Data Leader specializing in cloud platforms, AI/ML, and enterprise data

Tampa, FL35y exp
HigherEchelonRotterdam School of Management, Erasmus University

“Former PwC Director with hands-on early-stage venture experience (e.g., BridgeLights, a big-data analytics concept for early fintech) spanning concept creation, platform architecture, and go-to-market experimentation. Strong focus on building scalable, modular data platforms with rigorous governance/compliance (data lineage, quality controls) and supporting technical diligence in investor-aligned environments.”

LeadershipStrategic PlanningRisk ManagementProcess ImprovementProject ManagementCI/CD+139
View profile
JH

John Hoffman

Screened

Senior Data Engineer specializing in Databricks, Spark, and AWS for government healthcare data systems

Windsor Mill, MD12y exp
GDITUniversity of Virginia

“Python/AWS engineer focused on batch-processing and data workflows, including building reusable S3/boto3 utilities with reliability features and IAM-based auth. Has led low-risk legacy modernizations using parity testing plus a month of parallel production runs, and has owned production issues end-to-end (including fixing a client-side Excel macro) while contributing to significant AWS cost reductions (~$10k/month).”

PythonSQLBashDatabricksApache SparkPySpark+66
View profile
GB

Geetha Bommareddy

Screened

Mid-level AI/ML Engineer specializing in fraud detection and risk analytics in Financial Services

USA5y exp
JPMorgan ChaseTrine University

“At JP Morgan Chase, built and deployed a production LLM-powered RAG knowledge assistant to help fraud investigators and risk analysts quickly navigate regulatory updates and internal policies, reducing investigation delays and compliance risk. Strong focus on secure retrieval (RBAC filtering), reliability (layered testing + observability), and production constraints (latency/SLOs), with Airflow-orchestrated, auditable ML pipelines.”

Amazon EC2Amazon EKSAmazon RedshiftAmazon S3Amazon SageMakerAnomaly Detection+159
View profile
SK

Santhosh Kumar

Screened

Mid-level GenAI/ML Engineer specializing in LLM agents and RAG for Financial Services & Healthcare

5y exp
Bank of AmericaVirginia Commonwealth University

“Built and deployed a production GenAI internal support agent at Bank of America (“Ask GPS/AskGPT”) using RAG on Azure, focused on reducing escalations and improving response quality for repetitive knowledge-based queries. Demonstrates strong production LLM engineering: custom LangChain orchestration, retrieval tuning to reduce hallucinations, rigorous offline/online evaluation, and model benchmarking with dynamic routing (e.g., GPT-4 vs Claude).”

AWSAWS LambdaCI/CDClaudeDatabricksDecision Trees+97
View profile
YP

Yash Pise

Screened

Mid-level Data Scientist specializing in Generative AI, LLMOps, and clinical data pipelines

5y exp
NovartisStevens Institute of Technology

“LLM/RAG engineer who has built and deployed corporate-scale systems at Novartis and Johnson & Johnson, including a healthcare AI agent that generates day-to-day treatment schedules. Recently handled a high-stakes safety incident (LLM suggesting overdose) by tightening model instructions and validating with ~200 test prompts, and has strong end-to-end data/embedding/vector DB pipeline experience (PySpark, FAISS, Pinecone) plus SME-in-the-loop evaluation (RLHF).”

PythonRJavaScriptMySQLPostgreSQLNumPy+88
View profile
AR

Ashwin Ram

Screened

Junior Data Scientist specializing in Generative AI and applied machine learning

Dayton, OH1y exp
Evoke TechnologiesUniversity of Chicago

“At Evoke Tech, built a production LLM "Testbench" to quickly compare LLMs/embedding models and RAG strategies (semantic, hybrid BM25, re-ranking, HyDE, query expansion) to select optimal architectures for different client needs. Also developed a multi-agent, multimodal (voice/text) RAG system for live catalog retrieval and safe product recommendations using LangGraph/LangChain with LangSmith monitoring, and regularly translated PM/UX goals into concrete agent behaviors via demos and flowcharts.”

PythonSQLRPandasNumPyScikit-learn+62
View profile
EL

Ethan Lam

Screened

Junior Software Engineer specializing in data platforms and full-stack development

Toronto, Ontario3y exp
Warner Music GroupUniversity of Toronto

“Software engineer with Warner Music Group experience owning and shipping analyst-facing data products (marketing/streaming data dashboards) end-to-end with high adoption through continuous stakeholder feedback. Also builds side projects with TypeScript/React and domain-driven API design, emphasizing flexibility (including swapping databases mid-development) and pragmatic microservices reliability patterns (logging, timeouts, retry backoff).”

PythonJavaSQLScalaJavaScriptTypeScript+72
View profile
PD

Pooja Dokuri

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and cloud MLOps

Remote, USA4y exp
UnitedHealth GroupEast Texas A&M University

“Built and deployed a production LLM + vector search clinical decision support system at UnitedHealth Group, retrieving medical evidence and patient context in real time for prior authorization and risk scoring. Strong in end-to-end RAG architecture (Hugging Face embeddings, Pinecone/FAISS, SageMaker, Redis) plus orchestration (Airflow/Kubeflow) and rigorous evaluation/monitoring, with demonstrated ability to align solutions with clinical operations stakeholders.”

PythonPandasNumPyPySparkScikit-learnSQL+133
View profile
UJ

Utkarsh Joshi

Screened

Senior Data Scientist specializing in ML, NLP, and GenAI analytics

Remote, US7y exp
University of MinnesotaUniversity of Minnesota

“Built and deployed an LLM-powered analytics assistant enabling business users to ask questions in plain English and receive validated Spark SQL executed in Databricks, with a Streamlit/Flask UI. Addressed strict client schema-privacy constraints by implementing a RAG strategy and ultimately leveraging AWS Bedrock and fine-tuned reference docs. Also has production ML pipeline experience using Docker + Airflow and AWS (S3/ECS/EC2) for financial classification models.”

PythonPandasNumPyScikit-learnRSQL+107
View profile
DD

Dhyey Desai

Screened

Intern AI/ML Engineer specializing in RAG, multimodal AI, and LLM systems

Los Angeles, California0y exp
NalaUSC

“Built and shipped 'PetPulse,' a production AI pet-health note system that records voice notes, transcribes them, converts transcripts into structured symptom/event data, and supports grounded Q&A over a user’s notes and vet PDFs. Demonstrates full-stack LLM product execution (FastAPI + GPT-4 + Firebase), with concrete reliability/performance work (async endpoints, caching, RAG/embeddings, function calling) and user-centered iteration with a non-technical product stakeholder.”

Apache HadoopBERTCCachingData VisualizationDatabricks+87
View profile
HK

Harini Kv

Screened

Mid-level AI/ML Engineer specializing in GenAI, NLP, and MLOps

Dallas, TX7y exp
EquinixFitchburg State University

“GenAI/data engineering practitioner with production experience across Equinix, Optum, and Citibank—built an Azure OpenAI (GPT-4) + LangChain document intelligence platform processing 1.5M+ docs/month and a HIPAA-compliant Airflow healthcare pipeline handling 5M+ claims/day. Also delivered a real-time fraud detection + explainability system using LightGBM and a fine-tuned T5 NLG component, improving fraud accuracy by 15%+ while partnering closely with compliance stakeholders.”

PythonSQLPySparkBashJavaJavaScript+169
View profile
ES

Edin Samuel Joselyn Chandrakumar

Screened

Senior Engineering Manager specializing in cloud platforms and risk systems

16y exp
Capital OneGovernment College of Technology, Coimbatore

“Engineering leader who proposed and delivered a new API-based document management platform to replace a vendor-dependent system, improving latency by ~1s and availability to 99.9% while migrating legacy data. Also drove Python-based automation of ~12 workflows via third-party API integrations and led an SSO/auth integration focused on backward compatibility and high login success rates.”

A/B TestingAgileAmazon CloudWatchAmazon DynamoDBAmazon ECSAmazon RDS+88
View profile
SK

Supreetha Kashyap

Screened

Mid-Level Software Engineer & Data Analyst specializing in cloud analytics and BI

Jacksonville, FL4y exp
Johnson & JohnsonUniversity of Texas at Arlington

“Built and owned an end-to-end Seat Allocation & Management System at Accenture, replacing a legacy process with a scalable web app used across teams. Deep focus on reliability under concurrency (transactions + unique constraints + idempotent APIs) and on Postgres performance tuning (composite indexes, EXPLAIN ANALYZE), plus post-launch production support and monitoring.”

PythonSQLJavaJavaScriptHTMLCSS+77
View profile
UC

Uday Chilakala

Screened

Mid-level Machine Learning Engineer specializing in NLP, computer vision, and RAG systems

Atlanta, GA5y exp
Morgan StanleyKennesaw State University

“Machine learning/NLP engineer who built a production-oriented retrieval-based AI system at Morgan Stanley for healthcare use cases, combining RAG over unstructured patient records with deep-learning medical image segmentation (U-Net/Mask R-CNN). Strong in end-to-end pipelines and MLOps (Spark/MongoDB, AWS SageMaker, CI/CD, monitoring, automated retraining) and in entity resolution/data quality validation for noisy clinical data.”

PythonSQLFlaskApache SparkgRPCTensorFlow+125
View profile
PC

Prasanna Chelliboyina

Screened

Mid-level Machine Learning Engineer specializing in forecasting, NLP, and GenAI

United States6y exp
WalgreensSyracuse University

“GenAI/ML engineer with production experience building multilingual LLM systems (English/Spanish) and RAG-based clinical documentation summarization at Walgreens, combining prompt engineering, structured output validation, and rigorous evaluation (ROUGE + pharmacist review). Also orchestrated end-to-end ML pipelines for demand forecasting using Apache Airflow, PySpark, and MLflow with scheduled retraining and production monitoring.”

A/B TestingAgileAnomaly DetectionApache SparkAWSAzure Machine Learning+114
View profile
RB

Ruthvik Bacha

Screened

Mid-level Data Engineer specializing in financial data pipelines and reliability

North Carolina, USA7y exp
Wells FargoUniversity of South Florida

“Systems/robotics-oriented software engineer focused on real-time orchestration and reliability: built a central control layer coordinating multiple concurrent agents with safe state machines, failure isolation, and recovery. Has hands-on ROS/ROS 2 integration experience in simulation (DDS/QoS, lifecycle, nodes in Python/C++) and emphasizes observability (structured JSON logs, correlation IDs) and low-latency control-loop performance under load.”

PythonDistributed systemsState managementDockerContainerizationDebugging+85
View profile
AM

Ajay Madhusudhan Thumala

Screened

Junior Software Engineer specializing in data engineering and LLM applications

Irvine, CA1y exp
GeisingerUC Irvine

“Computer science engineer and master’s graduate who independently built a mechatronics-heavy capstone prototype: a smartphone concept for deafblind users using micro-actuator arrays for braille reading. Also has platform engineering experience at Quantiphi, deploying webhooks to Kubernetes and implementing GitOps-based CI/CD using AWS CodeCommit/CodeBuild and ECR.”

API DevelopmentAPI GatewayAWSBashCC+++206
View profile
SG

Shruti Gaikwad

Screened

Mid-Level Software Engineer specializing in secure cloud microservices and FinTech

Remote, USA4y exp
BrexSyracuse University

“Built and owned major parts of a real-time distributed AI fraud-detection pipeline (ingestion, inference microservice integration, and automated action layer), optimizing latency and observability and reducing false positives by ~35%. Understands ROS/ROS2 concepts (nodes/topics/services) and planned hands-on ramp-up via ROS2 pub/sub exercises and Gazebo simulation, but has not worked on physical robots or ROS in production.”

Amazon API GatewayAmazon CloudWatchAmazon EKSAmazon SNSAnsibleAngular+220
View profile
HK

Harshitha Kotari

Screened

Mid-level Data/ML Engineer specializing in NLP, GenAI, and scalable data pipelines

5y exp
AbbottClarkson University

“AI/ML engineer with production experience building LLM-powered document intelligence and customer support systems in healthcare/insurance, emphasizing high-accuracy RAG, long-document processing, and robust monitoring/fallback mechanisms. Also automates and scales ML lifecycle workflows using Apache Airflow and Kubeflow, and partners closely with non-technical operations stakeholders to drive adoption.”

PythonRSQLJavaMATLABHTML+148
View profile
SD

Sanjana Duvva

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

5y exp
Wells FargoUniversity of North Texas

“Built and deployed an AWS-based LLM/RAG ticket triage and knowledge retrieval system (Pinecone/FAISS + Step Functions + MLflow) that cut support resolution time by 20%. Demonstrates strong production focus on hallucination reduction, PII security, and low-latency orchestration, with measurable evaluation improvements (e.g., ~25% grounding accuracy gain via re-ranking) and proven collaboration with support operations stakeholders.”

PythonSQLJavaScalaShell ScriptingTypeScript+153
View profile
HG

Harshavardhan Garikala

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

NJ, USA4y exp
Red HatOklahoma Christian University

“Red Hat ML/LLM engineer who designed and deployed a production LLM-powered customer support automation system using RAG, improving latency by 30% via PEFT and vector search optimization. Built security and governance into retrieval (access-level filtering, encrypted Pinecone/ChromaDB) and delivered SHAP-based explainability via a dashboard for non-technical stakeholders. Experienced orchestrating distributed ML/RAG pipelines across AWS SageMaker and OpenShift with Airflow/Prefect, plus multi-agent workflows using CrewAI and LangGraph.”

PythonPySparkSQLTensorFlowPyTorchHugging Face+127
View profile
SM

Subhasmita Maharana

Screened

Mid-level Data Scientist specializing in NLP/LLMs, time series forecasting, and MLOps

New York, NY6y exp
CitigroupKent State University

“Data/ML practitioner with hands-on experience building NLP systems from prototype to production: delivered a Twitter sentiment classifier with robust preprocessing, SVM modeling, and Power BI reporting, and built entity-resolution pipelines for messy multi-source customer data (reporting ~95% improvement in unique entity identification). Also implemented semantic linking/search using SBERT embeddings with FAISS vector retrieval and domain fine-tuning (reported ~15% precision lift), and applies production workflow best practices (Airflow/Prefect, Docker, Azure ML/Databricks, Great Expectations).”

A/B TestingApache AirflowAzure Machine LearningBERTCI/CDClustering+170
View profile
1...131415...53

Related

Machine Learning EngineersData ScientistsSoftware EngineersData EngineersAI EngineersData AnalystsAI & Machine LearningEngineeringData & AnalyticsExecutive & Leadership

Need someone specific?

AI Search