Reval Logo
Home Browse Talent Data & Analytics

Vetted Data & Analytics Professionals

Pre-screened and vetted.

PythonSQLTableauPower BIAWSDocker

Popular Titles

Data Scientist350+Data Engineer250+Data Analyst150+Machine Learning Engineer70+Software Engineer50+Research Assistant40+Business Analyst20+
NYC MetroBay AreaDFW MetroplexRemoteGreater BostonLos Angeles MetroDMVChicago MetroGreater HoustonGreater Philadelphia
HK

Harsha KeladiGanapathi

Screened

Intern Data Scientist specializing in robotics localization and SLAM

Lexington, KY1y exp
InfineonUniversity of New Haven

“Robotics/embodied-AI practitioner who built a TurtleBot3 LiDAR-fingerprint localization pipeline end-to-end (autonomous data collection + multi-head NN) achieving ~30 cm error in a 10x10 m space. Also has industry experience at Infineon building large-scale production data/AI pipelines and rapidly fixing a deployed recommendation system by correcting upstream data normalization, improving accuracy by 20%+.”

BashCC++Deep LearningGazeboGit+143
View profile
AM

Anirud Mohan

Screened

Intern AI/ML Software Engineer specializing in RAG and medical AI

Herndon, VA1y exp
CarinaAIUniversity of Maryland, College Park

“ML/LLM engineer with production experience building medical RAG systems to automate chart review, including retrieval + re-ranking and rigorous evaluation. Notably uncovered errors/bias in physician-curated ground truth by tracing answers back to source note chunks and presented evidence to an academic partner, accelerating deployment. Also built a RAG-based FAQ chatbot for a health insurance company and delivered it to non-technical stakeholders via demos.”

PythonJavaJavaScriptTypeScriptSQLFastAPI+77
View profile
AD

Atharva Deshmukh

Screened

Mid-level AI/ML Engineer specializing in GenAI and cloud MLOps

Rochester, New York4y exp
CrowdDoingRochester Institute of Technology

“Applied LLMs to high-stakes domains (wildfire risk for emergency teams and loan approval via a fine-tuned IBM Granite model), with a strong focus on reliability—using RAG-based cross-validation to reduce hallucinations and continuous ingestion pipelines (MODIS satellite imagery via AWS Lambda) to keep data current. Experienced in production orchestration and MLOps-style workflows using Airflow, AWS Step Functions, and SageMaker Pipelines, and collaborates closely with analysts on KPI-driven evaluation.”

PythonRSQLBashJavaJavaScript+90
View profile
NS

Nitin Shivakumar

Screened

Senior Data Scientist specializing in healthcare ML, LLMs, and responsible AI

Morris Plains, NJ4y exp
CignaUniversity at Buffalo

“Clinical data scientist who has built an agentic LLM-powered literature review assistant (with RAG-style storage/retrieval) to identify predictors for downstream predictive modeling. Also delivered a patient-focused progression analysis model using Databricks + Airflow orchestration, partnering closely with clinicians to define targets and validate that model insights aligned with clinical expectations.”

A/B TestingAWSClassificationComputer VisionDatabricksData Analysis+72
View profile
AC

Alexander Conn

Screened

Principal Data Scientist specializing in cybersecurity ML and MLOps

New York, NY15y exp
Beyond IdentityIowa State University

“ML/NLP engineer (Beyond Identity) who built production semantic search and entity-resolution systems over internal security documentation, using LDA + BERT embeddings with FAISS/Pinecone to cut search time by 30%. Also scaled a real-time anomaly detection pipeline to millions of events/day with Spark and AWS Lambda, with strong emphasis on measurable validation (Precision@k, MRR, F1, ARI).”

Machine LearningArtificial IntelligenceSupervised LearningUnsupervised LearningDeep LearningComputer Vision+118
View profile
SK

Sai Krishna Mallikanti

Screened

Mid-level AI & Data Scientist specializing in LLMs, RAG, and healthcare NLP

TN4y exp
CignaUniversity of Memphis

“Built a production LLM/RAG solution for healthcare operations teams to query large policy and care-guideline repositories in natural language. Improved domain alignment using vector retrieval plus parameter-efficient fine-tuning and prompt optimization, validated through internal user testing and metrics, cutting manual lookup time by ~40%. Also has hands-on experience orchestrating automated ML pipelines with Apache Airflow.”

A/B TestingAnomaly DetectionData ValidationDeep LearningFeature EngineeringGenerative AI+77
View profile
NC

Nikhil Chagi

Screened

Intern Data Analyst specializing in data pipelines and LLM/RAG applications

San Francisco, CA1y exp
CignaUniversity of North Texas

“Built and deployed LLM-powered analytics and reporting systems, including a RAG-based assistant over Snowflake that let business users ask questions in plain English instead of writing SQL. Experienced orchestrating LLM agents (LangChain) and serverless reporting pipelines (AWS Lambda/S3/RDS), with a strong focus on grounded outputs, monitoring/evaluation, and data quality—used daily by non-technical finance and operations teams at Cigna.”

Amazon EC2Amazon RDSAWSAWS LambdaAnalyticsAnomaly Detection+55
View profile
YS

Yash Sanap

Screened

Junior Data Scientist specializing in ML, geospatial analytics, and LLM applications

Virginia Beach, VA2y exp
City of Virginia BeachGeorge Mason University

“Built and deployed a production AI “term explainer” agent that adapts explanations to beginner/intermediate/expert users by combining multi-step LLM reasoning with grounded Wikipedia retrieval. Owns end-to-end agent orchestration (smolagents/Python), reliability patterns (fallback across LLM providers, retries, guardrails), and observability/metrics-driven evaluation; also partnered with a non-technical researcher to deliver a plain-language research assistant agent.”

PythonSQLJavaGoBashJavaScript+95
View profile
TP

Thilak P

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and big data pipelines

5y exp
W. R. BerkleySacred Heart University

“Backend/data engineer who builds Python (FastAPI) data-processing API services for internal analytics/reporting, emphasizing modular architecture, async performance tuning, and reliability patterns (health checks, retries, observability). Also migrated legacy on-prem ETL pipelines to Azure using ADF/Data Lake/Functions and implemented a near-real-time ingestion flow with Event Hubs plus watermarking to handle late events and deduplication.”

PythonSQLRCHTMLCSS+153
View profile
RP

Rupesh Pathak

Screened

Junior Data Scientist and Robotics Perception Engineer specializing in GenAI and autonomous systems

Boston, MA2y exp
VERIDIX AINortheastern University

“Robotics software architect who built an automated pick-and-place palletizing prototype at BLACK-I-ROBOTICS, spanning perception (multi-RealSense fusion, segmentation, 6D pose, ICP), GPU-accelerated motion planning (MoveIt 2 + NVIDIA CuRobo), grasp generation, and safety (human detection + safe mode). Also brings cloud/CI/CD depth from VERIDIX AI (AWS Cognito/Lambda/ECS and CodePipeline stack) and demonstrated strong debugging chops by reducing outdoor rover EKF drift to ~5 cm via Allan variance-based IMU tuning.”

PythonCC++MultithreadingSQLMATLAB+164
View profile
KR

Krishna Rajput

Screened

Mid-level AI Engineer & Data Scientist specializing in LLMs, RAG, and multimodal systems

Tempe, AZ5y exp
HCLTechArizona State University

“LLM/GenAI engineer who built a production AI-powered credit risk policy summarization and compliance alerting platform at HCL Tech, focused on factual accuracy and auditability for a financial client. Implemented a multi-retriever LangChain RAG architecture with citations-only prompting, fallback agents, and human-in-the-loop legal review—cutting manual review time by 35% and scaling to 12 teams.”

A/B TestingAnomaly DetectionAWS GlueAWS LambdaAzure Machine LearningCI/CD+126
View profile
NB

nitesh bommisetty

Screened

Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions

Tampa, FL4y exp
LumenUniversity of South Florida

“AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail Maven dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.”

PythonRSQLTensorFlowPyTorchKeras+123
View profile
GG

Gabriele Gobbi

Screened

Mid-level Data Scientist specializing in GenAI, LLM-to-SQL, and analytics platforms

Turin, Italy3y exp
Engineering Ingegneria InformaticaUniversity of Ferrara

“LLM/agentic AI builder who led end-to-end integration of an LLM system into a business intelligence product, creating a scalable, metadata-driven RAG/agent pipeline with an orchestrator that routes queries to specialized agents (including DB-backed quantitative querying). Also built an LLM-to-SQL chatbot and partnered with non-technical stakeholders to capture domain context and improve SQL generation, using automated LLM-based testing to evaluate reliability.”

PythonMachine LearningScikit-LearnTensorFlowPyTorchLarge Language Models (LLMs)+51
View profile
SP

Santhoshi Priya Sunchu

Screened

Mid-level Data Scientist specializing in NLP and predictive modeling

Massachusetts, USA5y exp
Blue Cross Blue Shield of MassachusettsUniversity of Massachusetts Dartmouth

“AI/ML practitioner in healthcare/insurance (Blue Cross Blue Shield) who built and deployed a production NLP system to classify patient risk from unstructured clinical notes. Experienced in end-to-end pipeline orchestration (Airflow, AWS Step Functions/Lambda/SageMaker) and real-time optimization (BERT to DistilBERT on AWS GPUs), with strong clinician collaboration to drive adoption.”

PythonSQLRNumPyPandasScikit-learn+147
View profile
AM

Asanti Mokwala

Screened

Junior Data & Insights Analyst specializing in BI, dashboards, and automation

Remote3y exp
CanvaSan José State University

“Worked on taking an LLM-based system at Soundmakr from prototype to production by adding prompt constraints, validation/guardrails, deterministic ranking, and robust logging/monitoring with feedback loops. Also partnered with product/marketing during an internship on Thea: Study Smart to analyze onboarding drop-offs and run A/B tests on AI-driven flows, translating results into actions that improved retention and conversion.”

SQLPythonMicrosoft ExcelTableauPower BIGoogle Analytics+53
View profile
DG

Dimple Galla

Screened

Mid-level Data Scientist / AI-ML Engineer specializing in RAG, MLOps, and real-time analytics

Lawrence, KS4y exp
PaycomUniversity of Kansas

“Software/ML engineer who built a production automated job-finding and cold-email personalization system for Fortune 500 outreach, using JobSpy for dynamic scraping, LangChain orchestration, and LLM+vector DB semantic search with grounding/relevance metrics and guardrails. Also delivered a predictive investment analytics platform for financial advisors, communicating results via Tableau dashboards and portfolio KPIs like Sharpe ratio and drawdowns.”

A/B TestingAmazon EC2Apache KafkaApache SparkAWSAWS Glue+163
View profile
AC

Andrew Clayman

Screened

Senior Data Scientist specializing in ML, NLP, and production AI systems

Remote8y exp
AppstemUniversity of Southampton

“Machine learning/NLP engineer with deep Azure stack experience (Data Factory, Databricks/Spark, Delta Lake, Azure OpenAI, Azure AI Search) who built end-to-end production systems for semantic clustering, entity resolution, and hybrid search. Demonstrated measurable gains from embedding fine-tuning (~15% retrieval precision, ~10–12% nDCG@10) and designed scalable, quality-checked pipelines with MLOps best practices.”

PythonC++SQLDockerFlaskCI/CD+133
View profile
MF

Michael Forster

Screened

Senior Data Engineer specializing in ETL/ELT pipelines and data integration platforms

New York, NY15y exp
PearsonCleveland State University

“Data engineer/software engineer who led an end-to-end ETL/ELT pipeline at Pearson processing millions of rows of student data nightly, including client-side data prep/validation, SFTP/API ingestion, staging-based SQL validation/transforms, and production loading. Built reliability features like configurable per-client validation thresholds, detailed reporting, concurrency throttling via a custom queue, and multi-source merge/backfill logic to keep nightly loads running even when sources fail.”

PostgreSQLSQL.NETC#GoPython+65
View profile
AG

Amit Gangane

Screened

Junior Data Scientist specializing in agentic AI and RAG pipelines

San Francisco, CA2y exp
Eureka AIUC Davis

“LLM/agentic systems builder who shipped production workflows at Angel Flight West and Eureka AI, combining LangGraph + RAG (Postgres/pgvector) with strong observability (LangSmith/Langfuse). Delivered large operational gains (address lookup cut from 10 minutes to 60 seconds; accuracy to 92%) and has a track record of quickly stabilizing customer-critical pipelines (Pydantic-enforced JSON for ETL) while partnering with sales/ops to drive adoption.”

PythonC++SQLGitDockerCI/CD+107
View profile
PG

Pandraju Gamanapriya

Screened

Mid-level Data Scientist specializing in healthcare ML and GenAI

San Marcos, TX4y exp
UnitedHealth GroupTexas State University

“Healthcare data/NLP practitioner with experience at UnitedHealthcare building production ML systems that connect unstructured call center transcripts and medical notes to structured claims data. Has delivered measurable impact (25% classification accuracy lift; ~30% relevance improvement) using classical NLP, embeddings (Sentence-BERT + FAISS), and AWS SageMaker deployments with robust validation and drift monitoring.”

AgileAnomaly DetectionAPI IntegrationAWSAWS GlueBash+106
View profile
EA

Erik Arriaga

Screened

Mid-level Data Engineer specializing in cloud data pipelines and machine learning

Austin, TX4y exp
Corner LeagueCalifornia State University, Long Beach

“Experience spans college-built AWS-hosted Python/Flask web apps and enterprise data work at General Motors, including PostgreSQL query optimization on millions of records and multi-tenant-style data isolation using group-based, column-level permission grants. Also built an AWS-hosted meat price prediction dashboard using Dash/Plotly and ran large nightly data pipelines orchestrated with Apache Airflow.”

PythonSQLJavaPySparkPandasNumPy+59
View profile
PJ

PRAHARSHA JANDHYALA

Screened

Mid-level Data Scientist/Data Analyst specializing in ML, BI dashboards, and ETL pipelines

Dallas, TX4y exp
HumanaArizona State University

“Data/ML practitioner with experience at Humana and Hexaware, focused on turning messy, semi-structured datasets into production-ready pipelines. Built an age-prediction model from book ratings using heavy feature engineering and multiple regression models, and has hands-on entity resolution (deterministic + fuzzy matching) plus embeddings/vector DB approaches for linking and search relevance.”

PythonRSQLPower BITableauMicrosoft Excel+178
View profile
RT

Rakesh Thota

Screened

Mid-level Data Engineer specializing in multi-cloud real-time data pipelines

California, USA5y exp
Molina HealthcareUniversity at Buffalo

“Data engineer with healthcare/clinical trial domain experience who owned a 100TB+/month AWS pipeline end-to-end (Glue/S3/Redshift/Airflow) and drove measurable outcomes (20% lower latency, 99.9% reliability, 40% less manual reporting). Also built production data services and API-based ingestion on GCP (Cloud Run/Functions/BigQuery) with strong validation, versioning, and safe migration practices, and launched an early-stage RAG solution (LangChain + GPT-4) for researchers.”

PythonSQLJavaPySparkApache SparkApache Kafka+136
View profile
GM

Gopichand Muppaneni

Screened

Mid-level Data Engineer specializing in Azure, Spark, and scalable ETL/ELT pipelines

Charleston, IL4y exp
Eastern Illinois UniversityEastern Illinois University

“Data engineer with banking FP&A experience who led an end-to-end migration of 10+ TB from Teradata to Azure (ADF + Data Lake + Databricks/PySpark + Synapse). Emphasizes reliability (multi-stage validation, monitoring/alerts) and performance (Spark tuning, incremental loads, autoscaling), reporting ~99.5% pipeline reliability while supporting downstream consumers with stable schemas and clear change management.”

PythonSQLPySparkETLData PipelinesData Modeling+47
View profile
1...272829...39

Related

Data ScientistsData EngineersData AnalystsMachine Learning EngineersSoftware EngineersResearch AssistantsBusiness AnalystsTeaching AssistantsSoftware DevelopersAI Engineers

Need someone specific?

AI Search