
Vetted Data Scientists in Pennsylvania

Pre-screened and vetted in Pennsylvania.

Machine Learning, Python, Natural Language Processing, AWS, Data Visualization, Deep Learning

Dhrubajyoti Ray

Mid-level Data Scientist specializing in NLP, MLOps, and semiconductor manufacturing analytics

Pittsburgh, USA | 3y exp
TIAA | Carnegie Mellon University
A/B Testing, Airflow, AWS, Azure, Bash, BERT, +63 more

Shuvam Mitra

Screened

Mid-level Data Scientist specializing in anomaly detection and production ML

Pittsburgh, PA | 4y exp
Honda | Carnegie Mellon University

Interned at Backblaze building production AI systems for incident response and security operations, including an internal LLM-powered incident triage assistant that used Snowflake + RAG over historical tickets/postmortems and delivered results via Slack and a web UI. Emphasizes reliability (PII filtering, grounding, schema validation, fallbacks) and rigorous evaluation/observability (offline replay, partial rollouts, time-to-first-action metrics, Prometheus/Grafana).

Agile, Anomaly Detection, AWS, AWS Database Migration Service (DMS), AWS Relational Database Service (RDS), AWS Timestream, +89 more

Tzu-Chieh Huang

Screened

Mid-level Software Engineer specializing in backend systems, IoT, and AI security

Pittsburgh, PA | 3y exp
Naptic | Carnegie Mellon University

Full-stack engineer in the investment tracking/financial reporting space who built an automated reporting dashboard and compliance/reporting pipeline end-to-end using Next.js (App Router, server/client components), REST, and Postgres. Demonstrated measurable performance wins (~30% faster loads) through caching and query optimization, and built durable orchestrated workflows in n8n with retries, idempotency, and reconciliation checks.

Python, Java, C++, C#, JavaScript, SQL, +74 more

Jiaqi Li

Screened

Junior AI Engineer specializing in healthcare analytics and compliance AI

Pittsburgh, PA | 1y exp
CustomerInsights.AI | Carnegie Mellon University

Built and shipped a production LLM-driven multi-agent platform (ciATHENA) at CustomerInsights.AI to automate analytics/ML/compliance workflows in healthcare and life sciences. Implemented LangGraph/LangChain orchestration with strong backend-style rigor (schemas, Pydantic validation, retries, auditability) and optimized latency/cost while keeping the system usable for non-technical users via guided natural-language interactions and structured/visual outputs.

Python, R, Scikit-Learn, PyTorch, Predictive Modeling, Machine Learning, +79 more

Shubham Singh

Screened

Mid-level AI/ML Engineer specializing in speech, computer vision, and agentic GenAI

Pittsburgh, PA | 6y exp
Musing AI | Carnegie Mellon University

Built and shipped a production multi-agent, voice-based conversational assistant for older adults’ daily health management using Vertex AI, FastAPI, Firebase/Firestore, and Cloud Run, with a custom cross-session memory design to keep responses context-aware at low latency. Also partnered with caregivers/elderly users and health officials, translating needs into workflows and explaining HIV risk predictions with SHAP and dashboards.

Agent AI, Apache Beam (Dataflow), ASR (Automatic Speech Recognition), Azure, BigQuery, CI/CD, +108 more

Mohamed Elaraby

Senior NLP Research Scientist specializing in summarization, argument mining, and LLM evaluation

Pittsburgh, PA | 8y exp
University of Pittsburgh | University of Pittsburgh
Natural Language Processing (NLP), Text Summarization, Computational Argumentation, Argument Mining, Abstractive Summarization, Long-document Summarization, +41 more

Shreyas Darade

Screened

Mid-level Data Scientist specializing in business intelligence and machine learning

Pittsburgh, PA | 2y exp
Armada Partners | Carnegie Mellon University

Internship experience building a production LLM-powered podcast operations agent that automated lead intake (HubSpot), guest research, scheduling (Calendly), meeting-summary evaluation (Gemini), and human approval via a Slack bot, while retaining rejected candidates for future outreach. Also contributed to the ideation of a multi-agent orchestration framework with parsing and task routing, and emphasized reliability via structured prompts, human-in-the-loop (HITL) feedback, and prompt-based test sets.

A/B Testing, Airbnb Market Analysis, Analytics, Business Intelligence, Classification, Clustering, +84 more

Faizan Khan

Mid-level Applied Scientist specializing in production GenAI and RAG systems

Pittsburgh, PA | 5y exp
Finetune Learning | Carnegie Mellon University
Agentic Systems, AI Safety, Amazon Elasticsearch Service, Amazon Lambda, Anthropic API, Audio Processing, +83 more

Ayesha Mazzy

Senior Data Scientist specializing in healthcare analytics and scalable ML pipelines

Philadelphia, PA | 11y exp
CoverMyMeds
Agile, Airflow, Apache Hadoop, Apache Kafka, Apache Spark, Audit-ready documentation, +96 more

SASI PAILA

Screened

Mid-level AI/ML Engineer specializing in Generative AI and production ML systems

PA, USA | 4y exp
BNY Mellon | Franklin University

Built and deployed a production SecureAIChatBot (RAG-based) for secure internal information retrieval, using embeddings/vector search, GPT models, monitoring, and safety filters. Focused on real-world production challenges like latency and output consistency, applying caching, retrieval scoping, smaller models, and controlled prompting, and used LangChain to orchestrate the end-to-end workflow.

AI Pipelines, APIs, Automation Workflows, Baseline Metrics, CI/CD, Coding Best Practices, +56 more

Riya Gaur

Mid-level Data Scientist specializing in NLP, MLOps, and Generative AI

Pittsburgh, PA | 3y exp
PNC | Drexel University
Python, R, SQL, Jupyter Notebook, Google Colab, Machine Learning, +86 more

Meenaa Vellaiyan

Mid-level Data Scientist specializing in ML, NLP, and GenAI (RAG)

Newtown, PA | 4y exp
CenTrak | Northeastern University
Python, SQL, PySpark, Pandas, NumPy, Machine Learning, +55 more

Mahesh Ponnam

Screened

Mid-level Data Scientist specializing in credit risk, fraud detection, and ESG analytics

PA, USA | 4y exp
Northern Trust | Wilmington University

AI/LLM practitioner who has deployed production chatbots across e-commerce, HRMS, and real estate, focusing on retrieval-first workflows for factual tasks like product and property search. Optimized intent understanding and significantly improved latency by using lightweight embeddings and tuning the inference pipeline on Groq (Llama 3.3), while applying modular orchestration and measurable production evaluation.

Python, Pandas, NumPy, Scikit-learn, TensorFlow, PyTorch, +124 more

Tiffany Zhu

Mid-level Data Scientist specializing in ML, predictive analytics, and automation

Pittsburgh, PA | 5y exp
Liaison International | University of Pittsburgh
Python, SQL, R, Machine Learning, Predictive Modeling, Forecasting, +58 more