Vetted Data Scientists

Pre-screened and vetted.

SP

SASI PAILA

Screened

Mid-level AI/ML Engineer specializing in Generative AI and production ML systems

PA, USA4y exp
BNY MellonFranklin University

Built and deployed a production SecureAIChatBot (RAG-based) for secure internal information retrieval, using embeddings/vector search, GPT models, monitoring, and safety filters. Focused on real-world production challenges like latency and output consistency, applying caching, retrieval scoping, smaller models, and controlled prompting, and used LangChain to orchestrate the end-to-end workflow.

View profile
TT

Mid-level AI/ML Engineer specializing in MLOps and LLM applications

New York, NY4y exp
BNY MellonUniversity at Albany

BNY Mellon engineer who has built and operated production AI systems end-to-end: a LangChain/Pinecone RAG platform scaled via FastAPI + Kubernetes to 1000 RPM with 99.9% uptime, supported by monitoring and data-drift detection. Also deep in data/infra orchestration (Airflow, Dagster, Terraform on AWS/EMR/EC2), processing 500GB+ daily and delivering measurable reliability and performance gains, plus strong compliance-facing model explainability using SHAP and Tableau.

View profile
VA

Mid-level Data Scientist specializing in Generative AI and NLP for financial risk

Glassboro, NJ4y exp
S&P GlobalRowan University

Built and shipped production generative AI/RAG assistants in regulated financial contexts (S&P Global), automating compliance-oriented Q&A over earnings reports/filings with grounded answers and citations. Experienced across the full stack—AWS-based ingestion (PySpark/Glue), vector retrieval + LangChain agents, GPT-4/Claude model selection, and production reliability (monitoring, caching, retries) plus rigorous evaluation and regression testing.

View profile
Daniel Berhane Araya - Senior AI/ML Engineer specializing in production-grade LLM systems for regulated finance in Fairfax, VA

Senior AI/ML Engineer specializing in production-grade LLM systems for regulated finance

Fairfax, VA9y exp
George Mason UniversityGeorge Mason University

AI/LLM engineer with published work who built FinVet, a production financial misinformation detection system using multi-pipeline RAG, confidence-based voting, and evidence-backed outputs (F1 0.85, +37% vs baseline). Also built NexusForest-MCP, a Dockerized Model Context Protocol server exposing structured global deforestation/carbon data via SQL tools for reliable LLM tool use. Previously delivered borrower risk-rating (PD) models at BMO Financial Group that were validated and integrated into an enterprise credit system through close collaboration with credit officers and portfolio managers.

View profile
Bala Venkateswarlu K - Mid-level Data Scientist specializing in Generative AI, NLP, and MLOps in USA

Mid-level Data Scientist specializing in Generative AI, NLP, and MLOps

USA5y exp
MetLifeHarrisburg University of Science and Technology

Built and deployed an LLM-powered claims-document summarization system (insurance domain) that cut agent review time from 4–5 minutes to under 2 minutes and saved 1,200+ hours per quarter. Hands-on across orchestration and production infrastructure (Airflow retraining DAGs, Kubernetes, SageMaker endpoints, FastAPI) and recent RAG workflows using n8n + Pinecone, with a strong focus on reliability, cost, and explainability for non-technical stakeholders.

View profile
KS

Krish Shah

Screened

Junior AI Engineer specializing in LLM systems and analytics

Miami, FL2y exp
CoUnderscorePurdue University

Analytics-focused candidate with internship and project experience at Recotap and CoUnderscore, combining SQL, Python, and BI dashboards to turn messy marketing and engagement data into decision-ready reporting. Stands out for tying analytics work to business outcomes, including ~15% CTR improvement, identifying ~40% misattributed spend, and enabling a ~$75K budget shift through better targeting.

View profile
VS

Senior AI/ML Engineer specializing in Generative AI, LLMs, and MLOps

Tampa, FL9y exp
VerizonJawaharlal Nehru Technological University

Telecom (Verizon) AI/ML practitioner who built a production multimodal system that ingests messy customer issue reports (calls, chats, emails, screenshots, videos) and turns them into confidence-scored incident summaries with reproducible steps and evidence links. Also built KPI/alarm-to-ticket correlation to rank likely root-cause domains (RAN/Core/Transport), cutting triage from hours to minutes and improving MTTR.

View profile
MP

Meghana P

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMs, and NLP

Illinois, USA5y exp
State FarmSaint Louis University

AI/ML engineer with forensic analytics and healthcare claims experience (Optum), building production LLM/RAG systems to surface context-driven fraud patterns from unstructured claim notes and explain risk to investigators. Strong in large-scale retrieval performance tuning, legacy API integration with reliability patterns (SQS, circuit breakers), and MLOps orchestration on Airflow/Kubernetes with rigorous testing, monitoring, and stakeholder-friendly interpretability.

View profile
AS

Mid-level AI/ML Engineer specializing in Generative AI and production ML systems

United States5y exp
CVS HealthUniversity of Maryland, Baltimore County

At CVS Health, the candidate productionized a RAG-based LLM solution in a regulated healthcare setting, emphasizing reliable data pipelines, LoRA fine-tuning, monitoring, safety guardrails, and A/B testing. They have hands-on experience troubleshooting real-time RAG failures (e.g., chunking/embedding issues) and regularly lead developer-focused demos/workshops while translating technical architecture into business value for stakeholders.

View profile
PG

Prasanth Goli

Screened

Mid-level Data Scientist specializing in Generative AI and LLM production systems

United States5y exp
AT&TWestern Illinois University

Built and deployed a production LLM-powered workflow assistant that automated internal marketing/production business tasks (document summarization, repeated Q&A, status updates). Demonstrates end-to-end applied LLM engineering: modular RAG architecture, hallucination/latency mitigation, automated evals to prevent prompt regressions, and Azure-based orchestration (Functions/Logic Apps) with monitoring and controlled rollouts.

View profile
RE

Mid-level AI/ML Engineer specializing in NLP and Generative AI

Indiana, USA6y exp
Elevance HealthIndiana University Indianapolis

Built and deployed a production LLM-powered RAG assistant for healthcare teams (care managers/support) to answer questions from clinical and policy documentation, emphasizing trustworthiness via improved retrieval, reranking, and strict grounding prompts to reduce hallucinations. Also has hands-on orchestration experience with Apache Airflow for end-to-end ETL/ML workflows and applies rigorous testing/metrics (hallucination rate, tool-call accuracy, latency, cost) to ensure reliable AI agent behavior.

View profile
AM

Mid-level Data Scientist specializing in LLMs, RAG, and document intelligence

NYC, NY3y exp
MagnitUniversity at Buffalo

LLM/ML engineer who has shipped production systems in legal/financial-risk domains at Wolters Kluwer, including a hybrid OCR+deterministic+LLM extraction pipeline that structured UCC filings at massive scale and drove $6M+ in revenue. Also built LangGraph-based multi-agent “Deep Research” workflows with model routing, tool calls (MCP), persistence, and human-in-the-loop review, and partnered closely with policy writers to deliver LLM summarization that cut writing time by ~60%.

View profile
OR

Mid-level Data Scientist specializing in predictive modeling, NLP/LLMs, and RAG search systems

Des Moines, IA6y exp
CDS GlobalUniversity of Massachusetts

Built production LLM/RAG platforms for financial services to enable natural-language Q&A over large policy/compliance document sets stored in Snowflake and SharePoint. Strong in MLOps and orchestration (Airflow, ADF, Step Functions, MLflow) and in solving real production issues like stale embeddings and model performance, including an incremental Snowflake Streams sync that cut processing time from hours to minutes.

View profile
SS

Sameer Shaik

Screened

Senior AI Engineer specializing in Generative AI, NLP, and applied deep learning

Chicago, IL8y exp
Live NationDePaul University

Built a production multi-agent LLM system at Live Nation on Databricks (LangGraph/LangChain) that let venue/event teams ask questions in Slack, auto-generated optimized route schedules, and produced inventory/stocking recommendations from historical SQL data and venue trends. Improved reliability by tightening prompts with strict JSON schemas, providing sample questions/SQL, and adding guardrails plus synthetic/edge-case testing, while iterating with event managers and senior VPs via prototypes and feedback loops.

View profile
TN

Mid-level Data Scientist & AI/ML Engineer specializing in GenAI and cloud ML

Harrison, NJ5y exp
State FarmMonroe University

GenAI/LLM engineer who recently built a production compliance assistant at State Farm for KYC/AML and regulatory teams, using AWS Bedrock + LangChain with Textract/Lambda pipelines to extract fields, tag risk, and summarize long documents. Implemented RAG, strict structured outputs, and human-in-the-loop guardrails, and reports automating ~80% of documentation work while reducing review time by ~40%.

View profile
VN

Vasanthi N.

Screened

Senior AI/ML Engineer and Data Scientist specializing in Generative AI and MLOps

Los Angeles, CA9y exp
Pacific Community BankAurora University

ML/NLP practitioner focused on financial-services document intelligence and compliance workflows—built an end-to-end pipeline to classify documents and extract financial entities from loan applications, emails, and statements stored in S3/internal databases. Strong in entity resolution/record linkage and in productionizing pipelines with GitHub Actions CI/CD, testing, data validation, and Docker, plus semantic search using OpenAI embeddings and a vector database.

View profile
KE

Senior Data Scientist specializing in NLP and explainable machine learning

8y exp
Miro HealthRensselaer Polytechnic Institute

NLP/ML practitioner who built an explainable, clinician-aligned system to detect cognitive decline (Alzheimer’s/stroke-related) from audio responses, achieving 97% accuracy on only a few hundred data points. Also has experience with healthcare claims entity resolution and prototyped a word2vec-based patent search vector database in Elasticsearch, with strong emphasis on testing, interpretability, and scalable Python data workflows.

View profile
UO

Principal Data Scientist specializing in Generative AI, NLP, and MLOps

San Francisco, CA12y exp
CognizantUniversity at Buffalo

ML/NLP practitioner with banking experience (M&T Bank) who has built a GPT-4 RAG system using LangChain and Pinecone to connect unstructured customer data with internal knowledge bases, improving accuracy and reducing manual lookup time by 50%+. Strong in entity resolution and productionizing scalable Python data workflows, including major performance wins by migrating bottleneck joins from Pandas to Dask.

View profile
JM

Mid-level Data Scientist / ML Engineer specializing in FinTech and Healthcare ML systems

4y exp
FiservSan Diego State University

AI/LLM engineer who has shipped production RAG systems (including a 250K-document compliance knowledge tool on AWS) and focuses on reliability via citations, guardrails, and rigorous evaluation (Ragas/Opik/DeepEval). Also built a LangGraph-orchestrated webcrawler agent that cut research paper extraction from hours to minutes, and collaborated with clinical teams to deliver patient volume forecasting with an optimization layer for staffing.

View profile
Husayn El Sharif - Senior Data Scientist specializing in geospatial ML and environmental analytics in Atlanta, GA

Senior Data Scientist specializing in geospatial ML and environmental analytics

Atlanta, GA16y exp
Georgia Institute of TechnologyGeorgia Tech

Applied ML practitioner who deployed a near-real-time water-quality monitoring tool for Gwinnett County by fusing ESA satellite imagery with in-situ measurements to predict chlorophyll-A and support early warnings for harmful algal blooms. Also working on a multimodal deep-learning project combining skin lesion images with patient tabular/text data (TensorFlow, embeddings) to predict melanoma risk.

View profile
Lokesh Jain - Senior AI/ML Engineer specializing in supply chain and healthcare systems in Bentonville, AR

Lokesh Jain

Screened

Senior AI/ML Engineer specializing in supply chain and healthcare systems

Bentonville, AR6y exp
Forman TechnologyUniversity at Buffalo

Built and deployed AcademiQ Ai, a production LLM-based teaching assistant using GPT/BERT with RAG (LangChain + Pinecone) to handle large student notes and generate adaptive explanations/quizzes. Demonstrated measurable retrieval-quality gains (18% precision improvement, 22% less irrelevant context) by tuning similarity thresholds and chunking based on user satisfaction signals. Also orchestrated terabyte-scale, real-time demand forecasting pipelines using Airflow and Kubeflow on GCP with strong monitoring, shadow deployment, and feedback-loop practices.

View profile
JM

Senior Full-Stack Software Engineer specializing in civic tech and AI/RAG systems

13y exp
Emerson Collective42 Silicon Valley
View profile
SP

Mid-level Data Scientist specializing in LLMs, fraud detection, and healthcare analytics

Atlanta, GA3y exp
Georgia Institute of TechnologyGeorgia Tech
View profile
Gayatri Nagesh Walke - Junior AI/ML Engineer specializing in NLP, LLMs, and production ML systems in Arizona, United States

Junior AI/ML Engineer specializing in NLP, LLMs, and production ML systems

Arizona, United States2y exp
peerlogic.aiUniversity at Buffalo
View profile

Need someone specific?

AI Search