ScreenedIdentity Verified
No cost, no commitment - we'll make a personal intro
DM

Diana Minine Gudinho

Mid-level Data Scientist specializing in GenAI, RAG, and forecasting

University at BuffaloUniversity at BuffaloNew Jersey, USA4 Years ExperienceMid LevelWorks On-Site

Connect with Diana

Diana already has a relationship with Reval, so a warm intro from us gets a much better response than cold outreach.

Typically responds within 24 hours

Recommended

Already have an account?

About

ML/NLP engineer focused on large-scale data linking for e-commerce-style catalogs and customer records, combining transformer embeddings (BERT/Sentence-BERT), NER, and FAISS-based vector search. Has delivered measurable lifts (e.g., +30% matching accuracy, Precision@10 62%→84%) and built production-grade, scalable pipelines in Airflow/PySpark with strong data quality and schema-drift handling.

Hire with Reval

Find your next great hire

Our AI agents source, screen, and vet candidates for your open roles. Get qualified candidates within 48 hours.

$250one-time kickoff
10%on successful hire
Post a Role90-day money-back guarantee

Key Strengths

  • Built NLP pipeline to unify multi-vendor product catalogs using BERT/DistilBERT + NER + FAISS
  • Improved product matching accuracy by ~30%
  • Designed scalable entity resolution with hybrid blocking + BERT similarity; scaled to tens of millions of records
  • Improved entity match accuracy by ~25% vs prior system
  • Improved semantic search relevance via Sentence-BERT fine-tuning; Precision@10 from 62% to 84% and ~35% relevance lift
  • Production-grade data workflow engineering (Airflow, PySpark, Docker, CI/CD, monitoring, data quality checks)
  • Handled vendor schema drift with automated schema validation and dynamic mapping layer
  • Built and deployed a RAG-based compliance/legal document review system
  • Designed for high-precision retrieval to reduce compliance risk
  • Optimized retrieval and model performance for large-scale data (latency/compute constraints)
  • Hands-on Airflow orchestration for ETL + ML pipelines at scale (5M+ daily records)
  • End-to-end pipeline design from S3/Glue/PySpark to Redshift with DAG-based scheduling
  • Structured agent/workflow testing approach (unit + integration) with measurable metrics
  • Production monitoring with real-time dashboards and stakeholder feedback loops
  • Effective collaboration with non-technical stakeholders (marketing/compliance) translating requirements into usable outputs

Like what you see? We'll introduce you to Diana directly.

Experience

Research AssistantUniversity at Buffalo · Feb 2025 – Present
Graduate Student AssistantUniversity at Buffalo · Aug 2024 – Dec 2024part-time
Data AnalystTata Consultancy Services Ltd. · Oct 2020 – Jun 2023
Full-Stack Software Engineering InternQSpiders · Jul 2019 – Aug 2019internship

Education

University at Buffalomaster, Data Science (2025)
Visvesvaraya Technological Universitybachelor, Information Science (2020)

Languages

English

Similar Candidates

SS

Principal Data Scientist & AI/ML Engineer specializing in LLMs, recommender systems, and MLOps

San Francisco, CA11y exp
SlackStanford University
View profile
George Sun - Mid-level investor specializing in cross-border private equity and tech in Cambridge, MA

George Sun

Screened

Mid-level investor specializing in cross-border private equity and tech

Cambridge, MA4y exp
Mossavar-Rahmani CenterHarvard University

Investor/banker with cross-border sourcing experience spanning JPMorgan, Ascendant, Pumavira, and EQD, covering AI, advanced manufacturing, TMT, and consumer healthcare. Stands out for turning cold founder outreach into long-term banking engagements and for combining high-volume sourcing with rigorous early-stage commercial and regulatory screening.

View profile
AK

Executive AI & Data Infrastructure Leader

Seattle, WA17y exp
Stealth StartupUniversity of Michigan
View profile
EP

Esha Pahwa

Screened

Intern Machine Learning Engineer specializing in LLM agents and multimodal reasoning

Mountain View, CA2y exp
Corvic AICarnegie Mellon University

LLM/agent engineer who built a production code-generation agent at Corvic AI that lets non-technical users query CSV/tabular data in natural language by generating and executing Python. Focused on making LLM systems reliable and scalable via schema-aware validation, sandboxed execution-feedback retries, prompt caching/embeddings, async execution, and high-throughput data processing with Polars; also partnered with Adobe product/marketing to ship brand-aligned AI content generation for email and push notifications.

View profile
FC

Executive AI founder specializing in machine learning for drug discovery

San Diego, CA15y exp
KekulaiHarvard University

Entrepreneur building an AI-driven small molecule biotech startup who has already raised $5M from major venture capital investors. Brings 3 years of fundraising experience and a thoughtful approach to aligning investors with different risk profiles across financing stages, paired with strong conviction around founding high-risk, high-upside companies.

View profile
KV

Intern Software Engineer specializing in Machine Learning and Generative AI

Bellevue, WA1y exp
AmazonGeorgia Tech
View profile

Interested in Diana?

We'll personally introduce you - no strings attached.

For Hiring Teams

Build your dream team with Reval

Our AI agents source, screen, and vet candidates for your open roles. Get qualified, high-intent candidates on your desk within 48 hours.

$250one-time kickoff
10%on successful hire
48hrsto first candidates
Post a Role90-day money-back guarantee. A fraction of traditional agency fees.

Discover more candidates like Diana

Search across thousands of pre-screened, high-quality, high-intent candidates on Reval.

Search Talent

Connect with Diana

Diana already has a relationship with Reval, so a warm intro from us gets a much better response than cold outreach.

Typically responds within 24 hours

Recommended

Already have an account?

Hire with Reval

Find your next great hire

Our AI agents source, screen, and vet candidates for your open roles. Get qualified candidates within 48 hours.

$250one-time kickoff
10%on successful hire
Post a Role90-day money-back guarantee
Diana Minine GudinhoMid-level Data Scientist specializing in GenAI, RAG, and forecasting