Intern Data Scientist specializing in generative AI and forecasting
San Francisco, CA | Data Scientist Intern | 5 years experience | Intern | Technology | Artificial Intelligence | Financial Services
Screened | Identity Verified
About
ML/NLP practitioner working across healthcare and business/finance use cases. Currently fine-tuning a domain-specific Llama 3.1 model for safe reasoning over EHRs and clinical notes, using RAG, RL/DPO, and RAGAS-based evaluation. Has built UMLS-driven entity normalization pipelines with quantified quality gains, and developed embedding/vector-DB systems (FAISS) for semantic matching and forecasting/recommendation applications at Aurora AI and Banxico.
Experience
Data Scientist Intern | Aurora AI
Machine Learning & Data Science Economist | Banco de Mexico
Consultant | Universidad Autónoma de Nuevo León (University-Industry Relations Office)
Financial Planning Analyst | Secretary of Finance and General Treasury of Nuevo León (State Gov. Office)
Education
University of Chicago, Physical Sciences Division | Master's, Applied Data Science (2025)
Instituto Tecnológico y de Estudios Superiores de Monterrey | Bachelor's, Economics (2019)
Key Strengths
Built healthcare LLM inference/analytics pipeline combining RAG + RL with clinician-feedback reward model to reduce hallucinations
Designed multi-stage biomedical data cleaning + UMLS-based entity normalization; reduced duplicate entities by 18% and increased UMLS alignment by 12%
Validated entity resolution at data/semantic/model levels (manual gold set + downstream fine-tuning impact)
Improved downstream factual accuracy/consistency by ~9–10% using cleaned vs raw datasets
Production-grade Python workflow design with orchestration, versioning, and experiment tracking (Airflow/Prefect, DVC, MLflow)
Embedding + vector DB search/relevance systems (OpenAI/Sentence-Transformers + FAISS) linking customer behavior and product metadata
Clear prototype-to-production decision-making based on measurable metrics (accuracy, latency, interpretability) and cross-functional validation