Reval Logo

Vetted Apache Spark Professionals

Pre-screened and vetted.

SR

Sharanya Rao

Screened

Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG for finance and healthcare

Remote, USA3y exp
Ally FinancialUniversity of Maryland, Baltimore County

Built an AI lending assistant (RAG + DeBERTa) used by credit analysts to retrieve policies and past loan decisions, tackling real production issues like hallucinations, document quality, and sub-second latency. Deployed a modular, Dockerized AWS architecture (ECS/EMR + load balancer) with load testing, caching/precomputed embeddings, and CloudWatch monitoring, and used Airflow to automate scheduled data/embedding/vector DB refresh pipelines with retries and alerts.

View profile
AR

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps

3y exp
State FarmCleveland State University

Built a secure, on-prem/private GPT assistant to replace manual SharePoint-style search across thousands of policies/SOPs/engineering docs, using a production RAG stack (LangChain/LangGraph, FAISS/Chroma, PyMuPDF+OCR, vLLM). Implemented layout-aware ingestion (including table-to-JSON) and a multi-agent retrieval/generation/verification workflow with strong observability and compliance guardrails, delivering ~70% reduction in search time.

View profile
YL

Yurong Luo

Screened

Senior Data Scientist/ML Engineer specializing in scalable ML and LLM systems

Remote9y exp
dataAnnotationVirginia Commonwealth University

Built and deployed an end-to-end product that brings a research-paper approach into production for large-scale time-series clustering, with attention to partitioning, latency, and scalability. Also designed a Python-based backend validation service (comparing outputs to database ground truths) and handled production reliability issues by reproducing dataset-specific crashes and hardening corner-case behavior with client-friendly errors.

View profile
SN

Senior Data Engineer specializing in cloud data platforms and ML pipelines

Atlanta, GA8y exp
Berkshire HathawayUniversity of Alabama at Birmingham

Data engineer focused on AWS-based enterprise data platforms, owning end-to-end pipelines from multi-source batch/stream ingestion (Glue/Kinesis/StreamSets/Airflow) through PySpark transformations into curated datasets for Redshift/Snowflake. Emphasizes production reliability with strong monitoring/observability and data quality gates, and reports ~30% performance improvement plus improved SLAs and latency after optimization.

View profile
SK

Executive Technology Leader specializing in digital transformation, headless e-commerce, and cloud architecture

Chesterfield, VA25y exp
Hamilton BeachUniversity of Phoenix

Technology leader focused on business-aligned roadmaps and integration-heavy ecommerce platforms. Recently delivered an on-time launch for lutusooking.com (a premium Hamilton Beach brand) by coordinating UX/UI, component-based middleware, BigCommerce, Algolia search, personalization/recommendations, payments, and supply chain integrations, and later improved scalability via a Jitterbit iPaaS approach proven during Black Friday/Cyber Monday traffic.

View profile
YN

Mid-level Data Scientist specializing in ML, NLP, and Generative AI

Michigan, USA3y exp
Ally FinancialUniversity of Michigan-Dearborn

GenAI/ML engineer with production experience at Cognizant and Ally Financial, building end-to-end LLM/RAG systems and ML pipelines. Delivered a domain chatbot trained from 90k tickets and 45k docs, improving intent accuracy (65%→83%), scaling to 800+ concurrent users with 99.2% uptime and sub-150ms latency, and driving +14% customer satisfaction. Strong in Azure ML + DevOps CI/CD, Dockerized deployments, and explainable/PII-safe modeling using SHAP/LIME to satisfy stakeholder trust and GDPR needs.

View profile
RR

Mid-level Full-Stack Developer specializing in FinTech platforms and cloud-native microservices

Texas, USA6y exp
Morgan StanleyUniversity of Central Missouri

Backend engineer focused on AI-enabled systems, having built a production-style RAG pipeline (vector search + LLM) exposed via Python/Flask endpoints with strong observability and hallucination-reduction techniques. Demonstrates deep performance work in PostgreSQL/SQLAlchemy (5x faster analytics queries) and high-throughput optimization using Celery + Redis (800ms to 120ms latency, 3x throughput), plus schema-per-tenant multi-tenancy with tenant-aware middleware and logging.

View profile
AV

Senior Full-Stack .NET Developer specializing in cloud-native web applications

Bethesda, MD5y exp
Accompany HealthTrine University

Backend/ML systems engineer who built a Flask + PostgreSQL internal ticketing platform and demonstrates strong database/ORM performance depth (indexes, partitioning, RLS multi-tenancy). Notably optimized a high-throughput attachment OCR/embedding pipeline with batching, deduplication, and Redis caching, cutting median latency from 45s to 10s and reducing worker cost by 35% while increasing throughput 4x.

View profile
HB

Mid-level AI/ML Engineer specializing in FinTech risk, fraud detection, and GenAI/RAG systems

USA6y exp
Freddie MacUniversity of Wisconsin

Built and productionized Azure-based LLM/RAG systems for regulatory/compliance use cases, including automating analyst research and compliance report generation across large unstructured document sets. Demonstrates strong practical depth in hallucination mitigation, hybrid retrieval tuning (BM25 + embeddings), and production MLOps (Databricks, Cognitive Search, AKS, Airflow/MLflow), plus proven ability to deliver auditable, explainable solutions with non-technical compliance teams.

View profile
KP

Mid-level Full-Stack Java Developer specializing in cloud-native microservices and React

5y exp
Northern TrustCentral Michigan University

Full-stack engineer who owned enterprise workflow platforms end-to-end at Northern Trust and Elevance Health—building NestJS/Java Spring Boot APIs, React UIs, and cloud deployments on GCP Cloud Run. Strong in data-heavy applications (hundreds of thousands of records) with proven production performance tuning (indexing/query rewrites, Cloud Run concurrency/min instances) and secure RBAC via Azure AD.

View profile
HT

Mid-level Machine Learning Engineer specializing in LLMs, agentic AI, and risk/fraud modeling

San Francisco, CA3y exp
The Research Foundation for SUNYUniversity at Buffalo

Built and productionized an agentic LLM workflow during a summer internship to transform unstructured clinical reports into analytics-ready structured data, using a LangChain multi-agent design plus an LLM-as-a-judge layer to control quality in a regulated setting. Also has experience orchestrating ML pipelines at Piramal Capital using AWS Step Functions/EventBridge/CloudWatch, with strong emphasis on observability, evaluation rigor, and measurable impact (80–90% reduction in manual data entry).

View profile
TG

Executive Technology Leader (CTO/CIO) specializing in AI/ML, cloud modernization, and FinTech

Santa Monica, CA11y exp
Web3AdvisorsUniversity of Phoenix

Engineering/technology leader (CTO-style) with experience scaling orgs and running distributed teams across four continents for over a decade. Led a high-stakes modernization of a securities trading platform at Wedbush—migrating from monolith to microservices on AWS with zero-downtime constraints—driving 45% execution performance improvement and enabling 25% market share growth. Emphasizes business-aligned roadmaps, build-vs-buy rigor, and scalable engineering practices/culture.

View profile
MS

Muaaz Syed

Screened

Mid-level AI/ML Engineer specializing in NLP and conversational AI

Richardson, TX4y exp
CVS HealthUniversity of Texas at Dallas

ML/NLP engineer focused on real-time IT ops analytics, building a predictive maintenance/anomaly detection platform end-to-end (multi-source ETL, streaming, modeling, and production deployment on GCP/Vertex AI). Uses deep learning (LSTMs, autoencoders/VAEs) plus embeddings (SentenceBERT) and vector search to improve incident correlation and search, citing ~40% reduction in duplicate alert noise.

View profile
AB

Alekya Battu

Screened

Mid-level Data Scientist specializing in ML, NLP, and MLOps

USA5y exp
Wells FargoWilmington University

Senior data scientist with ~5 years’ experience building production ML/NLP systems in finance (Wells Fargo) and deep learning for sensor analytics in connected vehicles (Medtronic). Has delivered end-to-end platforms combining time-series forecasting with transformer-based NLP, including automated drift monitoring/retraining (MLflow + Airflow) and standardized Docker/CI/CD deployments; achieved a reported 22% precision improvement after domain fine-tuning.

View profile
SK

Mid-level Data Scientist specializing in real-time fraud detection and MLOps

San Francisco, CA5y exp
Charles SchwabCUNY Graduate Center

ML/NLP engineer with experience at Charles Schwab building an NLP + graph (Neo4j) entity-resolution system to unify fragmented user/device/transaction data and improve downstream model quality and analyst querying. Has applied embeddings (SentenceTransformers + FAISS) with domain fine-tuning to boost hard-case matching recall by ~12% while maintaining precision, and has a track record of hardening scalable Python/Spark pipelines and productionizing fraud models via A/B tests and shadow-mode monitoring.

View profile
OP

Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms

Jersey City, NJ5y exp
Nurture HoldingsUC Santa Cruz

Built and shipped a production RAG-based assistant that lets parents ask natural-language questions about their child’s learning progress, using pgvector retrieval (child-id filtered) and Redis caching to hit ~180ms latency. Implemented real-world guardrails and compliance (Llama Guard, COPPA, retrieval thresholds, fallbacks) with 99.5% uptime, and ran human-in-the-loop eval loops that improved satisfaction from 3.8 to 4.2 while serving 60k+ monthly users and reducing costs significantly.

View profile
AB

Senior Data & Platform Engineer specializing in cloud-native streaming and distributed systems

USA10y exp
JPMorgan ChaseNew York Institute of Technology

Financial data engineer who has built and operated high-volume batch + streaming pipelines (200–300 GB/day; 5–10k events/sec) using AWS, Spark/Delta, Airflow, Kafka, and Snowflake, with strong emphasis on data quality and reliability. Demonstrated measurable impact via 99.9% SLA adherence, major reductions in bad records/nulls, MTTR improvements, and significant latency/runtime/query performance gains; also built a distributed web-scraping system processing 5–10M records/day with anti-bot and schema-drift defenses.

View profile
HG

Hritvik Gupta

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and healthcare AI

San Francisco, CA3y exp
Penn MedicineUC Riverside

Built and scaled an AI-powered voice/chat patient engagement platform at Penn Medicine from early prototype into production clinical workflows, focusing on latency, edge cases, and user trust. Strong in LLM reliability engineering (structured prompts, validation/fallbacks), real-time troubleshooting with observability, and cross-functional enablement through pilots, demos, and sales/customer partnership.

View profile
DB

Senior AI/ML Engineer specializing in production-grade LLM systems for regulated finance

Fairfax, VA9y exp
George Mason UniversityGeorge Mason University

AI/LLM engineer with published work who built FinVet, a production financial misinformation detection system using multi-pipeline RAG, confidence-based voting, and evidence-backed outputs (F1 0.85, +37% vs baseline). Also built NexusForest-MCP, a Dockerized Model Context Protocol server exposing structured global deforestation/carbon data via SQL tools for reliable LLM tool use. Previously delivered borrower risk-rating (PD) models at BMO Financial Group that were validated and integrated into an enterprise credit system through close collaboration with credit officers and portfolio managers.

View profile
MA

maheen Adeeb

Screened

Senior Machine Learning Engineer specializing in LLMs, speech AI, and RAG systems

Chicago, IL3y exp
VosynDePaul University

AI engineer with production experience building multilingual speech-to-speech translation pipelines (ASR + LLM) for enterprise/media, focused on reliability at scale. Has hands-on orchestration experience (including IBM Watson contexts) and emphasizes production evaluation/monitoring using a mix of traditional metrics and LLM-based evaluators to catch quality regressions while balancing latency and cost.

View profile
BS

Mid-level Data Engineer specializing in Lakehouse, Streaming, and ML/LLM data systems

Remote, USA3y exp
DiscoverUniversity of South Dakota

Built and productionized an enterprise retrieval-augmented generation platform for internal knowledge over large unstructured corpora, emphasizing trust via strict citation/grounding and hybrid retrieval (BM25 + FAISS + cross-encoder re-ranking). Demonstrates strong scaling and cost/latency optimization through incremental indexing/embedding and index partitioning, plus disciplined evaluation/observability practices. Has experience operationalizing pipelines with Airflow/Databricks/GitHub Actions and partnering closely with risk & compliance stakeholders on auditability requirements.

View profile
TT

Mid-level AI/ML Engineer specializing in MLOps and LLM applications

New York, NY4y exp
BNY MellonUniversity at Albany

BNY Mellon engineer who has built and operated production AI systems end-to-end: a LangChain/Pinecone RAG platform scaled via FastAPI + Kubernetes to 1000 RPM with 99.9% uptime, supported by monitoring and data-drift detection. Also deep in data/infra orchestration (Airflow, Dagster, Terraform on AWS/EMR/EC2), processing 500GB+ daily and delivering measurable reliability and performance gains, plus strong compliance-facing model explainability using SHAP and Tableau.

View profile

Need someone specific?

AI Search