Reval Logo

Vetted PySpark Professionals

Pre-screened and vetted.

SR

Shruti Rawat

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps for financial services

Jersey City, NJ4y exp
State StreetPace University

Built and deployed a production Llama 3-based RAG document Q&A system using FAISS, addressing context-window limits through chunking and keeping retrieval accurate by regularly refreshing embeddings. Has hands-on orchestration experience with LangChain and LlamaIndex for multi-step LLM workflows (including memory management) and collaborates with non-technical teams (e.g., marketing) to deliver AI solutions like recommendation systems.

View profile
UO

Mid-Level Software Engineer specializing in backend, distributed systems, and AI/LLM platforms

Prairie View, TX4y exp
Prairie View A&M UniversityPrairie View A&M University

Built and shipped AI-powered workflow automation at Oracle, including an MCP-based agentic workflow with tool-calling and guardrails, plus Grafana monitoring and Confluence documentation. Also led a Django monolith-to-microservices migration at Chamsmobile using blue-green deployment and load balancer traffic splitting to avoid regressions while modernizing production systems.

View profile
AA

Junior Software Engineer specializing in AI/ML, data pipelines, and cloud APIs

San Jose, CA3y exp
TCSCalifornia State University, Chico

Hands-on AI/LLM practitioner who built a RAG-based customer support chatbot and tackled production issues like data chunking complexity and response-time lag. Uses techniques such as overlapping chunks, semantic search, context engineering, and query routing, and has experience presenting technical demos/workshops to developer audiences.

View profile
AT

Abdul Tanimu

Screened

Senior Full-Stack Software Engineer specializing in cloud-native web applications

Houston, TX7y exp
TechwaveUniversity of North Texas

Backend/data engineer who built a production booking platform on FastAPI microservices (Postgres/Redis/gRPC) and delivered AWS infrastructure spanning Lambda, ECS, SQS, and Glue-to-Redshift analytics. Demonstrated measurable SQL optimization (10 minutes to <40 seconds) and strong operational ownership through monitoring, incident response, and schema-evolution hardening.

View profile
YS

Yash Sanap

Screened

Junior Data Scientist specializing in ML, geospatial analytics, and LLM applications

Virginia Beach, VA2y exp
City of Virginia BeachGeorge Mason University

Built and deployed a production AI “term explainer” agent that adapts explanations to beginner/intermediate/expert users by combining multi-step LLM reasoning with grounded Wikipedia retrieval. Owns end-to-end agent orchestration (smolagents/Python), reliability patterns (fallback across LLM providers, retries, guardrails), and observability/metrics-driven evaluation; also partnered with a non-technical researcher to deliver a plain-language research assistant agent.

View profile
VK

Vaishnavi K

Screened

Mid-level AI/ML Engineer specializing in GenAI, MLOps, and anomaly detection

USA5y exp
TCSUniversity of New Haven

LLM/MLOps engineer who has shipped a production RAG-based technical documentation assistant (FastAPI) cutting manual review by 45%, with deep hands-on retrieval optimization in Pinecone/LangChain (HNSW, hybrid + multi-query search, caching). Also brings healthcare domain experience—building Airflow-orchestrated EHR pipelines and delivering FDA-auditability-friendly predictive maintenance solutions using SHAP/LIME explainability surfaced in Power BI.

View profile
TP

Thilak P

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and big data pipelines

5y exp
W. R. BerkleySacred Heart University

Backend/data engineer who builds Python (FastAPI) data-processing API services for internal analytics/reporting, emphasizing modular architecture, async performance tuning, and reliability patterns (health checks, retries, observability). Also migrated legacy on-prem ETL pipelines to Azure using ADF/Data Lake/Functions and implemented a near-real-time ingestion flow with Event Hubs plus watermarking to handle late events and deduplication.

View profile
AJ

Aman Jain

Screened

Mid-level Software Engineer specializing in cloud-native data pipelines and ML platforms

Boston, MA4y exp
Community Dreams FoundationBoston University

Backend engineer who has owned end-to-end delivery of Python/FastAPI microservices for real-time data processing and alerting, including performance tuning (Postgres optimization, caching, async processing). Strong DevOps/GitOps background: Docker + Kubernetes deployments with GitHub Actions CI/CD and ArgoCD-driven GitOps, plus experience supporting phased on-prem to AWS migrations and building Kafka-based streaming pipelines.

View profile
YP

Mid-level AI Engineer specializing in LLMs, RAG, and data engineering

Boston, MA5y exp
Humanitarians.AINortheastern University

AI Engineer Co-Op at Northeastern University who built a production Patient Persona Chat Bot to help nursing students practice clinical interactions, fine-tuning Llama 3 and integrating a LangChain + Pinecone RAG pipeline deployed on Amazon Bedrock. Emphasizes clinical accuracy and reliability with guardrails, retrieval filtering, and continuous evaluation, and also brings strong data engineering/orchestration experience (Airflow, EMR/PySpark, ADF, dbt, Databricks, Snowflake).

View profile
NB

Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions

Tampa, FL4y exp
LumenUniversity of South Florida

AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail Maven dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.

View profile
GG

Mid-level Data Scientist specializing in GenAI, LLM-to-SQL, and analytics platforms

Turin, Italy3y exp
Engineering Ingegneria InformaticaUniversity of Ferrara

LLM/agentic AI builder who led end-to-end integration of an LLM system into a business intelligence product, creating a scalable, metadata-driven RAG/agent pipeline with an orchestrator that routes queries to specialized agents (including DB-backed quantitative querying). Also built an LLM-to-SQL chatbot and partnered with non-technical stakeholders to capture domain context and improve SQL generation, using automated LLM-based testing to evaluate reliability.

View profile
AM

Mid-Level Full-Stack Software Engineer specializing in cloud-native microservices

Sanford, FL4y exp
HCLTechUniversity of Massachusetts Lowell

Backend engineer with cloud-native Python/Flask experience building high-throughput financial platforms (loan origination intelligent document processing and real-time fraud detection). Has scaled microservices on AKS with event-driven Azure messaging, delivered measurable performance gains (e.g., 700ms→180ms query latency; ~40% API improvements), and implemented strong security controls (OAuth2/JWT, Azure AD RBAC, audit logging, AES-256/TLS) for sensitive regulated data.

View profile
SV

Mid-level Data Engineer specializing in cloud data platforms and AI agents

Santa Clara, CA6y exp
SwirepaySan José State University

Data/Backend engineer who has owned end-to-end merchant analytics systems on AWS: orchestrated multi-source ingestion (FISERV/Shopify/Clover) with Step Functions/Lambda, enforced strong data quality gates, and served curated datasets via Redshift and a FastAPI layer. Also built an early-stage Merchant Insights AI agent that converts natural language questions into SQL using OpenAI models, with full CI/CD and observability.

View profile
MP

Manali Patil

Screened

Senior Software Engineer & Engineering Manager specializing in cloud backend and manufacturing MES

Santa Clara, CA9y exp
Halo IndustriesUniversity of San Francisco

Customer-facing engineer who led recurring midnight ERP data-feed/B2B integrations from prototype to production, building reusable APIs and using Hangfire for job scheduling. Known for tight weekly customer iteration, strong documentation and test coverage (80%+), and cross-functional problem-solving with Operations/Quality/NPI to resolve data-collection and manufacturing-process constraints; has 2 customers live on the integration.

View profile
LP

Mid-level ML & Data Engineer specializing in GenAI, graph modeling, and fraud/risk analytics

Redwood City, CA5y exp
BlueArcYeshiva University

Built a production AI fraud/risk scoring platform at BlueArc that ingests web business/product/site data, generates text+image embeddings, and connects entities in a graph to detect reuse patterns and links to known bad actors. Optimized for scale with incremental graph re-scoring and delivered investigator-friendly explainability by surfacing the exact signals/relationships behind each score; orchestrated workflows with Airflow and GCP event-driven components (Pub/Sub, Dataflow, Cloud Run) and has recent LLM workflow orchestration experience (retrieval, prompting, scoring).

View profile
SS

Sai somapalli

Screened

Senior LLM Engineer specializing in Generative AI, RAG, and multimodal assistants

USA6y exp
Stellar AI SolutionsCampbellsville University

GenAI/NLP engineer with experience building classification and summarization pipelines in PyTorch and deploying multimodal GPT-4-style workflows. Has integrated LLM applications across OpenAI, Azure OpenAI, and Amazon Bedrock, and uses LangChain/LlamaIndex/Semantic Kernel to orchestrate RAG and agent workflows with production-focused evaluation metrics like task success rate and groundedness.

View profile
AG

Mid-level AI/ML Engineer specializing in MLOps and cloud-deployed ML systems

Austin, TX3y exp
PurevisitxUniversity of Illinois Springfield

ML/AI engineer who built and productionized an NLP system at PurevisitX, orchestrating end-to-end ML workflows with Airflow (S3 ingestion through auto-retraining) and optimizing for drift and low-latency inference. Also partnered with Citibank risk teams on a fraud detection model, translating results via dashboards and iterating thresholds based on stakeholder feedback.

View profile
SK

Mid-level ML Engineer specializing in NLP and Generative AI

Houston, TX4y exp
Epic SystemsUniversity of Central Missouri

Healthcare AI/ML engineer with Epic experience who built and deployed a HIPAA-compliant GPT-4 RAG clinical assistant over large medical document sets, emphasizing privacy controls and low-latency performance. Also automated end-to-end retraining and deployment of patient risk models using orchestration/CI-CD (Jenkins, SageMaker, MLflow), cutting deployment time from hours to minutes while improving reliability.

View profile
MV

Manish Vemula

Screened

Mid-level Machine Learning Engineer specializing in real-time pipelines and NLP/GenAI

TX, USA4y exp
DiscoverCentral Michigan University

ML/MLOps practitioner from Discover Financial who built and deployed a real-time AI fraud detection platform (LSTM + VAE) on AWS SageMaker with Docker/FastAPI and Jenkins-driven CI/CD. Demonstrated measurable impact (30% accuracy lift, 25% fewer false alerts) and deep expertise in class-imbalance mitigation, drift monitoring, and orchestration (Airflow/Kubeflow), plus strong stakeholder adoption via Power BI dashboards for fraud/compliance teams.

View profile
DD

Dhairya Desai

Screened

Senior AI/ML Engineer specializing in healthcare NLP and predictive analytics

Chicago, IL13y exp
OptumUniversity of Texas at Dallas

ML/NLP engineer with healthcare and industrial IoT experience: built an Optum pipeline that converted 2M+ physician notes into structured entities and linked them with claims/pharmacy data to create an actionable patient timeline. Deep hands-on expertise in production NER, entity resolution, and hybrid search (Elasticsearch + embeddings/FAISS), plus robust data engineering practices (Airflow, Spark, data contracts, auditability) and experimentation-to-production rollout via shadow mode and feature flags.

View profile
DG

Dimple Galla

Screened

Mid-level Data Scientist / AI-ML Engineer specializing in RAG, MLOps, and real-time analytics

Lawrence, KS4y exp
PaycomUniversity of Kansas

Software/ML engineer who built a production automated job-finding and cold-email personalization system for Fortune 500 outreach, using JobSpy for dynamic scraping, LangChain orchestration, and LLM+vector DB semantic search with grounding/relevance metrics and guardrails. Also delivered a predictive investment analytics platform for financial advisors, communicating results via Tableau dashboards and portfolio KPIs like Sharpe ratio and drawdowns.

View profile
SS

Sumit Sahu

Screened

Mid-level Machine Learning Engineer specializing in computer vision and MLOps on GCP

Atlanta, GA4y exp
NCR VoyixUniversity of Georgia

ML/AI engineer who deployed a real-time, edge-based computer-vision pipeline for produce recognition in retail self-checkout to reduce shrink. Demonstrates strong end-to-end production chops: multi-camera data calibration/sync, ranking-based modeling for fine-grained classes, latency-focused optimization, and continuous A/B testing/monitoring with guardrails. Experienced with ML orchestration (Kubeflow Pipelines, Airflow) and CI/CD via GitHub Actions, and collaborates closely with store operations to make interventions usable in the checkout flow.

View profile
MK

Mid-level AI/ML Engineer specializing in Generative AI and MLOps

Arlington, TX4y exp
micro1University of Texas at Austin

Built and shipped a production RAG assistant using GPT-4, LangChain, and Pinecone/FAISS to search 50K+ institutional documents, with a strong focus on groundedness and hallucination reduction through retrieval optimization and re-ranking. Pairs this with a metrics-driven evaluation/monitoring approach (BLEU/ROUGE, manual sampling, logging) and workflow automation via Airflow, and has experience translating stakeholder needs into iterative AI prototypes.

View profile
ER

Edwin Rivera

Screened

Senior Full-Stack Software Engineer specializing in modern web apps and cloud platforms

Villa Rica, Georgia7y exp
RTXFull Sail University

Backend/data engineer with production experience building real-time sensor telemetry platforms: FastAPI + PostgreSQL services with strong observability, plus AWS serverless and Glue-based ETL into Redshift. Has modernized legacy SAS pipelines into Python microservices and delivered measurable performance wins (Postgres query latency cut to <1 minute and ~60% DB CPU reduction) while owning incident response and reliability improvements.

View profile

Need someone specific?

AI Search