Reval Logo
Home Browse Talent Skilled in Web Scraping

Vetted Web Scraping Professionals

Pre-screened and vetted.

Web ScrapingPythonDockerSQLJavaScriptAWS
RC

Randall Crumm

Senior Solutions Engineer specializing in AI automation and Unified Communications

San Jose, CA15y exp
RC ConsultingSan José State University
Workflow AutomationAPI IntegrationPrompt EngineeringRetrieval-Augmented Generation (RAG)PythonJSON+82
View profile
JC

Jacob Colombo

Intern AI Engineer specializing in agentic RAG systems and computer vision

Orlando, FL2y exp
Hibiscus HealthCornell University
ChromaDBComputer VisionData PreprocessingData StructuresDashboard DevelopmentJava+43
View profile
RW

Robert Williams

Senior Full-Stack Engineer specializing in Python, AI automation, and cloud microservices

New Braunfels, TX13y exp
GRAILTexas Christian University
PythonJavaC#TypeScriptDjangoReact+82
View profile
JK

Jay Kim

Junior Python Developer & Data Analyst specializing in AML and financial data engineering

McLean, VA2y exp
Capital OneGeorgia Tech
PythonJavaScriptJavaC++HTMLCSS+71
View profile
GV

Greg Vincent

Engineering Executive specializing in AI platforms and high-scale web products

Washington, DC30y exp
LiltGrinnell College
A/B TestingAgileAnalyticsAPI DevelopmentAWSAWS Lambda+69
View profile
HL

Haochen LI

Intern Machine Learning Engineer specializing in Generative AI and LLM systems

Hong Kong, China1y exp
Blue InsuranceDuke University
Amazon SageMakerApache HadoopApache SparkAPI IntegrationArtificial IntelligenceAWS+61
View profile
DM

Diego Magdaleno

Mid-Level Software Engineer specializing in test automation and ML systems

7y exp
10x GenomicsGeorgia Tech
BashCC++CI/CDData AnalyticsData Visualization+52
View profile
DS

Daniel Stephens

Senior Data Engineer specializing in cloud data platforms and scalable ETL pipelines

Rosharon, TX11y exp
AssistRxUniversity of Texas at Austin
AgileAmazon RedshiftApache AirflowApache HiveApache KafkaAWS+107
View profile
PA

Pranav Arvind Jibhakate

Junior Software Developer specializing in C++ systems and performance engineering

San Jose, CA1y exp
AMDNorth Carolina State University
CC++PythonJavaSQLBash+56
View profile
AL

Aaron Li

Screened

Junior AI/ML Engineer specializing in production LLM systems and RAG

Atlanta, GA2y exp
Georgia Institute of TechnologyUniversity of Chicago

“LLM/document AI engineer who owned a production-grade contract extraction pipeline at CORAMA.AI, ingesting PDFs and dynamic JavaScript sites from 1,000+ government sources. Built a hybrid deterministic+LLM system with two-phase prompting, Pydantic guardrails, confidence scoring, and human-in-the-loop review—cutting error rates from ~35% to <5% and processing 50k+ documents at ~95% accuracy. Also built clinician-in-the-loop orchestration in research, reducing manual labeling time from 3–4 hours to ~50 minutes.”

Machine LearningLarge Language Models (LLMs)OpenAI APIPrompt EngineeringWeb ScrapingPython+93
View profile
TS

Travoy Spelling

Screened

Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP

Texarkana, TX10y exp
TredenceUniversity of Texas at Austin

“ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).”

A/B TestingAPI DevelopmentAWSAWS LambdaAWS Step FunctionsAzure Data Factory+247
View profile
KJ

Krishi Jain

Screened

Junior Implementation Manager / Solution Engineer specializing in AI, ERP integrations, and predictive maintenance

Chicago, IL2y exp
Continuum AIWestcliff University

“LLM/agentic workflow practitioner (Continuum AI) who productionized an LLM system for manufacturing RMA intake and warranty claims by moving from a brittle prompt to a modular pipeline with RAG, function-calling extraction, deterministic validation, and strong observability. Also diagnosed and fixed an agentic ticket-triage misrouting issue by tracing failures to retrieval timeouts, adding guardrails/fallbacks, and implementing retries plus continuous evaluation—bringing misroutes near zero while creating a repeatable debugging playbook.”

PythonJavaSwiftC++CJavaScript+84
View profile
SH

Shiv Harish Prabaharan

Screened

Mid-level Software Engineer specializing in systems, storage, and machine learning

Round Rock, TX4y exp
Dell TechnologiesUniversity of Wisconsin–Madison

“Robotics-focused engineer who built a non-holonomic self-driving car on Raspberry Pi 5 using ROS 2, implementing sensor fusion (robot_localization EKF), 2D SLAM (slam_toolbox), custom Hybrid A*/RRT* planners, and MPC trajectory tracking. Demonstrated strong real-time debugging and performance tuning (timestamp sync, CPU contention mitigation) and is extending the platform toward CV-based plant identification and autonomous plant watering.”

AWSCC++CUDAData ingestionData validation+106
View profile
PS

Pratima Singh

Screened

Senior Full-Stack Software Engineer specializing in FinTech, cloud microservices, and blockchain

Tempe, AZ10y exp
Arizona State UniversityArizona State University

“Python/ML engineer with strong DevOps depth: built an end-to-end regime-aware stock prediction system (custom fine-tuned FinBERT sentiment + technical/macro features) delivering a 12% accuracy lift. Also implemented Kubernetes/Helm + Jenkins/GitHub Actions pipelines (including GitOps-style workflows for multi-cloud Hyperledger Besu) and improved deployment speed/stability by ~50% while addressing race conditions and image drift.”

AgileAPI DevelopmentAuthenticationAWSAWS LambdaC+++158
View profile
NV

Nikita Vivek Kolhe

Screened

Junior Data & Machine Learning Engineer specializing in MLOps and NLP

Los Angeles, United States1y exp
WorkUpUSC

“ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).”

PythonSQLCC++RMATLAB+90
View profile
KK

Kalyaan Kanugula

Screened

Mid-Level Software Engineer specializing in AWS distributed systems and microservices

Chico, CA4y exp
AmazonCalifornia State University, Chico

“Backend/ML-systems engineer with experience (including Amazon) building real-time face recognition services using PyTorch (MTCNN/FaceNet) and AWS (SQS/S3/Lambda/EC2) with a focus on low latency, burst handling, and cost control. Also led a revenue-critical legacy pricing workflow migration to a serverless event-driven architecture using strangler-pattern rollout, simulation-based validation, and strong security practices (JWT/RBAC/RLS).”

AWSAWS LambdaC++Cloud ComputingDistributed SystemsDocker+84
View profile
AS

Amit Sharma

Screened

Principal Software Engineer specializing in AI/LLM platforms, payments, and healthcare systems

San Francisco, CA25y exp
FambotUniversity of Delhi

“Engineering player-coach who recently shipped an agent-based workflow to extract key info from unstructured web data (browser agents + CDP) and populate daily digests/calendars, owning architecture through testing. Also built a Flask-based LLM evaluation and regression testing system using G-Eval/Confident AI dashboards, and applies a rigorous, research-driven approach to selecting third-party tools with stakeholder buy-in; has healthcare ops/onboarding workflow experience at Vivio Health.”

PythonFastAPIFlaskDjangoPandasNumPy+146
View profile
ML

Ming-Kai Liu

Screened

Junior AI Engineer specializing in LLM pipelines, RAG, and computer vision

Raleigh, NC2y exp
Citrus OncologyUC San Diego

“Built and deployed an on-prem, HIPAA-compliant LLM pipeline for oncology-focused clinical note generation and decision support, emphasizing grounded differential diagnosis and explainable reasoning via RAG to reduce hallucinations. Also created a LangGraph-based multi-agent academic paper search system integrating Tavily, arXiv, and Semantic Scholar with an orchestrator that routes tasks to specialized sub-agents.”

LinuxCC++PythonJavaSQL+81
View profile
AZ

Alex ZhuZhou

Screened

Intern Full-Stack Software Engineer specializing in AI/LLM platforms and data systems

Berkeley, CA2y exp
EmbraerUC Davis

“Backend/LLM engineer with experience productionizing RAG systems (legal-case natural language querying) and optimizing for latency/cost, including a reported ~40% reduction via Redis caching and batching. Built monitoring and real-time debugging workflows (FastAPI, structured logging, correlation IDs, sandbox repro) and regularly delivered technical demos/workshops. Also partners with BD/sales to translate LLM capabilities into business value, including ESG-metric extraction from corporate filings.”

PythonTypeScriptJavaScriptJavaNode.jsSQL+78
View profile
LK

Lekha Karanam

Screened

Mid-level AI/Analytics Product & Data Professional specializing in LLM and dashboard automation

Dallas, TX3y exp
Goldman SachsUniversity of Texas at Dallas

“Built and shipped open-source LLM/RAG systems, including a generative AI assistant grounded on ~30,000 scraped university web pages, improving response accuracy ~30% by moving from TF-IDF-only retrieval to a hybrid sentence-transformer approach with fallback controls. Also partnered with non-technical leadership at Securi.ai to deliver real-time predictive analytics dashboards (Elasticsearch + Jira/ServiceNow) that reduced project overhead by 18%.”

PythonSQLRScikit-learnTensorFlowPyTorch+61
View profile
SM

Satish Malempati

Screened

Mid-level Full-Stack Developer specializing in Java/Spring Boot, React, and cloud-native AI automation

Long Beach, CA3y exp
UberCalifornia State University

“Software engineer focused on reliability and scalable systems: built React/TypeScript dashboards backed by Java/Spring Boot APIs and designed Kafka-based microservices with strong contract/versioning discipline. Known for shipping incremental improvements with tight feedback loops and for creating internal observability tools that streamline on-call and incident diagnosis under high-traffic conditions.”

JavaPythonTypeScriptSQLSpring BootSpring MVC+104
View profile
PP

Pooja Pun

Screened

Mid-Level Full-Stack Software Engineer specializing in cloud systems and internal platforms

Seattle, WA3y exp
AmazonUC San Diego

“Robotics-focused Python developer who built autonomous navigation for a differential-drive robot using onboard vision and AprilTag detection, including pose estimation and coordinate frame transformations for localization and motion planning. Also has practical backend performance experience using Redis TTL caching to speed responses and reduce server load, plus basic PostgreSQL query/index optimization.”

TypeScriptJavaPythonPostgreSQLFull-stack developmentPerformance optimization+63
View profile
SB

Shriya Bannikop

Screened

Mid-level Software Engineer specializing in cloud platforms, data engineering, and distributed systems

Seattle, WA5y exp
Amazon Web ServicesKLE Technological University

“Full-stack engineer who built and owned an AI-assisted job-matching dashboard in Next.js App Router/TypeScript, keeping LLM logic server-side and improving performance via deduplication, caching/revalidation, and streaming (35% fewer duplicate LLM calls; 40% faster first render). Also has strong data/backend chops: designed Postgres models and optimized queries at million-record scale (1.8s to 120ms) and built durable AWS multi-region telemetry workflows with idempotency, retries, and monitoring.”

AgileAmazon CloudWatchAmazon DynamoDBAmazon EC2Amazon ECSAmazon EKS+170
View profile
MX

Ming-Sheng Xu

Screened

Intern Software Engineer specializing in full-stack web development and automation

Oklahoma City, OK1y exp
PaycomTexas A&M University

“Undergraduate robotics researcher who built a crowd-aware motion planning system to navigate safely and efficiently through dynamic pedestrian environments, implementing the full pipeline in ROS (move_base, global planning, SLAM/localization) and validating via 2D crowd simulation. Also brings modern software delivery experience from web apps, including Docker/Kubernetes-based cloud deployment and CI/CD with automated testing.”

AgileAndroidAsanaAzure Blob StorageCC#+72
View profile
1234...24

Related

Software EngineersResearch AssistantsMachine Learning EngineersSoftware DevelopersData ScientistsAI EngineersEngineeringAI & Machine LearningEducationData & Analytics

Need someone specific?

AI Search