Vetted Web Scraping Professionals

Pre-screened and vetted.

Sri Harshitha Yannam

Junior Software Engineer specializing in AI/ML and cloud platforms

Austin, TX | 2y exp
Amazon | University of Wisconsin–Milwaukee

LLM/agent engineer who shipped a production "Memory Assistant" at HydroX AI, building a LangChain/LlamaIndex RAG memory pipeline on ChromaDB/FAISS with robust fallbacks (BERT/BART), prompt-injection mitigation, and 99.9% uptime monitoring. Also built a multi-step customer support agent using Rasa + OpenAI Assistants API with structured tool calling, guardrails, and human-in-the-loop escalation, and has experience hardening agents against messy ERP data via Pydantic validation, idempotency, and transactional outbox patterns.

SN

Intern Full-Stack Software Engineer specializing in AI/ML and AWS cloud platforms

Birmingham, AL | 1y exp
Yuva Biosciences | Tufts University

Full-stack engineer who built an LLM-powered productivity web app (LifeOS) end-to-end with TypeScript/Next.js, Prisma, and Postgres, emphasizing fast iteration with stable API contracts and an isolated AI service boundary. Also built a security/compliance login-verification workflow at Medpace used within an internal admin portal for thousands of employees, and has AWS experience orchestrating batch GPU workloads with robust retry/idempotency patterns.

Aisha Sartaj

Screened

Mid-level AI Engineer specializing in LLM systems, RAG, and MLOps

Remote | 3y exp
ILMAscent | UCLA

Built an LLM multi-agent “ingredient safety” analyzer for cosmetics that cuts consumer research time from ~20+ minutes to minutes, using LangGraph orchestration, hybrid retrieval (Qdrant + Tavily), and safety-focused critic validation (false rejections reduced ~30%→~8%). Also has research-internship experience building computer-vision pipelines to classify emerald color/clarity by translating gem-expert heuristics into quantitative model features.

DK

Senior Data Engineer specializing in Azure Lakehouse, Databricks/Spark, and Snowflake

Richardson, TX | 6y exp
PwC | University of Central Missouri

Data engineer/platform builder with experience across PwC and Liberty Mutual delivering high-volume, production-grade pipelines and real-time data services. Has owned end-to-end streaming + batch architectures on AWS and Azure, including web scraping systems, with quantified reliability gains (99.9% availability, 90%+ error reduction, 30% latency reduction) and strong observability/CI-CD practices.

Wei Jiang

Screened

Junior Machine Learning Engineer specializing in MLOps and statistical modeling

Greenwood, SC | 3y exp
ES Foundry | Northeastern University

Integration engineer at ES Foundry who led deployment of ELsentinel, a production EL image-based solar cell quality monitoring system using a Swin Transformer classifier (>0.8 F1 across 15+ classes) plus a real-time prediction dashboard. Strong at solving messy labeling and data-quality problems in collaboration with process teams, and at shipping ML systems despite limited compute and infrastructure.

Lance Chou

Screened

Intern Machine Learning Engineer specializing in NLP and MLOps

Canada | 1y exp
Vosyn | Columbia University

PhD-led research engineer who has shipped LLM-powered agents for automated knowledge extraction from STEM textbooks/papers into a graph database, reporting a 90% accuracy improvement and major reductions in manual curation time. Also built an end-to-end multi-agent news aggregation/sentiment pipeline using the Agno framework with Pydantic-structured outputs, retries, and monitoring, and has experience processing messy SEC filings.

AS

Mid-level Software Engineer specializing in backend systems and AI automation

San Francisco, CA | 5y exp
For Women’s Health | UC Santa Cruz

Built a production Python microservice around Grafana Loki focused on reliability, with checkpointing, idempotency, replay tooling, tracing, and alerting to prevent data loss and silent lag. Also has hands-on experience hardening brittle Playwright automations against dynamic UIs, auth expiry, rate limits, MFA, and bot-detection constraints, plus turning tribal-knowledge SOPs into explicit state-machine-driven workflows.

Kristina Shen

Screened

Intern-level Data Scientist and ML Engineer specializing in analytics and AI systems

Long Island City, NY | 1y exp
DataLynn | University of Chicago

Early-career analytics candidate with hands-on experience in SQL/Python data pipelines, Tableau reporting, and marketing engagement analytics across internship and startup settings. Stands out for combining rigorous data quality practices with practical AI system design, including an end-to-end GPT-4 grading capstone that emphasized explainability and human oversight.

Yinghai Yu

Screened

Mid-level Data Engineer specializing in cloud data platforms and AI/ML pipelines

San Mateo, CA | 6y exp
Bubbles and Books | Georgia Tech

Data-engineering-oriented candidate with hands-on experience building an agentic AI product and operational automation workflows. They described automating inventory-to-ERP discrepancy reconciliation with anomaly detection and daily reporting, and also have practical scraping/automation experience dealing with Cloudflare-protected sites using Selenium and Puppeteer.

Hao Liang

Screened

Mid-level Data Scientist specializing in GenAI, customer insights, and forecasting

Durham, NC | 5y exp
BASF | University of North Carolina at Chapel Hill

ML/AI practitioner with hands-on experience shipping production time-series forecasting and RAG-based customer insights platforms in an enterprise setting. At BASF, he improved seed sales forecasting beyond naive baselines using model selection tailored by brand size, and he also led a RAG solution over Salesforce reports, complaints, and surveys that reached 2,000+ users with strong daily engagement.

Amit Dharam

Screened

Junior AI/ML Software Engineer specializing in backend systems and cloud deployment

Tempe, AZ | 3y exp
Arizona State University | Arizona State University

Built multiple end-to-end automation and data systems, including an Accio RAG pipeline combining PDF parsing, FastAPI, Neo4j, and vector search, plus Selenium-based scraping for a virtual try-on product. Stands out for reliability-minded engineering: automated testing, structured logging, validation layers, and a data-driven approach to debugging flaky automation that improved CI pass rates to over 98%.

SP

Junior Robotics & AI Researcher specializing in soft robotics and real-time ML control

Boston, MA | 2y exp
Boston University | Boston University

Early-career robotics engineer who has integrated LLM/NLP command interfaces (OpenAI/LLaMA) into ROS-controlled industrial manipulators and built data-driven controls for underwater soft robotic actuators. Combines hands-on fabrication (balloon actuator with embedded copper traces) with sensor debugging (IMU/Aurora) and simulation work in Gazebo, with practical exposure to edge deployment constraints on Jetson Nano and model quantization.

MW

Senior Full-Stack AI Engineer specializing in Azure OpenAI and RAG/GraphRAG systems

Eagle Mountain, UT | 24y exp
GoEngineer | Brigham Young University

Built GoEngineer’s first production AI systems, including an end-to-end RAG pipeline for SolidWorks technical support using Azure Blob Storage, Azure AI Search, and Azure OpenAI, plus an AI summarization feature adopted by sales/customer success. Strong in productionizing LLM workflows with evaluation harnesses (golden sets, LLM-as-judge, red teaming, shadow deploys) and Azure infrastructure integrations (Redis, Service Bus, App Insights), and has also implemented a custom MCP server for agentic monitoring.

Alex Olson

Screened

Junior AI & Full-Stack Developer specializing in generative AI and web platforms

Remote | 1y exp
JerseySTEM | Boston University

Recent graduate with internship experience at Bausch + Lomb building Copilot Studio HR chatbots that reduced HR time spent on repetitive inquiries. Strong focus on conversational flow design, prompt-based steering for predictability, and thorough technical/end-user documentation; also building a personal YouTube AI SEO analyzer.

Amani Mudili

Screened

Mid-level Data Engineer specializing in cloud ETL pipelines (Azure, AWS, GCP)

Mississauga, Canada | 4y exp
Citigroup | Webster University

Data engineer/backend developer who owned end-to-end pipelines and external data collection systems, including API ingestion and large-scale web scraping. Worked at ~50M records/month scale, improving processing speed by 20% and reducing reporting errors by 15%, and shipped a Rust-based internal data API with versioning, caching, and strong validation/observability practices.

Sushma Mangalampati

Mid-level Data Engineer specializing in lakehouse ETL and analytics engineering

Boston, MA | 6y exp
ServiceNow | Northeastern University

Data engineer with strong end-to-end ownership of production lakehouse pipelines (Snowflake + Databricks + Airflow + dbt + Great Expectations), handling 8M+ records/month and 500K+ daily CDC updates. Delivered measurable reliability and efficiency gains (41% cost reduction, freshness improved from 4h to 30m, 35% fewer downstream incidents) and has experience building a lakehouse platform from scratch across 12 source systems.

Preeti Pandey

Screened

Senior AI/ML Engineer specializing in predictive analytics and NLP

Birmingham, AL | 10y exp
Blue Cross and Blue Shield of Alabama | Liverpool John Moores University

ML/AI engineer with hands-on experience building production healthcare AI systems across predictive modeling and GenAI. They built an end-to-end patient risk prediction platform and a RAG-based clinical summarization feature, combining strong NLP/LLM skills with AWS deployment, monitoring, drift detection, and reusable Python service design to deliver measurable clinical and operational impact.

HD

Mid-level Data Engineer specializing in scalable ETL pipelines and data quality automation

USA | 6y exp
Centene | Purdue University

JG

Senior Full-Stack Software Engineer specializing in web development and data engineering

Los Angeles, CA | 12y exp
Aquarius Asia | UCLA

Divyam Bansal

Mid-level Solutions Engineer specializing in cloud, data analytics, and AI/LLM solutions

Chicago, IL | 3y exp
The Segal Group | Northwestern University

Akhil Nakka

Senior Software Developer specializing in legal data pipelines and backend APIs

5y exp
JPMorgan Chase | George Mason University

AK

Junior AI/ML Engineer specializing in Computer Vision and LLM/RAG systems

Boston, MA | 2y exp
AriesView | Northeastern University

SM

Mid-level Data Scientist specializing in LLMs, RAG systems, and production MLOps

Arlington, TX | 6y exp
Wells Fargo | University of Texas at Arlington

Eric Yun Hao Zhang

Intern Software Engineer specializing in AI, data pipelines, and full-stack analytics tools

Birmingham, MI | 1y exp
OneStream | University of Michigan
