Vetted Web Scraping Professionals

Pre-screened and vetted.

SS

Intern Full-Stack/Cloud Engineer specializing in AWS, DevOps automation, and backend APIs

Boston, USA2y exp
Software VelocityNortheastern University

Backend/cloud engineer with hands-on ownership of a climate data extraction pipeline (BeautifulSoup + Pandas ETL + CRON) that automated 50k+ monthly data points and removed ~20 hours/week of manual work. Also built a multi-AZ Kubernetes deployment for a Node.js system using Terraform and GitHub Actions (blue-green, rollbacks) and has Kafka/FastAPI experience from a healthcare plan management project.

View profile
AP

Mid-level Data Engineer specializing in cloud data pipelines and Snowflake

Manchester, NH3y exp
Inception Technologies, Inc.New England College

Data engineer who has owned production pipelines end-to-end, ingesting 50–100 GB/day from APIs/S3 and near-real-time Kafka into Snowflake with strong data quality gates (Great Expectations/dbt) and Airflow-based reliability (SLAs, alerting, dashboards). Also built a Snowflake-backed REST data API with caching/pagination and versioned endpoints, and designed a compliant, scalable web-scraping system with anti-bot handling and safe backfills.

View profile
Milan Sharma - Mid-level Software/Systems Engineer specializing in Python, Linux, and network testing in Richardson, TX

Milan Sharma

Screened

Mid-level Software/Systems Engineer specializing in Python, Linux, and network testing

Richardson, TX9y exp
EXFOArizona State University

Entrepreneurial product builder who has shipped two live App Store apps (Pixo content-based product marketing platform and Clutch AI dating reply helper). Also helped build a real-estate seller platform end-to-end, using AI matching to find buyers and contributing to onboarding nearly paying users and generating active MRR.

View profile
Srinivasan Gomadam Ramesh - Mid-level AI/Data Engineer specializing in agentic AI and data platforms in Redmond, WA

Mid-level AI/Data Engineer specializing in agentic AI and data platforms

Redmond, WA7y exp
Quadrant TechnologiesUniversity of Texas at Dallas

AI/LLM engineer who built a production resume-parsing and candidate-matching platform at Quadrant Technologies, combining agentic LangChain workflows, VLM-based document template extraction (~85% accuracy), and a hybrid RAG backend for resume-to-JD search. Notably integrated automated LLM evals and metric-based CI/CD quality gates to catch silent prompt/model regressions, and led a 3-person team across frontend/backend/testing.

View profile
Saswata Deb - Mid Backend Engineer specializing in AI systems and LLM infrastructure in India

Saswata Deb

Screened

Mid Backend Engineer specializing in AI systems and LLM infrastructure

India4y exp
SentisumHeritage Institute of Technology

Early-to-growth-stage B2B SaaS engineer from Sentisum who combined Python backend, data pipeline, and applied AI work with direct customer-facing product input. Particularly compelling for startup roles: they owned systems end-to-end, migrated transcription infrastructure to cut costs by ~93%, and built scalable async export and data-processing workflows over large enterprise conversation datasets.

View profile
JJ

Mid-level Data Engineer specializing in cloud data platforms and real-time pipelines

Denton, TX5y exp
Real DynamicsUniversity of North Texas

Data engineer who has owned production pipelines end-to-end—from Kafka/Airflow ingestion through SQL/Python validation and dbt transformations into Redshift/BI. Also built and operated a large-scale distributed web scraping platform (50–100 sites daily, ~5–10M records/day) with Kubernetes, Kafka queues, robust retries/DLQ, anti-bot measures, and backfill-safe raw HTML storage.

View profile
Vikram Sandigaru - Mid-level AI Engineer specializing in AI agents, RAG pipelines, and LLM evaluation in Boston, US

Mid-level AI Engineer specializing in AI agents, RAG pipelines, and LLM evaluation

Boston, US3y exp
FounderWayNortheastern University

Built and shipped production LLM systems at Founderbay, including a low-latency voice agent and a graph-based multi-agent research assistant. Strong focus on reliability in real workflows—hybrid SERP + full-site scraping RAG, grounding guardrails, validation checkpoints, and transcript-driven evaluation—plus performance tuning with async FastAPI, Redis caching, and containerization. Also partnered with a non-technical ops lead to automate post-call follow-ups via call summarization, field extraction, and tool-triggered actions.

View profile
Ryan Perera - Junior Front-End Developer specializing in React and accessible UI in Uxbridge, Canada

Ryan Perera

Screened

Junior Front-End Developer specializing in React and accessible UI

Uxbridge, Canada2y exp
ModallMcMaster University

Frontend engineer who led end-to-end architecture for a warehouse management platform, emphasizing reusable domain-based React components and API-driven performance at scale (including barcode-scanning workflows). Also delivered a production-ready React Native iOS networking app MVP in ~5 weeks and built a data-driven React+TypeScript dashboard for collectible card market decisioning.

View profile
Yash Amre - Intern Data Scientist specializing in machine learning and NLP in California, USA

Yash Amre

Screened

Intern Data Scientist specializing in machine learning and NLP

California, USA1y exp
LexTrack AIUniversity of Colorado Boulder

Analytics-focused early-career candidate with internship experience owning reporting and system performance analysis projects end to end. They combine SQL data preparation, Python automation, and dashboard delivery with measurable impact, including roughly 50% less manual reporting and about 20% better forecast accuracy.

View profile
VK

Mid-level Full-Stack Engineer specializing in AI applications and enterprise SaaS

Remote, USA4y exp
AIDMIndiana State University

AI-focused software engineer who has built production CRM intelligence features including audio transcription, summarization, and action-item extraction, plus a multi-agent LLM/NLU pipeline using Supabase, Node.js, RabbitMQ, and CloudWatch. Stands out for a disciplined approach to AI-assisted coding: treating AI like a junior developer, rigorously testing outputs, and refining prompts to prevent hallucinations in real business workflows like resume screening.

View profile
SK

Sparsh Kapoor

Screened

Intern Full-Stack Engineer specializing in AI and systems

Philadelphia, PA1y exp
WorkMerkPenn State University

Builder of practical AI-backed products across developer tooling, travel search, defense, and healthcare-style workflows. They shipped an MCP/FastAPI/Gemma context-compaction system that cut token usage by about 80%, built a flight-price AI layer that validates LLM output against live search data, and helped shape a visionOS command center for a military air wing.

View profile
HG

Mid-level AI Prompt Engineer specializing in agentic AI and automation

Chicago, IL4y exp
The Aspen GroupIllinois Institute of Technology

Built GRETA, a full-stack multi-agent AI platform for SEO content analysis and blog-writing support, combining React/TypeScript, serverless GCP Cloud Run workflows, and LLM/tool orchestration at scale. The system reportedly reduced manual analysis by 60%, and the candidate shows strong hands-on experience shipping AI products in ambiguous environments and refining them through internal user feedback.

View profile
NK

Junior Full-Stack Software Engineer specializing in automation and web development

Oak Lawn, IL3y exp
PCs for PeopleUniversity of Illinois Chicago

Built Meet.AI end-to-end and made concrete architecture/performance decisions (RPC with type-safe integration; SSR + query prefetching for instant data display). Also created a Python tool at Abbott to resynchronize Ansible inventories and eliminate manual intervention by scheduling it in a Jenkins pipeline; has hands-on Docker/microservices experience including serving a pretrained LLM.

View profile
VY

vivek y

Screened

Junior Software Engineer specializing in full-stack development and machine learning

Tallahassee, FL1y exp
Florida State UniversityFlorida State University

Built a production Apple-focused LLM Q&A bot that answers user issues using similar past discussion records, including large-scale scraping and cleaning of thousands of forum threads. Used BeautifulSoup + Playwright for static/dynamic extraction, PySpark + NLP for preprocessing, and LangChain RAG with a custom response-likeliness metric to evaluate performance.

View profile
AV

Junior AI Engineer & Full-Stack Developer specializing in AI agents and RAG systems

Hyderabad, India2y exp
MavenwitStevens Institute of Technology

Full-stack TypeScript/React/Next.js builder who created an end-to-end customer-facing product (AI Job Master) that generates personalized outreach from resumes and job descriptions. Demonstrates strong product + engineering ownership with rapid MVP iteration, instrumentation-driven prioritization, and pragmatic reliability patterns (microservices, queues, correlation IDs, retries) while tackling a key AI challenge: user trust and output consistency.

View profile
AR

Mid-level Python Backend Developer specializing in APIs, automation, and data pipelines

New Jersey, USA4y exp
Inspira FinancialMontclair State University

Backend Python engineer with end-to-end ownership of secure financial data systems integrating banking/credit/payment platforms, including automated ingestion and reconciliation of large financial statements. Built modular Dockerized Django REST services with pandas-driven validation/normalization and Postgres/Mongo persistence, and supported a phased migration from legacy VM services to AWS containers with stateless refactors and parallel-run integrity checks (run IDs/checksums). Works closely with platform teams on GitOps/CI readiness and deployment coordination (e.g., ArgoCD-managed sync policies).

View profile
Sai Harsha Kurapati - Mid-level Backend Engineer specializing in distributed systems and industrial IoT in Indianapolis, IN

Mid-level Backend Engineer specializing in distributed systems and industrial IoT

Indianapolis, IN4y exp
Purdue UniversityPurdue University Indianapolis

Backend/Python engineer focused on real-time sensor/IoT analytics: built dashboards and a high-throughput ingestion pipeline (MQTT -> Python worker -> TimescaleDB) with buffering, batch inserts, and validation. Strong Kubernetes + GitOps practitioner (Dockerized microservices, HPA, probes, ArgoCD) who has handled production incidents like CrashLoopBackOff under peak load and supported an on-prem analytics migration to AWS using shadow traffic and rollback plans.

View profile
Aneri Patel - Junior Machine Learning Engineer specializing in LLM fine-tuning and semantic retrieval in Washington, D.C.

Aneri Patel

Screened

Junior Machine Learning Engineer specializing in LLM fine-tuning and semantic retrieval

Washington, D.C.2y exp
Enquire AI, Inc.George Washington University

Backend engineer with legal-tech and AI workflow experience: built JurisAI, an end-to-end legal research system using OCR + embeddings + Pinecone vector search to deliver citation-grounded LLM answers with safe failure modes (~90% recall@K). Also led a GW Law metadata migration into Caspio with batch validation and parallel rollout, and has strong FastAPI/GCP production reliability and observability practices.

View profile
Anita Bhagashetti - Mid-Level Software Engineer specializing in distributed systems and cloud microservices

Mid-Level Software Engineer specializing in distributed systems and cloud microservices

3y exp
ZeOmegaBinghamton University

Built and productionized a RAG-based semantic search system for video-derived data, focusing on measurable success metrics (p95 latency, reliability, cost/request) and strong observability (prompt versions, retrieved docs, tool calls, token usage). Experienced in diagnosing real-time issues in LLM/agentic workflows and in supporting go-to-market efforts through tailored technical demos, rapid POCs, and post-close onboarding.

View profile
HC

Mid-level Data Engineer specializing in cloud data platforms and ETL automation

Atlanta, GA4y exp
Blue Diamond TechnologiesUniversity of Texas at Arlington

Data engineer who has owned high-volume production pipelines end-to-end (200–300 GB/day) on AWS, implementing strong data quality/observability and achieving 99.9% reliability while cutting data issues ~33%. Also built a large-scale external data collection system ingesting millions of records/day with anti-bot/rate-limit handling and backfill tooling, and shipped a versioned REST service exposing curated Snowflake data to downstream teams.

View profile
AS

Aditya Sharma

Screened

Intern Machine Learning Engineer specializing in deep learning and LLM systems

Tempe, AZ0y exp
Arizona State UniversityArizona State University

Built and shipped a personal LLM-powered news aggregation platform (Clear Brief) that scrapes ~200 articles per cycle, clusters them into ~15–30 consolidated stories, and supports on-demand deep dives via a Next.js API route. Emphasizes production-minded reliability (token/cost controls, timeouts, graceful frontend degradation) and database-backed orchestration using SQLite with retry + exponential backoff for burst processing.

View profile
Jonathan Lee - Director-level Engineering Leader specializing in agentic AI systems in Austin, TX

Jonathan Lee

Screened

Director-level Engineering Leader specializing in agentic AI systems

Austin, TX19y exp
DK LawUniversity of Texas at Austin

Engineering leader and hands-on architect who built a team from scratch and owned everything from architecture and security to DevOps and deployment. Has led sophisticated AI/agentic systems in legal-tech and operations, including demand-letter automation, news/content generation, and human-in-the-loop fax routing, while also guiding major infrastructure and enterprise telephony transitions.

View profile
VK

Vaibhav Kamat

Screened

Senior Software Engineer specializing in AI/ML systems and edge inference

Santa Clara, CA9y exp
ExpederaArizona State University

Software engineer at Expedera working at the intersection of deep learning compilers and neural processor hardware, focused on making customer models run efficiently across custom HW architectures. Particularly notable for building a zero-to-one multi-chip scheduler in a Python + C++ stack and for translating complex model optimization problems into customer-facing performance gains for hardware deployments, including autonomous driving use cases.

View profile

Need someone specific?

AI Search