Vetted Web Scraping Professionals

Pre-screened and vetted.

EX

Junior Backend Engineer specializing in cloud-native systems and observability

Sunnyvale, CA2y exp
WalmartNYU
View profile
GM

Senior RPA & Automation Architect specializing in Agentic AI and enterprise hyperautomation

São Paulo, Brazil8y exp
Boston ScientificUniversity of São Paulo
View profile
SW

Junior Full-Stack/Cloud Engineer specializing in AI and data-driven applications

Los Angeles, CA1y exp
Zage Business of Energy InitiativeUSC
View profile
RG

Mid-level Backend/Data Engineer specializing in legal data pipelines and APIs

5y exp
WalmartUniversity of Texas at Arlington
View profile
SD

Sanya Dod

Screened

Junior Software Engineer specializing in AI/ML and verification

West Lafayette, IN2y exp
WISE Lab, Purdue UniversityPurdue University

Embedded/real-time robotics-style engineer with hands-on STM32 development, sensor integration, and low-level drivers, focused on deterministic control behavior. Demonstrated systematic debugging of jitter/latency by instrumenting the sensing-to-actuation pipeline and eliminating blocking via interrupts, hardware timers, and DMA; also designs asynchronous, message-based interfaces for distributed real-time components. Familiar with ROS/ROS2 concepts (nodes/topics/callbacks) though not yet deployed a full production ROS system.

View profile
SK

Mid-level Data Scientist / AI-ML Engineer specializing in Generative AI and LLM applications

Dallas, TX5y exp
Baylor Scott & WhiteUniversity of North Texas

Built a production GenAI-powered analytics assistant to reduce reliance on data analysts by enabling natural-language Q&A over Databricks/Power BI dashboards, backed by vector search (Pinecone/Milvus) and a Neo4j knowledge graph, including multimodal support via OpenAI Vision. Demonstrates strong real-world LLM reliability engineering with strict RAG, LangGraph multi-step verification, and Guardrails/custom validators, plus broad orchestration and production monitoring experience (Airflow, ADF, Step Functions, Kubernetes, Prometheus/CloudWatch).

View profile
BK

Bharath kumar

Screened

Director-level AI & Data Science leader specializing in GenAI, LLMs, and MLOps

Draper, UT12y exp
ThorneBharathiar University

ML/NLP engineer currently working in NYC on a system that connects complex unstructured data sources to deliver personalized insights, using embeddings + vector DB retrieval and a RAG architecture (LangChain, Pinecone/OpenSearch). Strong focus on production constraints—especially low-latency retrieval—using FAISS/ANN, PCA, index partitioning, and Redis caching, plus PEFT fine-tuning (LoRA/QLoRA) and KPI/SLA-driven promotion to production.

View profile
AP

Mid-level Data Engineer specializing in cloud data pipelines and enterprise data platforms

4y exp
ConnectiveRxUniversity of Pennsylvania

Data engineer/backend engineer who owns large-scale, real-time event pipelines on AWS end-to-end, including a petabyte-scale CDC ingestion flow from multiple Postgres DBs into Redshift. Re-architected a legacy DynamoDB+S3 approach into a Delta Lake + DuckDB/PyArrow-compatible design, improving performance dramatically (e.g., ~600s to ~10s for 1k records) and increasing reliability at high file volumes.

View profile
RA

Junior Software Engineer specializing in distributed systems and cloud-native backend services

Boston, MA2y exp
BoroughUniversity of Michigan

Founding engineer at a civic-tech startup (Barrow) who built and operated a Next.js/TypeScript product with map-based public reporting, including clustering and dynamic geospatial loading to improve UX and performance. Also implemented a location-aware RAG chatbot using Pinecone, web scraping/transcription, caching, and fallback web search, and owned post-launch observability plus scaling decisions (load balancing/horizontal scaling) based on API usage patterns.

View profile
SG

Mid-level Data Engineer specializing in streaming and cloud data platforms for financial services

Edison, NJ3y exp
Morgan StanleyPace University

Data engineering-focused candidate (internship/project experience) who built end-to-end pipelines processing a few million transactional records/day for fraud detection and reporting, using Airflow, Python/SQL, and PySpark with strong emphasis on data quality gates, idempotency, and monitoring. Also implemented an external web/API data collection system with anti-bot tactics and schema-change quarantine, and shipped a versioned Flask API to serve curated warehouse data.

View profile
AM

Senior Software Engineer specializing in backend microservices and distributed systems

United States7y exp
WalmartCleveland State University

Senior software engineer (5+ years) from Walmart Global Tech who owned and operated high-scale supplier inventory submission systems, including a microservice handling submissions up to 500k items and a data platform processing ~10TB/day. Strong in AWS/Kubernetes (EKS), Kafka/Spark streaming + batch pipelines, and production operations (on-call, metrics/alerting), with demonstrated performance wins (30% faster responses, 50% faster processing).

View profile
Sragvi Vadali - Junior Software Engineer specializing in AI/ML and real-time systems

Sragvi Vadali

Screened

Junior Software Engineer specializing in AI/ML and real-time systems

2y exp
University of Southern CaliforniaUSC

Backend/AI engineer who built a real-time vector database system for high-frequency financial data using Kafka/Flink on Kubernetes, achieving sub-100ms similarity search at 10k+ concurrent load and resolving tricky duplication issues with idempotency/versioning. Also shipped an end-to-end LLM-based travel itinerary feature (profiling + prompt workflows + APIs) with a focus on quality consistency and low latency.

View profile
TS

Mid-Level Full-Stack .NET Developer specializing in cloud microservices and data pipelines

6y exp
Elevance HealthMissouri State University

Backend/data engineer with experience at Citi and Elevance Health, building end-to-end pipelines and data services in regulated, high-volume environments. They combine Python, SQL, .NET, Azure Functions, and strong observability/reliability patterns to improve processing speed, reduce manual effort, and maintain high uptime across financial and healthcare data platforms.

View profile
Pahuldeep Singh - Senior Full-Stack Developer specializing in scalable web platforms and automation in Remote

Senior Full-Stack Developer specializing in scalable web platforms and automation

Remote6y exp
CalianGeorgia Tech

Backend/full-stack engineer focused on TypeScript/Node.js systems, with hands-on ownership of a real-time telemetry and dashboard platform built on Kafka, Debezium, PostgreSQL, and GraphQL. Stands out for combining event-driven architecture, correctness/idempotency patterns, strong observability, multi-tenant security, and developer-friendly API design in production environments.

View profile
JL

Director-level Full-Stack Engineer specializing in web platforms and APIs

New York, NY26y exp
8 Bit Bricks LLCNJIT

Built Bargain Bricks end-to-end as a solo creator, handling product ideation, design, backend, APIs, website, and native iOS/Android apps. They actively maintain and iterate on the product, which has over 1,000 downloads, and have improved conversion through UI changes that surfaced the best deal above the fold.

View profile
Anirban Ghosh - Mid-level Machine Learning Engineer specializing in data science and cloud systems in Seattle, WA

Anirban Ghosh

Screened

Mid-level Machine Learning Engineer specializing in data science and cloud systems

Seattle, WA4y exp
AmazonStony Brook University

ML engineer who independently pitched and built a recommendation engine at Danske Bank in a legacy fintech environment, creating compliant data pipelines and deployment infrastructure from scratch and delivering a 62% engagement lift with 70%+ advisor adoption. Also worked at AWS on classification and GenAI-powered reporting systems, with strengths spanning production ML, platform setup, monitoring, and research-to-production optimization.

View profile
SS

Sutej Singh

Screened

Entry-Level Software Engineer specializing in ATM platforms and backend modernization

San Francisco Bay Area, CA1y exp
Wells FargoUSC

Software engineer with hands-on embedded/robotics coursework experience (Arduino sensor integration and input handling built from scratch without external libraries) and strong DevOps/engineering productivity impact at work, including leading a CI/CD enhancement that runs only impacted tests to catch issues before PR approval.

View profile
UK

Mid-level Generative AI Engineer specializing in LLM agents and RAG systems

4y exp
Capital OneLindsey Wilson College

Built and deployed a production LLM/RAG knowledge assistant integrating internal docs, wikis, and ticket histories to reduce tribal-knowledge dependency and repetitive questions. Emphasizes reliability via grounding + a validation layer, and achieved major latency gains (>50%) through vector index optimization, caching, quantization, and selective re-validation. Comfortable orchestrating end-to-end LLM/data workflows with Airflow, Prefect, and Dagster, including monitoring and alerting.

View profile
EH

Ebtesam Haque

Screened

Mid-level AI Researcher specializing in LLMs, developer tools, and human-centered AI

McLean, VA4y exp
George Mason UniversityGeorge Mason University

Research-focused AI engineer who built an agentic pipeline to automatically extract Sphinx-based API documentation/changelogs and generate synthetic tasks for a dynamic LLM code benchmark targeting real-world API evolution and deprecations. Experienced with multi-agent orchestration (AutoGen, LangChain, CrewAI) and rigorous evaluation methods, and has prior multi-agent work from a Microsoft Research internship.

View profile
YL

Yu Liu

Screened

Senior Big Data Engineer specializing in AML/KYC compliance and cloud data platforms

New York, NY17y exp
CitigroupUniversity of Missouri

Data engineer with experience delivering an end-to-end pipeline handling ~3.5TB in a star-schema setup (fact + dimensions) and producing business-facing tables in Hive/Spark. Identified and resolved UAT-reported duplicate issues caused by joins through root-cause analysis, and also built automation to run Spark SQL metrics on weekly/monthly/quarterly cadences and distribute results to users.

View profile
BS

Senior Data Engineer specializing in cloud lakehouse platforms and streaming analytics

Pittsburgh, PA8y exp
First National BankTexas A&M University-Corpus Christi

Data engineer focused on fraud and banking analytics who has owned end-to-end batch + streaming pipelines at very large scale (hundreds of millions of records/day). Built robust data quality/observability layers (schema validation, anomaly detection, alerting) and delivered low-latency serving via AWS Lambda/API Gateway with DynamoDB + Redis, plus external data ingestion/scraping pipelines orchestrated in Airflow with anti-bot protections.

View profile
Bhanu Prakash Reddy Dakilli - Mid-level Data Engineer specializing in Azure ETL/ELT and data warehousing in Framingham, MA

Mid-level Data Engineer specializing in Azure ETL/ELT and data warehousing

Framingham, MA4y exp
Bank of AmericaNew England College

Data engineer who has owned end-to-end production pipelines for customer transaction data (~2–5 GB/day) using Python/PySpark/SQL and Airflow, delivering major reliability and speed gains (70% faster reporting; 60–70% fewer data issues). Also built a daily external web-scraping system with anti-bot handling and safe, idempotent Airflow-driven backfills, plus a Python data API optimized with indexing/caching and tested for correctness.

View profile
Yash Priyadarshi - Junior Software Engineer specializing in cloud infrastructure and distributed systems in Bengaluru, India

Junior Software Engineer specializing in cloud infrastructure and distributed systems

Bengaluru, India2y exp
EricssonPenn State University

Backend/distributed-systems engineer who built a Golang distributed key-value store on AWS using Multi-Paxos, WAL, and non-blocking gRPC replication (cutting write latency ~40%) and proactively addressed tricky failure modes like leader-election livelock. Also developed a Python/Kubernetes cost-optimization scaling engine deployed with Helm/Terraform, delivering ~$40K annual savings while sustaining 99.99% uptime, and drives contract-first API development (OpenAPI/Swagger) to speed frontend integration.

View profile

Need someone specific?

AI Search