Vetted Batch Processing Professionals

Pre-screened and vetted.

SG

Mid-level Data Engineer specializing in streaming and cloud data platforms for financial services

Edison, NJ3y exp
Morgan StanleyPace University

Data engineering-focused candidate (internship/project experience) who built end-to-end pipelines processing a few million transactional records/day for fraud detection and reporting, using Airflow, Python/SQL, and PySpark with strong emphasis on data quality gates, idempotency, and monitoring. Also implemented an external web/API data collection system with anti-bot tactics and schema-change quarantine, and shipped a versioned Flask API to serve curated warehouse data.

View profile
Sai Charan Kolla - Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps on AWS in TX, USA

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps on AWS

TX, USA5y exp
BlackRockTexas A&M University-Kingsville

LLM engineer who built a production document intelligence/RAG pipeline to extract structured data from thousands of unstructured PDFs, cutting manual review time by 60%. Experienced with LangChain and Airflow orchestration plus rigorous evaluation (labeled datasets, prompt testing, HITL review, monitoring) to improve accuracy and reduce hallucinations while partnering closely with non-technical operations stakeholders.

View profile
John Hoffman - Senior Data Engineer specializing in Databricks, Spark, and AWS for government healthcare data systems in Windsor Mill, MD

John Hoffman

Screened

Senior Data Engineer specializing in Databricks, Spark, and AWS for government healthcare data systems

Windsor Mill, MD12y exp
GDITUniversity of Virginia

Python/AWS engineer focused on batch-processing and data workflows, including building reusable S3/boto3 utilities with reliability features and IAM-based auth. Has led low-risk legacy modernizations using parity testing plus a month of parallel production runs, and has owned production issues end-to-end (including fixing a client-side Excel macro) while contributing to significant AWS cost reductions (~$10k/month).

View profile
AC

Annie Chang

Screened

Senior Full-Stack/Backend Software Engineer specializing in cloud-native automation and microservices

San Francisco, CA9y exp
Booz Allen HamiltonUC Davis

Backend/data engineer with strong AWS production experience across containers (ECS) and serverless (API Gateway/Lambda/SQS), plus Glue-based ETL to Parquet for Athena/Redshift. Demonstrates hands-on reliability and security depth (Cognito OAuth2/JWT with JWKS rotation, idempotency/DLQs, monitoring) and measurable performance wins (Redis caching + query tuning), along with legacy-to-services modernization using parallel-run parity and feature-flagged cutovers.

View profile
RP

Raj Patel

Screened

Mid-level Full-Stack Software Engineer specializing in cloud-native web applications

Canton, Michigan3y exp
DiscoverUniversity of Michigan

Backend engineer with hands-on experience scaling a Python/Flask incident-logging platform processing thousands of daily logs. Strong in performance tuning (PostgreSQL/SQLAlchemy query optimization, partitioning, summary tables) and reliability patterns (Redis caching, Celery background workers, Docker + Jenkins CI/CD), with some multi-tenant data isolation experience via separate DBs/schemas.

View profile
SP

Junior Software Engineer specializing in full-stack and AI/LLM applications

Santa Cruz, CA1y exp
RoboGrade.ioUC Santa Cruz

Founder/builder of an EdTech startup (robograde.io) who personally conducted on-site classroom discovery with teachers and rapidly iterated the product based on real-world feedback. Implemented a Canvas LMS integration and refined it through weeks of in-person testing, and handled a live production grading failure by quickly debugging and deploying a fix, then adding fault-tolerant/backup API design.

View profile
RR

Mid-level Full-Stack Developer specializing in cloud-native microservices and event-driven systems

4y exp
Molina HealthcareUniversity at Buffalo

Software engineer with experience at Molina Healthcare and Target, owning production features end-to-end across backend, data pipelines, and UI. Built an event-driven claims validation system (Python/Java/Spring Boot/Kafka) with strong observability, and shipped embeddings-based semantic product search with evaluation loops (CTR/top-k + human review) and guardrails like keyword-search fallback.

View profile
Ruthvik Bacha - Mid-level Data Engineer specializing in financial data pipelines and reliability in North Carolina, USA

Ruthvik Bacha

Screened

Mid-level Data Engineer specializing in financial data pipelines and reliability

North Carolina, USA7y exp
Wells FargoUniversity of South Florida

Systems/robotics-oriented software engineer focused on real-time orchestration and reliability: built a central control layer coordinating multiple concurrent agents with safe state machines, failure isolation, and recovery. Has hands-on ROS/ROS 2 integration experience in simulation (DDS/QoS, lifecycle, nodes in Python/C++) and emphasizes observability (structured JSON logs, correlation IDs) and low-latency control-loop performance under load.

View profile
Jen-Hung Chang - Mid-level Software Engineer specializing in cloud infrastructure and distributed systems in Hsinchu, Taiwan

Mid-level Software Engineer specializing in cloud infrastructure and distributed systems

Hsinchu, Taiwan4y exp
TSMCDuke University

Backend/platform engineer who built an AI RAG system on FastAPI/Postgres/AWS with 10+ microservices, vector search optimization (ANN + two-stage re-ranking), and GitOps-driven CI/CD that cut deploy time from hours to minutes. Also deployed Java identity services on Kubernetes at TSMC for 200K+ users using ArgoCD/Azure Pipelines, and built a reliable real-time IoT pipeline (MQTT/Node/MongoDB) with strong consistency controls.

View profile
DM

Mid-level Data Engineer specializing in real-time analytics and regulated domains

NC, USA5y exp
JPMorgan ChaseSaint Louis University

Data platform engineer focused on large-scale, real-time fraud systems, with hands-on ownership of streaming architectures using Kafka, Spark, Snowflake, and Databricks. Stands out for combining performance tuning and platform automation with LLM/RAG-based enrichment, delivering measurable gains in latency, fraud accuracy, false positives, and analyst decision speed.

View profile
YL

Yunjie Liu

Screened

Junior Software Engineer specializing in bioinformatics and full-stack development

Remote3y exp
Baylor GeneticsCornell University

Built and stabilized production data pipelines in clinical genomics, including integrating a qPCR step into Baylor Genetics' workflow with a focus on reliability, turnaround time, and reducing manual intervention. Also has hands-on LLM production experience, creating a Python/OpenAI-based translation evaluation pipeline that reduced manual review time by 70% and improved scoring consistency.

View profile
KS

Mid-level Software Engineer specializing in FinTech and distributed systems

New York, NY4y exp
PayPalSt. Francis College

Backend engineer with end-to-end ownership experience on a real-time AI-driven payment authorization/orchestration platform at PayPal. They describe strong fintech systems depth across Java/Spring/Kafka microservices, database and latency optimization, and reliability engineering, with concrete impact including 35% fewer processing failures, latency reduced from 420ms to 140ms, 1,200+ weekly manual reviews eliminated, and 40% faster incident response.

View profile
SB

Mid-level Data Engineer specializing in scalable pipelines, Spark, and cloud data warehousing

Boston, USA3y exp
Fidelity InvestmentsNortheastern University

Backend/data platform engineer who recently owned an end-to-end large-scale financial data platform delivering real-time decision support for finance and operations. Has hands-on experience modernizing legacy batch pipelines into AWS cloud-native ELT with parallel-run cutovers, strong data quality controls (dbt-style tests, reconciliation), and measurable improvements in runtime, cost, and SLA compliance. Also builds scalable, secure FastAPI microservices using Docker, ALB-based horizontal scaling, Redis caching, and managed auth with Cognito/Supabase plus Postgres RLS.

View profile
SS

Junior Software Engineer specializing in ML, distributed systems, and LLM applications

Austin, TX1y exp
ZondaUC San Diego

Interned at Zonda where he built an AI-driven semantic search solution over ~280M housing/builder records. Iterated from local LLMs via llama.cpp quantization to a vector-embedding retrieval system, then boosted semantic accuracy with a custom spaCy NER layer and re-ranking, optimizing for latency through precomputation. Collaborated with economics-focused stakeholders to reduce manual document/paperwork time by enabling natural-language search over internal data.

View profile
SA

Mid-level Full-Stack .NET Developer specializing in cloud-native microservices

Dallas, TX6y exp
T-MobileSouthern Arkansas University

Full-stack engineer with primary depth in .NET Core and Python who has built and deployed end-to-end AWS applications (Lambda, API Gateway, S3, CloudFront) and supported them in production. Experienced in scaling large, data-driven workloads using queues/background workers, batching, and database tuning, with strong focus on API contracts, observability, and resilience patterns; also has hands-on React/TypeScript and some Spring Boot exposure.

View profile
DK

Senior Data Engineer specializing in Azure Lakehouse, Databricks/Spark, and Snowflake

Richardson, TX6y exp
PwCUniversity of Central Missouri

Data engineer/platform builder with experience across PwC and Liberty Mutual delivering high-volume, production-grade pipelines and real-time data services. Has owned end-to-end streaming + batch architectures on AWS and Azure, including web scraping systems, with quantified reliability gains (99.9% availability, 90%+ error reduction, 30% latency reduction) and strong observability/CI-CD practices.

View profile
AP

Mid-level Machine Learning Engineer specializing in fraud detection and LLM applications

Charlotte, NC5y exp
Bank of AmericaUniversity of North Carolina at Charlotte

Unreal Engine UI engineer focused on scalable, production-ready UI architecture (C++/Slate/UMG/CommonUI) with strong designer enablement via decoupled, interface-driven patterns and MVVM. Demonstrated measurable performance wins: replaced 200+ per-frame Blueprint bindings to cut UI prepass/paint from 4.2ms to 0.5ms and reduced VRAM by ~120MB using texture streaming proxies.

View profile
JH

Jaylon Holt

Screened

Senior Cybersecurity Engineer specializing in cloud and enterprise security tooling

Remote8y exp
DiscoverUniversity of North Carolina at Charlotte

Infrastructure/operations engineer with enterprise-scale observability ownership across Linux plus exposure to Windows/AIX and AWS SaaS. Has led DR exercises and real incidents involving cross–data center traffic failover, with hands-on firewall policy management and automation (Chef/Ansible) for agent deployment and patching; experience includes Bank of America and Discover Financial Services.

View profile
Harshavardhan Reddy - Mid-level AI/ML Data Scientist specializing in NLP, computer vision, and risk analytics in Albany, NY

Mid-level AI/ML Data Scientist specializing in NLP, computer vision, and risk analytics

Albany, NY5y exp
Capital OnePace University

ML/AI engineer with Capital One experience building production-grade customer segmentation and fraud detection systems combining NLP (transformers) and anomaly detection. Strong MLOps and orchestration background (PySpark ETL, MLflow, Airflow, Docker/Kubernetes, Azure ML) with real-time monitoring/alerting and performance optimizations like quantization and caching, plus proven ability to deliver business-facing insights through Power BI/Tableau for marketing stakeholders.

View profile
Sankalp Tiwari - Mid-Level Software Engineer specializing in backend microservices and FinTech data pipelines in New York, NY

Mid-Level Software Engineer specializing in backend microservices and FinTech data pipelines

New York, NY4y exp
Goldman SachsSan José State University

Backend engineer at Goldman Sachs who built LLM-powered reconciliation/reporting services and high-throughput Kafka pipelines (8M+ events/day). Strong in production-grade Python/FastAPI microservices on Kubernetes with GitOps-style CI/CD, plus experience migrating legacy reporting/settlement services onto an internal Kubernetes platform using shadow deployments and gradual cutovers.

View profile
AS

Mid-level Software Engineer specializing in backend systems and AI automation

San Francisco, CA5y exp
For Women’s HealthUC Santa Cruz

Built a production Python microservice around Grafana Loki focused on reliability, with checkpointing, idempotency, replay tooling, tracing, and alerting to prevent data loss and silent lag. Also has hands-on experience hardening brittle Playwright automations against dynamic UIs, auth expiry, rate limits, MFA, and bot-detection constraints, plus turning tribal-knowledge SOPs into explicit state-machine-driven workflows.

View profile
YY

Yinghai Yu

Screened

Mid-level Data Engineer specializing in cloud data platforms and AI/ML pipelines

San Mateo, CA6y exp
Bubbles and BooksGeorgia Tech

Data-engineering-oriented candidate with hands-on experience building an agentic AI product and operational automation workflows. They described automating inventory-to-ERP discrepancy reconciliation with anomaly detection and daily reporting, and also have practical scraping/automation experience dealing with Cloudflare-protected sites using Selenium and Puppeteer.

View profile
RA

Ravali Aleti

Screened

Senior Python Developer specializing in AWS backend APIs and enterprise authentication

Philadelphia, US7y exp
ComcastUniversity of Bridgeport

Backend/data engineer focused on AWS-based Python services and data pipelines: built a Django/DRF user management/auth platform deployed with serverless AWS (Lambda/API Gateway) and event-driven workflows (Step Functions/EventBridge), with CloudFormation + Jenkins for automated delivery and Secrets Manager/Parameter Store for secure config. Also delivered AWS Glue ETL from S3 to RDS with schema evolution controls and incident-driven improvements, and has demonstrated measurable SQL tuning impact (minutes-to-seconds).

View profile

Need someone specific?

AI Search