Vetted Data Ingestion Professionals

Pre-screened and vetted.

MR

Senior Software Engineer specializing in cloud-native microservices (AWS, Java, Kafka)

Dallas, TX | 4y exp
Accenture | University of Houston

Backend engineer with hands-on experience modernizing high-volume transactional systems by decomposing monoliths into Spring Boot microservices on AWS, using Kafka for async workflows and Redis/SQL tuning for latency. Has built Python/FastAPI services with strong API contracts and production-grade security (OAuth2/JWT, RBAC, row-level security), and proactively hardened payment flows against race conditions and double-charging via idempotency.

Surya Pavan

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and LLM applications

Baltimore, MD | 5y exp
Acer | California State University, Northridge

GenAI engineer who has deployed production LLM/RAG chatbots for internal document search, focusing on reliability (hallucination reduction via prompt guardrails + retrieval filtering) and performance (latency improvements via caching). Experienced with LangChain/LangGraph orchestration for multi-step agent workflows and iterates using monitoring/logs and benchmark-driven evaluation while partnering closely with product and business teams.

VM

Mid-level Machine Learning & Full-Stack Engineer specializing in GenAI platforms

San Francisco, CA | 5y exp
WellDhan | Northeastern University

LLM/agent builder who has shipped production AI systems in the wellness space, including an LLM-powered food tracking product used by 5000+ users and a voice/call-routing onboarding workflow using LangGraph/LangChain with LiveKit and Twilio. Strong focus on practical reliability work: latency reduction, retrieval/embedding tuning, and CI-driven evaluation with simulations and metrics.

SK

Mid-level Data Scientist specializing in real-time fraud detection and MLOps

San Francisco, CA | 5y exp
Charles Schwab | CUNY Graduate Center

ML/NLP engineer with experience at Charles Schwab building an NLP + graph (Neo4j) entity-resolution system to unify fragmented user/device/transaction data and improve downstream model quality and analyst querying. Has applied embeddings (SentenceTransformers + FAISS) with domain fine-tuning to boost hard-case matching recall by ~12% while maintaining precision, and has a track record of hardening scalable Python/Spark pipelines and productionizing fraud models via A/B tests and shadow-mode monitoring.

AI

Intern Software Engineer specializing in AI systems and backend infrastructure

West Lafayette, IN | 2y exp
Acuvity AI | Purdue University

Full-stack engineer with early-stage startup experience who shipped and owned production Next.js (App Router + TypeScript) features end-to-end, including auth-aware APIs, caching, and post-launch monitoring/iteration. Demonstrates strong performance and reliability chops across React UX optimization, Postgres analytics modeling/query tuning (validated via query plans), and durable ingestion workflows with retries/idempotency.

KR

Mid-Level Backend Engineer specializing in SaaS, FinTech, and AI document intelligence

San Francisco, CA | 3y exp
IntraEdge | NYU

Full-stack engineer who built an AI-driven document analysis and processing workflow end-to-end, including large-document ingestion, queued async processing, and low-latency retrieval for user-facing flows. Demonstrated practical performance tuning (moving heavy work off request path, polling, caching) and Postgres optimization validated with EXPLAIN ANALYZE, plus durable workflow resilience via retries and dead-letter queues.

VR

Mid-level Backend/AI Software Developer specializing in data pipelines for FinTech and healthcare

6y exp
TMV Investments | Wright State University

Data engineer/backend data services builder with end-to-end ownership of production pipelines for a Pfizer client, combining Python/SQL ingestion and transformation with strong data quality controls. Delivered measurable performance gains (~30% faster queries) and improved reliability through monitoring/alerting (Splunk, Prometheus/Grafana), structured logging, and incident response; also built internal REST APIs with versioning and caching and set up GitLab-based CI/CD with containerized deployments.

AG

Mid-level Data Engineer specializing in cloud ETL and real-time streaming

New York, NY | 6y exp
PNC | Rochester Institute of Technology

Data engineer focused on AWS + Spark/Databricks pipelines, including an end-to-end nightly loan-data ingestion flow (~2.2M records) from Postgres/S3 through Glue and Databricks into a DWH with layered validation and alerting. Also built real-time streaming with Kafka + Spark Structured Streaming and a master’s project streaming Reddit data for sentiment analysis under ambiguous requirements and tight budget constraints.

Ram Usarty

Screened

Mid-level Full-Stack Software Engineer specializing in cloud-native distributed systems

USA | 4y exp
Onesynergee | University of Cincinnati

Backend/platform-focused engineer who has shipped production LLM agents for messy research dataset submissions, turning manual validation into an automated, reliable ingestion pipeline. Strong on production hardening (streaming large uploads, strict schema/function-calling outputs, idempotency, RBAC) plus eval/monitoring loops that improved data quality, reduced support burden, and increased adoption.

Srilekha Jakkula

Senior Data Engineer specializing in scalable data pipelines and API-driven data services

Chicago, IL | 5y exp
Northern Trust | Northern Illinois University

Data engineer focused on building scalable, reliable end-to-end data pipelines and backend REST data services, spanning API ingestion plus batch/stream processing with Airflow, Kafka, Spark/PySpark, and SQL. Emphasizes strong data quality validation, monitoring/fault tolerance, and performance tuning for large datasets, with experience deploying in cloud environments using containerization and CI/CD.

Axel Paredes

Screened

Mid-level Business Analyst specializing in operations data and reporting

Miami, FL | 6y exp
Cole, Scott & Kissane, P.A. | Miami Dade College

Candidate has hands-on project experience in healthcare analytics, using SQL, Python, and Power BI to analyze CMS hospital readmissions and HRRP penalty risk in Florida. Their work centers on turning messy CMS flat files into reporting-ready datasets, benchmarking hospitals against national references, and surfacing financial risk through dashboards.

Franz Engel

Screened

Junior Full-Stack & ML Engineer specializing in research tooling and applied machine learning

San Diego, CA | 1y exp
University of California, Irvine | UC Irvine

Full-stack engineer and ML assistant in UC Irvine’s CS department who deployed a lab project showcase platform and integrated on-demand execution of computational projects using Docker for isolation. Also built and optimized Linux cloud/cluster test automation for research, diagnosing RAM and network sync bottlenecks, and later led development of a Python-based predictive analytics tool for musicians using probabilistic graphical models and flexible data pipelines.

Sathvik Vanja

Screened

Mid-level AI Engineer specializing in GenAI, LLM integration, and RAG pipelines

Overland Park, KS | 3y exp
HCA Healthcare | VNR Vignana Jyothi Institute of Engineering and Technology

Built and led deployment of an autonomous, self-correcting multi-agent knowledge retrieval and validation system at HCA Healthcare to reduce heavy manual research/validation in clinical/compliance documentation. Deeply focused on production reliability and cost—used LangGraph StateGraph orchestration plus ONNX/CUDA/quantization to cut GPU costs by 25%, and partnered with the Compliance VP using real-time contradiction-rate dashboards to hit a 40% automation goal without compromising compliance.

HE

Mid-level AI/ML Engineer specializing in cloud data engineering and GenAI

Florida, USA | 6y exp
LexisNexis | University of South Florida

AI/LLM engineer with production experience in legal tech: built a GPT-4 + LangChain RAG summarization system at Govpanel that reduced legal case-file review time by 50%+. Previously at LexisNexis, orchestrated end-to-end Airflow data/AI pipelines processing 5M+ legal documents daily, improving ETL runtime by 35% with robust validation, monitoring, and SLAs.

Nandini Kosgi

Screened

Mid-level AI/ML Engineer specializing in NLP, RAG systems, and real-time risk modeling

PA, USA | 4y exp
Capital One | Robert Morris University

AI/ML Engineer with 4+ years of experience (Capital One, Odin Technologies) and a master’s in Data Analytics (4.0 GPA) who has deployed LLM/RAG systems to production for compliance/risk and document review. Strong in orchestration and MLOps (Airflow, Kubernetes, MLflow, GitHub Actions) and in tackling real-world LLM constraints like latency, context limits, and data privacy, with measurable impact (20%+ manual review reduction; 33% faster release cycles).

SK

Mid-level Data Engineer specializing in cloud data platforms and real-time analytics

Saint Louis, MO | 5y exp
Cigna | Saint Louis University

Customer-facing data engineering professional who builds and deploys real-time reporting/dashboard solutions, gathering reporting and compliance requirements through direct stakeholder engagement. Experienced with Google Cloud IAM governance, secure integrations (encryption, audit logging), and fast production troubleshooting of ETL/pipeline failures with follow-on monitoring and automated recovery improvements; motivated by hands-on, travel-oriented customer work.

Archit Gangal

Screened

Senior Full-Stack Developer specializing in cloud-native microservices and AI/ML analytics

7y exp
Allstate | Colorado State University

Full-stack/backend engineer with deep insurance claims domain experience who built and operated a microservices + ETL platform (Java/Spring Boot + Python + Kafka/Databricks) processing 1M+ daily transactions. Combines production-grade reliability (99.7% uptime, zero-downtime blue/green releases, strong observability) with customer-facing UI delivery (AngularJS/React+TS dashboards and a hackathon-winning research chatbot).

Humani Korem

Screened

Mid-level Software Engineer specializing in data pipelines and backend APIs

Stamford, CT | 6y exp
Webster Bank | University of Central Missouri

Data engineer with Webster Bank experience owning end-to-end pipelines (APIs + databases) processing millions of records/day, improving data quality (25–30% fewer issues) and reliability (~99.9% successful runs). Built resilient external data ingestion/scraping systems (schema-change validation, idempotent backfills, monitoring/alerts) and shipped a FastAPI service exposing curated datasets with versioning and consistently low latency.

Somil Shah

Screened

Mid-level AI/ML Engineer specializing in generative AI, RAG platforms, and LLM agents

San Francisco, CA | 4y exp
INTERACT Animal Lab | Northeastern University

AI/LLM engineer who has shipped 10+ production applications, including InvestIQ on GCP—a production-grade RAG due-diligence engine that ethically scrapes web/PDF sources, builds a ChromaDB knowledge base, and delivers analyst-style dashboards plus a citation-backed chat copilot. Deep focus on reliability (evidence-only answers, hard citations, refusal gating), retrieval tuning, and orchestration (Airflow/Cloud Composer), plus multi-agent systems (CrewAI with 7 specialized finance agents).

Saniya Athani

Screened

Junior Software Engineer specializing in cloud-native DevOps and GenAI

San Francisco Bay Area, CA | 2y exp
IpserLab | Northeastern University

Cloud-focused engineer with hands-on experience deploying production cloud-native REST APIs on AWS using Pulumi IaC, containerization, and CI/CD, with strong emphasis on secure credential management and operational monitoring via CloudWatch. Also has IoT troubleshooting experience across edge hardware constraints and networking (TLS handshake failures), plus Python-based configurable data-processing tools and customer-facing requirements translation.

PP

Senior IT Business Systems Analyst specializing in UAT, project delivery, and regulated platforms

Frisco, TX | 12y exp
Accenture | University of Madras

Worked on a P&C insurance integration project at Accenture, using SQL to unify policy, billing, and claims data from APIs and ETL pipelines into clean reporting tables. Demonstrated hands-on experience with data quality validation, window-function-based transformations, and query performance tuning, helping business teams get a single reliable view for faster claims processing and management reporting.

TN

Junior software developer specializing in data analytics and machine learning

New York, NY | 4y exp
Stony Brook University | Stony Brook University

Entry-level software engineer who independently built an AI-powered feedback aggregation and analytics dashboard end-to-end using Cloudflare Workers, D1, and React. Stands out for combining serverless backend design, LLM-based categorization, and thoughtful UI/UX polish, with a practical approach to production debugging and data reliability.
