Vetted Data Engineering Professionals

Pre-screened and vetted.

RN

Mid-Level Software Engineer specializing in Python backend, data engineering, and cloud microservices

New Jersey, USA | 6y exp
Abacus Insights | NJIT

Backend-leaning full-stack engineer with production experience in both healthcare (claims enrichment/interoperability at Abacus) and finance (Goldman Sachs pricing/risk APIs + React dashboards). Built an event-driven AI grading platform using Postgres Debezium CDC + Kafka + FastAPI on AWS that cut manual grading ~70% and served 1000+ students, with strong emphasis on reliability, testing, and performance tuning.

Joshua Spatz

Screened

Executive Technology Leader (CTO) specializing in IoT, enterprise systems, and digital transformation

Miami, FL | 18y exp
Aroma360 | University of Florida

Founder of an LLC operating as a consulting firm providing fractional CTO services to startups, giving them parallel exposure to multiple early-stage companies. Has direct experience with MVP development, building org structures from scratch, and supporting early fundraising, and is exploring a pivot from consulting into a scalable product business while staying engaged with the VC/accelerator ecosystem.

MG

Senior Data Engineer specializing in cloud data platforms and real-time streaming

6y exp
HCA Healthcare | Wright State University

Data engineer in healthcare (HCA) who owned end-to-end Azure-based pipelines at very large scale (50M+ daily claims/patient records). Strong focus on reliability: schema-drift fail-fast validation, quarantine layers, and Python/SQL data quality checks that reduced issues ~25%, plus performance tuning in Databricks/PySpark and versioned serving in Synapse for downstream consumers.

Sagar Patel

Screened

Mid-level Full-Stack Python Developer & Data Engineer specializing in ETL and web platforms

Arizona, United States | 6y exp
GoDaddy | Campbellsville University

Backend engineer who led major modernization efforts at GoDaddy, migrating legacy Perl services to Python/FastAPI with an incremental rollout strategy, containerization (Docker/Kubernetes), and CI/CD (Jenkins/GitHub Actions). Strong focus on secure, reliable API design (JWT, RBAC, PostgreSQL row-level security), rigorous testing, and data integrity—plus experience hardening an automated web-scraping pipeline against changing site structures and downtime.

Hsi-Chun Wang

Screened

Mid-level Data Scientist specializing in LLM development and scalable ML pipelines

Remote | 4y exp
GearFactory.ai | University of Maryland, College Park

Built and deployed production LLM pipelines for evidence-based scoring in two domains: biomedical literature mining (scoring ~2700 drug compounds vs gene targets/mechanisms) and long-horizon news analytics (35 years of Chinese articles). Emphasizes reliability at scale (retries/checkpointing/validation), rigorous empirical model benchmarking (GPT-4o/mini/5), and translating results into stakeholder-friendly visual narratives.

KD

Mid-level Business Analyst specializing in banking analytics and data engineering

Hollywood, FL | 4y exp
Santander | Indiana University Bloomington

Analytics professional at Santander Bank with hands-on experience building SQL and Python workflows for transaction reporting, reconciliation, and monitoring across messy multi-source financial data. They combine strong data validation and exception-handling practices with stakeholder-friendly dashboards, and also bring digital analytics experience from a Google Analytics UI optimization project focused on funnel drop-off and engagement.

SK

Mid-level Data Analyst and Data Engineer specializing in healthcare and financial analytics

3y exp
UnitedHealth Group | University of North Texas

Analytics professional with healthcare and operations experience who turns messy enterprise data from platforms like Teradata, GCP, SQL Server, and Snowflake into trusted reporting layers and reproducible analysis workflows. They combine SQL, Python, PySpark, Power BI, and Tableau to improve reporting accuracy and performance, including a 30% dashboard refresh improvement and 20-25% accuracy gains in healthcare reporting.

RT

Mid-level Machine Learning Engineer specializing in NLP, computer vision, and LLMs

New York City, NY | 3y exp
Wayfair | Stevens Institute of Technology

Wayfair ML/AI engineer who has shipped and operated production LLM systems for both internal analytics and customer-facing assistants. Stands out for combining strong RAG/retrieval engineering with production-grade platform work—improving trust, reducing latency by ~30%, and cutting ad hoc reporting demand by ~50%.

VT

Mid-level AI/ML Engineer specializing in Generative AI and agentic systems

4y exp
Walmart | University of Central Missouri

Backend/platform engineer who has owned a Python/FastAPI results API and deployed it on Kubernetes with Helm and GitHub Actions-driven CI/CD. Demonstrates strong production operations mindset across performance tuning, monitoring, safe rollouts/rollbacks, and phased migrations, plus hands-on Kafka streaming experience focused on ordering and idempotency.

Vineet Jujjavarapu

Mid-level Software Engineer specializing in cloud-native data platforms

College Park, MD | 3y exp
University of Maryland, College Park | University of Maryland, College Park

Software engineer with hands-on experience using AI coding assistants and LangChain-based agent workflows in RAG/LLM projects. Stands out for combining practical multi-agent experimentation with strong grounding in system design, distributed systems, and production-minded validation of AI-generated outputs.

PK

Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI

USA | 4y exp
GE HealthCare | Franklin University

LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE HealthCare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.

Aditya Sairam

Screened

Mid-Level Software Engineer specializing in cloud data platforms and AI search

Troy, MI | 6y exp
Robotics Technologies LLC | Cleveland State University

Open-source JavaScript contributor focused on data visualization, extending Chart.js/React with custom plugins for real-time streaming dashboards. Designed an end-to-end telemetry pipeline using Apache Kafka and Azure Cosmos DB, optimizing partitioning, batching, caching, and client throttling to keep latency low and support thousands of concurrent users. Demonstrates strong ownership in fast-changing environments, including building full-stack AI applications and ingestion/ETL pipelines at Robotics Technologies LLC.

KS

Mid-level AI/ML Engineer specializing in Generative AI and LLMOps

USA | 6y exp
UnitedHealth Group | Kent State University

Built and deployed a GPT-based RAG enterprise search system for healthcare clinicians, emphasizing low-latency performance and reduced hallucinations while maintaining end-to-end HIPAA compliance. Demonstrates deep applied experience with PHI-safe data governance (detection/redaction/de-identification), secure Azure ML deployment patterns, and orchestration of production LLM workflows using LangChain and Airflow.

MR

Mid-level GenAI Engineer specializing in production AI agents and evaluation pipelines

Overland Park, Kansas | 5y exp
Minutentag | Wilmington University

Built and shipped a production LLM-powered internal operations automation platform using LangChain RAG (Pinecone) and FastAPI microservices, deployed on AWS EKS, serving 10k+ daily interactions. Implemented a rigorous evaluation/observability stack (golden datasets, prompt regression tests, MLflow, retrieval metrics, hallucination monitoring) that drove hallucinations below 2% and improved reliability, and partnered closely with non-technical ops leaders to cut manual lookup work by 60%+.

Prachika Agarwal

Senior Solutions Architect and Data Analyst specializing in cloud data platforms and experimentation

New York, NY | 4y exp
Ovative Group | NYU

Software engineer who built and scaled an internal automation/auditing tool for analyzing Google and Adobe tagging containers, adopted by 13 internal clients and saving ~15 hours per audit. Has experience shipping containerized, Kubernetes-orchestrated systems and integrating OpenAI APIs into an agentic chatbot feature (plus prior NLP chatbot work during a Cyber Peace Foundation internship).

Sai Chatrathi

Screened

Mid-level AI/ML Engineer specializing in healthcare analytics and MLOps

NY, USA | 4y exp
Humana | Syracuse University

Built and deployed a production LLM-powered lesson adaptation platform for K–12 educators that personalizes content for multilingual and neurodiverse students using RAG and content transformation. Owned the full stack from FastAPI backend and OpenAI integration through reliability/safety controls, latency/cost optimization, and weekly shippable modular APIs, iterating directly with curriculum stakeholders to reduce hallucinations and improve educator trust.

Young Joon Suh

Senior Research Scientist specializing in AI for autonomous driving and semiconductors

Seoul, Korea | 5y exp
Korea Institute of Science and Technology | San José State University

Robotics perception engineer focused on autonomous driving 3D detection, integrating PETR embeddings into BEVFormer and tackling hard orientation/temporal alignment issues in multi-camera BEV pipelines. Uses Gazebo with custom sensor plugins to validate calibration, timing, and transforms, and blends synthetic labels with real imagery for scalable 3D box generation.

Sri Harsha Patallapalli

Mid-level Machine Learning & Data Infrastructure Engineer specializing in MLOps on AWS

Boston, MA | 5y exp
Dextr.ai | Northeastern University

Built and deployed a fine-tuned Qwen 2.5 14B model into production at Dextr.ai as the backbone for hotel-operations agentic workflows, running on AWS EKS with Triton and TensorRT-LLM. Demonstrates strong cost-aware LLM engineering (QLoRA, FP8/BF16 on H100) plus rigorous benchmarking/observability (Prometheus, LangSmith) with reported sub-30ms TTNT. Previously handled long-running ETL orchestration with Airflow at GE Healthcare and Lowe's.

Divyansh Agarwal

Junior Machine Learning Engineer specializing in computer vision and LLM applications

New York, NY | 3y exp
Adeptmind | NYU

Built and led an autonomous driving software effort for Formula Student, owning the full autonomy stack (perception, planning, control) orchestrated in ROS. Implemented stereo depth + YOLO object detection, RRT/RRT* planning, and a robust SLAM pipeline (Kalman filter, submapping) while leveraging Gazebo simulation and modern deployment tooling (Docker/Kubernetes, AWS, GitHub Actions CI/CD).

KP

Mid-level Data Engineer specializing in capital markets post-trade data platforms

Whippany, NJ | 3y exp
Barclays | University of Connecticut

Data/streaming engineer in capital markets who led an end-to-end trade settlement data product (Kafka→MongoDB→data lake) with rigorous data-quality logic and ~$175K first-year operational impact. Also built a low-latency Go-based CME market data engine feeding SOFR curve generation, using MSK on EKS with performance tuning (idempotency, compression, partitioning) to achieve sub-100ms delivery.

AB

Junior Software Engineer specializing in full-stack, data engineering, and mobile apps

Seattle, WA | 3y exp
Amazon | Arizona State University

Built production LLM agents at Hivenue and Amazon, spanning consumer booking automation and internal data-query/reporting workflows. Stands out for combining conversational UX with strong reliability engineering—strict tool use, state machines, schema validation, idempotency, and evaluation pipelines—and can point to measurable impact including a 21% reduction in time to book and a 12% conversion lift.

ST

Senior Software Engineer specializing in backend systems and data platforms

Texas, USA | 5y exp
Walmart | New England College

Software developer who uses AI pragmatically across the full stack to accelerate coding, testing, debugging, and documentation while maintaining strong human oversight. Stands out for treating AI output like any other code source—reviewing for architecture fit, security risks, performance, and standards before integration—and for coordinating multiple AI tools across backend, frontend, and test workflows.

RG

Senior Full-Stack Developer specializing in Python, cloud microservices, and AI/ML

Oviedo, Florida | 11y exp
FocustApps | St. Francis University

Backend/data engineer with hands-on production experience across GCP and AWS: built FastAPI microservices on Cloud Run and delivered AWS Lambda + ECS Fargate systems with Terraform/GitHub Actions. Strong in data engineering (Glue/Spark, S3/Redshift) and modernization (SAS to Python/SQL), with proven reliability and incident ownership—including cutting a 20+ minute reporting query to under 2 minutes.

Sumanth Gunda

Screened

Mid-level Backend Software Engineer specializing in cloud data services

4y exp
Cardinal Health | Arizona State University

Data engineer/backend engineer with experience in healthcare (Cardinal Health provider enrollment) and finance (Northern Trust) building and stabilizing data pipelines and REST services. Worked with APIs and Kafka at ~200k–300k records/day, improving data quality (DLQ + validation), performance (SQL/indexing), and reliability/observability (logging, alerts, consumer lag metrics), and stood up an early-stage financial data service with Jenkins-based CI/CD.
