Vetted AWS Glue Professionals

Pre-screened and vetted.

NN

Mid-level Data Engineer specializing in real-time streaming and cloud data platforms

Green Bay, WI · 5y exp
Stripe · New England College
TM

Senior Data Engineer specializing in cloud data platforms and big data pipelines

Austin, TX · 11y exp
Accenture
DV

Senior Software Engineer specializing in cloud backend systems and LLM-powered agents

Seattle, WA · 5y exp
Amazon · San José State University

Amazon Fire TV Devices engineer who built and shipped a production LLM-powered lab triage and validation system that grounds recommendations in internal runbooks/known-issue data and pushes evidence-based actions via dashboards and Slack. Emphasizes safety and measurability with structured JSON outputs, replay-based evaluation on historical incidents, and production metrics (e.g., disagreement rate and time-to-first-action), plus cost/latency optimizations like caching, batching, and rule-based fast paths.

NS

Mid-Level Software Engineer specializing in cloud-native systems, automation, and LLM-enabled robotics

Sunnyvale, CA · 6y exp
Amazon · Indiana University Bloomington

React-focused engineer who built a full-stack analytics/test-metrics dashboard (React frontend + Python backend) and turned common UI pieces (data tables, filter panels, chart wrappers) into a reusable internal component library with docs, examples, and basic tests. Strong on profiling-driven performance optimization (React Profiler, memoization) and on owning ambiguous internal-tool projects end-to-end; now planning to package internal patterns into public open-source components.


Sahithi S

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and Generative AI

Texas, USA · 6y exp
NVIDIA · Kennesaw State University

Built and deployed a production generative AI chatbot at NVIDIA using LangChain + GPT-3 integrated with internal data sources, cutting response time nearly in half and improving CSAT by ~12 points. Also delivered LLM-driven QA tools by fine-tuning Hugging Face transformer models and deploying via an AWS-based pipeline (Lambda/Glue/S3) with orchestration (Airflow/Step Functions), CI/CD, Kubernetes, and monitoring (MLflow/Splunk/Power BI).

LT

Mid-level Software Engineer specializing in ML platforms and cloud-native backend systems

San Francisco, CA · 5y exp
City and County of San Francisco · San Francisco State University

Software engineer with experience at Google and the City and County of San Francisco building production AI systems, including a RAG-based internal support chatbot and ML-driven ticket priority tagging. Has scaled data/ML platforms with Airflow on GCP (1M+ records/day, 99.9% SLA) and deployed multi-component systems with Docker and Kubernetes (GKE), using modern LLM tooling (LangChain/CrewAI, Claude/OpenAI, Pinecone/ChromaDB, Bedrock/Ollama).


Sahithi K

Screened

Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines

Boston, MA · 4y exp
Moderna · University of Massachusetts Dartmouth

Data engineer with experience at Moderna and Block owning high-volume (≈10TB/day) production pipelines on AWS, using Kafka/S3/Glue/dbt/Snowflake with strong data quality and observability practices (schema validation, anomaly detection, CloudWatch monitoring). Also built external financial API ingestion with Airflow retries, throttling/token rotation, and schema versioning, and helped stand up an early-stage biomedical data platform with CI/CD and incident debugging.

LM

Senior Data Engineer specializing in cloud ETL and real-time streaming pipelines

Austin, TX · 5y exp
eBay · Texas Tech University

Data engineer with eBay experience owning end-to-end pipelines for real-time order and user behavior analytics at 10M+ records/day. Strong in PySpark/SQL transformations, Airflow reliability patterns, and production observability (CloudWatch), with measurable outcomes including improved data quality and 30–40% query performance gains. Also built Python data APIs for analytics/ML consumers with versioning and backward compatibility.


Byron Pineda

Screened

Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps

Pascagoula, MS · 10y exp
Turing · Mississippi State University

Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.


Vismay Patel

Screened

Senior AI & Machine Learning Engineer specializing in NLP, GenAI, and MLOps

Berkeley, CA · 7y exp
Kaiser Permanente · San Francisco State University

ML/GenAI practitioner with healthcare domain depth who built and deployed a production cervical-cancer EMR classification system using a hybrid rules + medical BERT approach, optimized for high recall under severe class imbalance and PHI constraints. Experienced running end-to-end production ML/LLM pipelines with Apache Airflow (validation, promotion/rollback, monitoring, retraining) and partnering closely with clinicians to calibrate thresholds and implement human-in-the-loop review.

Saiteja Gaddam

Mid-Level Data Engineer specializing in cloud data platforms and streaming analytics

3y exp
Intuit · University at Buffalo

Data engineer (Intuit) who owned an end-to-end telemetry and subscription analytics platform processing ~22M events/day, built on Kinesis/S3/Glue/Spark/Airflow/Redshift. Strong focus on reliability and data quality (schema drift controls, quarantine layers, idempotent reruns) and performance tuning, achieving a reporting latency reduction from ~15 minutes to under 4 minutes while enabling revenue and churn analytics for business teams.

HK

Mid-level Full-Stack Software Engineer specializing in cloud and data platforms

Boston, MA · 5y exp
Northeastern University · Penn State University

Full-stack engineer with experience spanning Amazon IMDb and Northeastern’s NeuroJSON portal, combining consumer product work with complex scientific data applications. Built IMDb’s streaming providers feature—described as the company’s most impactful feature of 2023—and has hands-on experience with React/Angular, GraphQL, AWS, Python services, and production monitoring.

SS

Mid-level Business Data Analyst specializing in Financial Services and Healthcare analytics

USA · 4y exp
Visa · George Mason University

Full-stack engineer (~4 years) who has owned and shipped customer-facing SaaS onboarding and a role-based real-time analytics dashboard using TypeScript/React with a modular backend. Experienced in microservices with RabbitMQ and strong observability practices (correlation IDs, structured logging, queue metrics), and built an internal deployment tracker integrated with CI/CD that replaced manual spreadsheet/Slack processes.


Sai Venkata

Screened

Senior Data Engineer specializing in cloud lakehouse and real-time streaming pipelines

Texas, USA · 6y exp
CVS Health · University of Central Missouri

Senior data engineer with experience in both healthcare (CVS Health) and financial services (Bank of America), building large-scale Azure lakehouse pipelines (30+ EHR sources, ~5TB) and real-time streaming services (Event Hubs/Kafka) for patient vitals. Strong focus on reliability and data quality (Great Expectations, monitoring/alerting, schema drift automation), with measurable outcomes like 50% runtime reduction and 99%+ uptime for regulatory reporting pipelines.

JV

Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines

San Diego, CA · 6y exp
Intuit · Cleveland State University

Data engineer with Intuit experience owning end-to-end, high-volume financial data pipelines (API/S3 ingestion, Airflow orchestration, Spark/PySpark + SQL transforms, Snowflake marts). Strong focus on reliability and data quality—achieved 99.8% SLA and cut discrepancies by 35% using Great Expectations, reconciliation, schema versioning, and automated backfills; also built near real-time Kafka/API data services with CI/CD and observability.


Amit Sharma

Screened

Principal Software Engineer specializing in AI/LLM platforms, payments, and healthcare systems

San Francisco, CA · 25y exp
Fambot · University of Delhi

Engineering player-coach who recently shipped an agent-based workflow to extract key info from unstructured web data (browser agents + CDP) and populate daily digests/calendars, owning architecture through testing. Also built a Flask-based LLM evaluation and regression testing system using G-Eval/Confident AI dashboards, and applies a rigorous, research-driven approach to selecting third-party tools with stakeholder buy-in; has healthcare ops/onboarding workflow experience at Vivio Health.


Biplob Bidari

Screened

Senior Data Engineer specializing in FinTech analytics and ML data platforms

USA · 5y exp
Goldman Sachs · University of the Cumberlands

ML/AI engineer with Goldman Sachs experience building production fraud detection and RAG-based trading insights systems end-to-end. Stands out for combining real-time ML infrastructure, GenAI retrieval systems, and compliance-aware design, with measurable impact including nearly 25% false-positive reduction and improved analyst productivity.


Kevin Cruz

Screened

Senior Gen AI Engineer specializing in agentic LLM systems

Tempe, AZ · 15y exp
Opendoor · USC

Built and owned end-to-end production systems for a healthcare platform, including a predictive task recommendation feature (React + FastAPI + ML on AWS ECS) that cut backlog 20% and saved coordinators ~10 hours/week. Also productionized an AI-native RAG system (vector DB + LLM) delivering 40% faster query resolution, and led phased modernization of a monolithic FastAPI service into async microservices using feature flags and canary releases.

SG

Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps for healthcare and finance

6y exp
CVS Health · University of New Haven

Built a production LLM-powered RAG agent for healthcare/insurance operations that retrieves and summarizes patient medical documents with grounded citations, scaling to ~4.5M records. Addressed medical shorthand and terminology by fine-tuning ~120 lightweight DistilBERT models by specialty and validating entities against SNOMED/RxNorm, while using SHAP/LIME and human-in-the-loop review to make decisions explainable to stakeholders.

GV

Mid-level Full-Stack Software Engineer specializing in Java/Spring Boot and Angular

Frisco, TX · 5y exp
Cisco · Purdue University

Full-stack engineer with Cisco supply-chain and Wipro internal platform experience, focused on customer-facing UI performance and secure backend services. Built a bulk Excel inventory upload feature (Spring Boot/Apache POI) that cut manual effort ~80%, and delivered high-scale Angular/React dashboards with strong reliability/observability (FastAPI, JWT, Docker, AWS, AppDynamics).

Vasudha Prerepa

Mid-Level Java Full-Stack Developer specializing in cloud-native microservices

5y exp
BMO · Texas Tech University

QA/validation-focused engineer with experience at Meta testing an ML+LLM content classification/summarization system, including production-vs-test behavior gaps. Built automated E2E validation and drift monitoring (PSI, KL divergence, embedding cosine similarity) run daily/multiple times per day and gated via CI. Also implemented Jenkins-orchestrated Selenium/API test suites in Docker at Capgemini and partnered with a business analyst to convert business rules into automated AI-driven validation checks.

Praveen Nutulapati

Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems

New York, NY · 6y exp
JPMorgan Chase · University of Central Missouri

Built and deployed a production multi-agent RAG system at JPMorgan Chase to automate regulated credit analysis and compliance clause discovery across large internal policy/document libraries. Implemented LangGraph-based supervisor orchestration with structured state management (Azure OpenAI) to support long-running, resumable workflows, plus hybrid retrieval + re-ranking and guardrails for reliability. Strong at evaluation/observability (trace logging, LLM-judge, HITL) and at communicating results to non-technical stakeholders via Power BI embeds and Streamlit prototypes.
