Vetted Amazon EMR Professionals

Pre-screened and vetted.

Kamal Ede

Screened

Mid-level Data Engineer specializing in cloud data platforms, Spark, and streaming pipelines

MO, USA · 4y exp
S&P Global · University of Central Missouri

Data/MLOps engineer (Cognizant background) who owned an AWS/Airflow/Snowflake healthcare transactions pipeline processing ~8–10M records/day and cut pipeline/data-quality incidents by ~33%. Also built and deployed a production FastAPI model-inference service on Kubernetes (Docker, HPA) with strong observability (Prometheus/Grafana), versioned endpoints, and resilient backfill/idempotent external data ingestion patterns.

OL

Mid-level Data Engineer specializing in cloud data pipelines and streaming

Charlotte, NC · 5y exp
Wells Fargo · University of North Texas

Data engineer with experience at Wells Fargo and Accenture owning end-to-end production pipelines processing hundreds of millions of transactional/risk records daily. Strong focus on data quality and reliability (reconciliation checks, schema drift detection, CloudWatch alerting) plus Spark performance tuning and idempotent backfills using Delta Lake/merge logic across AWS (S3/EMR/Databricks/Redshift) and Azure (ADF/Azure DevOps/Azure Monitor).

MR

Mid-level Data Engineer specializing in AWS/Azure pipelines and streaming analytics

VA, USA · 5y exp
UnitedHealth Group · George Mason University

Data engineer with experience across healthcare and geospatial risk systems, owning end-to-end pipelines from ingestion through serving on AWS/Azure stacks. Built HIPAA-compliant data quality gates and CDC for millions of daily claims, and also delivered a real-time wildfire risk platform with 20-minute refresh cycles and a 60% data accuracy lift. Strong in streaming (Kafka), Spark performance tuning, and production-grade orchestration/CI/CD (Airflow, Docker, Jenkins, GitHub Actions, Terraform).

Yijun Chen

Screened

Senior Full-Stack Software Developer specializing in IoT and cloud systems

Toronto, ON · 4y exp
Pulsenics · University of Toronto

Frontend-focused engineer who built a full movie recommendation system from concept to production, comparing classic collaborative filtering with LLM-based recommendation approaches on AWS. Emphasizes scalable architecture, strict TypeScript data contracts, and high-quality Next.js/React UI patterns (defensive states, scoped state management, performance optimization) with disciplined QA and feature-flagged rollouts.

NR

Mid-level AI Engineer specializing in LLMs, RAG, and MLOps

5y exp
Wells Fargo · Southern Methodist University

Built and deployed a production RAG-based internal knowledge assistant that let analysts query company documents in natural language, using LangChain/LangGraph with Pinecone and a FastAPI service for integration. Emphasizes reliability in production through hallucination mitigation (retrieval tuning + prompt guardrails) and measurable evaluation/monitoring (accuracy, latency, task completion, hallucination rate), iterating based on user feedback.

Rishitha Reddy Katamareddy

Mid-level Generative AI & Machine Learning Engineer specializing in agentic LLM systems

USA · 4y exp
Optum · University at Buffalo

Built and deployed a production agentic LLM knowledge assistant that answers complex questions over internal documents, APIs, and databases using a RAG architecture (FAISS/Pinecone) and LangChain/LangGraph orchestration. Emphasizes production-grade reliability and hallucination control through grounding, confidence thresholds, validation, retries/fallbacks, and full observability (logging/metrics/traces) with continuous evaluation and feedback loops.

Daniel Jin

Screened

Intern Site Reliability Engineer specializing in Kubernetes, AWS, and observability

New York, NY · 1y exp
Woori America Bank · NYU

Backend/data engineering candidate specializing in Python/Flask services and ML-enabled systems, deploying containerized workloads on AWS ECS/EKS with strong observability (Prometheus/Grafana) and PostgreSQL performance tuning. Built multi-tenant architectures with row- and schema-level isolation and optimized a Kubernetes-based Airflow + Spark nightly ETL pipeline for an e-commerce client, improving performance by 250%+ and reliably beating morning reporting deadlines; also contributed to Apache Airflow (SQLAlchemy/PostgreSQL area).

Revanth Goli

Screened

Senior Data & Backend Engineer specializing in cloud data pipelines and LLM/RAG systems

Morrisville, NC · 6y exp
Syneos Health · University of Alabama at Birmingham

Data engineer with end-to-end ownership of large-scale retail and clinical data ingestion/processing on AWS, including real-time streaming and batch pipelines. Delivered measurable outcomes: 20M daily transactions processed, latency cut from 4 hours to 5 minutes, ~70% fewer failures, and 120+ pipelines running at 99.8% reliability with full audit compliance.

BK

Mid-level Data Engineer specializing in big data pipelines and real-time streaming

Dallas, TX · 6y exp
Johnson & Johnson · University of North Texas

Data engineer who has owned end-to-end production pipelines processing a few million records/day, using Python/Airflow/SQL/PySpark with Snowflake serving to BI (Power BI). Built resilient external web data collection systems (anti-bot, schema-change detection, backfills) and shipped versioned REST APIs for internal consumers, improving pipeline success rates to 99% through monitoring, retries, and idempotent design.

TD

Mid-level Cloud Data Engineer specializing in Azure/AWS pipelines and medallion architecture

USA · 4y exp
UnitedHealth Group · Southern Illinois University Carbondale

Data engineer focused on reliability and data quality, owning end-to-end pipelines processing ~100k–300k records/day. Implemented robust validation and monitoring that cut reporting issues by ~30%, and built stable external data collection with anti-bot measures, backfills, and schema-change detection while maintaining backward-compatible internal data services.

Nithyashree Raghunathan

Mid-level Software Engineer in Test specializing in AI and healthcare platforms

Santa Clara, CA · 5y exp
Meta · Penn State Great Valley

QA/data pipeline engineer with hands-on AI product building experience, spanning enterprise AWS migration testing for Belgian postal services and personal multi-agent systems in fintech and recruiting. Stands out for combining rigorous validation and production stability work with modern LLM orchestration, guardrails, and messy-document normalization workflows.

NJ

Mid-level Data & AI Engineer specializing in healthcare data pipelines and MLOps

FL, USA · 4y exp
Humana · Florida State University

Built and deployed a production LLM-powered clinical note summarization system used by care managers to speed review of 5–20 page unstructured medical records. Implemented safety-focused validation (prompt constraints, rule-based and section-level checks, human-in-the-loop) to reduce hallucinations while maintaining low latency and meeting privacy/regulatory constraints, integrating via APIs into existing clinical tools.

SM

Mid-level AI/ML Engineer specializing in GenAI agents, RAG pipelines, and MLOps

USA · 6y exp
UnitedHealthcare · Kent State University

AI/ML engineer who built a production RAG-based internal document intelligence assistant (LangChain + Pinecone) to let employees query enterprise reports in natural language. Demonstrated hands-on pipeline orchestration with Apache Airflow and tackled real production issues like retrieval grounding and latency using tuning, caching, and token optimization, while partnering closely with non-technical business stakeholders through iterative demos.

HG

Senior Data Engineer specializing in cloud-native data platforms for finance and healthcare

Charlotte, NC · 4y exp
Bank of America · University of Cincinnati

Data engineer/backend data services practitioner with Bank of America experience building real-time and batch transaction-monitoring pipelines and APIs (Kafka + databases, REST/GraphQL). Highlights include a reported 45% response-time improvement through performance optimizations and use of Delta Lake schema evolution plus CI/CD (GitHub Actions/Jenkins) and operational reliability patterns like CloudWatch monitoring and dead-letter queues.

MV

Senior Data Engineer specializing in cloud data platforms and big data pipelines

Seattle, WA · 8y exp
Safeco · Fitchburg State University

Data engineer focused on building reliable, production-grade pipelines and external data collection systems on AWS (S3/Lambda/SQS/Glue/EMR) using PySpark/SQL, serving curated datasets to Snowflake/Redshift for finance and fraud teams. Has operated a large-scale crawler ingesting millions of records/day with anti-bot tactics, schema versioning/quarantine, and CloudWatch/Datadog monitoring, and also shipped a versioned REST API with caching and query optimization.

Pravalika Kasojjala

Mid-level AI/ML Engineer specializing in LLM, RAG/GraphRAG, and fraud analytics

Charlotte, NC · 5y exp
Bank of America · University of Wisconsin–Milwaukee

LLM/agent engineer who has deployed a production internal assistant to reduce employee inquiry resolution time while maintaining regulatory compliance. Experienced with RAG, hallucination risk triage, and graph-based orchestration (LangGraph) for enterprise/banking-style workflows, emphasizing schema-validated, citation-backed, tool-constrained agent designs and tight collaboration with non-technical business/compliance stakeholders.

Nikitha Margadi

Mid-level Data Engineer specializing in cloud lakehouse, streaming, and MLOps

Texas, USA · 5y exp
AT&T · Cal State Fullerton

Data engineer at AT&T focused on large-scale telecom (5G/IoT) data platforms, owning end-to-end pipelines from Kafka/Azure ingestion through Databricks/Delta Lake transformations to serving analytics and ML. Has operated at very high volumes (~50+ TB/day) and delivered measurable performance gains (25–30% faster processing) plus improved reliability via Airflow monitoring, robust data quality checks, and resilient external data collection patterns (rate limiting, retries, dynamic schemas).

Bharath TVS

Screened

Senior Data Scientist specializing in NLP, LLMs, and Computer Vision

Westlake, OH · 7y exp
KeyBank

Applied NLP/ML engineer with experience at KeyBank and Novartis building production document intelligence and entity-resolution systems in finance and healthcare. Has delivered end-to-end pipelines (Airflow + AWS) using transformers (DistilBERT/Sentence-BERT), vector search (FAISS/Milvus/Pinecone), and human-in-the-loop labeling to achieve measurable gains (40%+ faster queries; up to 88% F1 and 93% precision/90% recall in entity linking).

Ramya Latha

Screened

Senior AI/ML & Data Engineer specializing in Generative AI and RAG systems

Birmingham, AL · 8y exp
Regions Bank

GenAI/RAG engineer who has deployed a production policy/regulatory search assistant for a financial client using LangChain + Vertex AI, FastAPI, Docker/Kubernetes, and Airflow-orchestrated data pipelines. Demonstrated measurable impact with 50–60% latency reduction and 70% fewer pipeline failures, plus KPI-driven grounding evaluation (90%+ target) and strong cross-functional collaboration with compliance/business teams.

BR

Mid-level Data Engineer specializing in financial risk, compliance, and real-time streaming

Remote, USA · 4y exp
LTIMindtree · Concordia University, St. Paul

Sonica Kalmangi

Senior Data Engineer specializing in cloud data platforms and LLM/RAG solutions

Boston, MA · 10y exp
ThoughtWorks · University of Massachusetts

Ramya V

Mid-level Full-Stack Java Developer specializing in cloud-native microservices

Toronto, Canada · 8y exp
TD Bank · Loyalist College

Yashwitha Linga

Mid-level Data Engineer specializing in cloud-native data pipelines and streaming

New Haven, CT · 3y exp
Dell Technologies · Sacred Heart University

RS

Mid-level Full-Stack Software Engineer specializing in scalable backend and FinTech systems

San Diego, CA · 4y exp
CalAmp · University of Illinois Chicago