Vetted Data Quality Professionals

Pre-screened and vetted.

PV

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

New York City, NY6y exp
AvanadeUniversity of North Texas

Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.

View profile
SP

Mid-level Data Analyst specializing in AI/ML and advanced analytics

USA3y exp
AccentureMurray State University

Accenture data/ML practitioner who deployed a retail churn prediction and BERT-based sentiment analysis system to production, integrating behavioral + feedback data and operationalizing it with ETL automation, orchestration, and CI/CD. Experienced managing 2TB+ multi-source data, monitoring drift in Databricks, and translating results into Power BI dashboards for marketing teams (including K-means customer segmentation).

View profile
VM

Senior DevOps & Release Engineer specializing in CI/CD automation and AWS IaC

Raleigh, NC12y exp
VidmobUniversity of Central Missouri

Infrastructure/DevOps engineer (Vidmob) focused on AWS + containers, owning GitLab CI/CD and Terraform-managed environments. Led a high-impact CI incident by correlating runner queue time, Docker pull latency, and NAT egress; implemented ECR pull-through caching and VPC endpoints to restore performance and then standardized the fix in Terraform for future scale-ups.

View profile
AZ

Alicia Zhang

Screened

Mid-level Sales Engineer & Solution Architect specializing in cloud and data platforms

CA, US6y exp
TP-LinkBentley University

LLM-focused customer-facing technical leader with experience productionizing LLM workflows in financial services (State Street), including guardrails, retrieval tuning, and reliability improvements. Also partners closely with sales and executives—at Payoneer helped drive enterprise-wide adoption for a $10M ARR global account through technical discovery, demos, and pilots.

View profile
NY

Naga Yanala

Screened

Mid-level Data Engineer specializing in cloud data pipelines and analytics platforms

Texas, USA5y exp
Molina HealthcareSoutheast Missouri State University

Data engineer with healthcare and enterprise experience (Molina Healthcare, Dell Technologies) building and operating high-volume batch + streaming pipelines across AWS and Azure. Strong focus on data quality (schema validation, fail-fast checks), reliability (monitoring/alerts, retries), and performance tuning (Spark/partitioning), with measurable runtime reduction and improved downstream trust.

View profile
SK

Mid-level Data Engineer specializing in cloud data pipelines and financial services warehousing

Chicago, IL4y exp
Charles SchwabDePaul University

Data engineer (Charles Schwab) who took ownership of an unstable, ambiguous nightly financial data pipeline and rebuilt it into a reliable, incremental AWS Glue/Airflow/Redshift system feeding Power BI. Created a custom Python data-quality framework with hard-stop gating and schema drift detection, improving integrity (99.9%), cutting runtime (~20%), and reducing incidents/tickets (35% fewer schema-related dashboard incidents; 30% fewer investigations).

View profile
Jaideep bommidi - Senior ML Engineer & Data Scientist specializing in LLM agents, retrieval/ranking, and MLOps in Denton, TX

Senior ML Engineer & Data Scientist specializing in LLM agents, retrieval/ranking, and MLOps

Denton, TX8y exp
Webster BankUniversity of North Texas

Machine Learning Engineer currently at Webster Bank building an enterprise-scale LLM agent for Temenos Journey Manager/Maestro, using RAG-style multi-stage retrieval with FAISS/Pinecone, hybrid dense+sparse search, and LoRA fine-tuning optimized via NDCG/MAP and A/B testing. Previously handled messy incident/telemetry data at Deuta Werke GmbH with deterministic + fuzzy entity resolution, and has strong production data engineering experience across Spark/Hadoop and Python ETL systems.

View profile
Mike Khorrami - Director-level Engineering Leader specializing in enterprise SaaS and cloud-native platforms in Woodland Hills, CA

Mike Khorrami

Screened

Director-level Engineering Leader specializing in enterprise SaaS and cloud-native platforms

Woodland Hills, CA25y exp
BlackLineCalifornia State University, Northridge

Engineering leader/player-coach who modernized a legacy C#/SQL Server system to Snowflake + Python on GCP, enabling ~30x scale and supporting hundreds of millions of transactions per day per customer. Strong in architecture tradeoffs (Snowflake vs Databricks), production reliability (New Relic, logging/alerting), and lightweight process improvements like a rigorous Definition of Done and structured PR reviews.

View profile
UMESH KAMISETTY - Mid-level Data Engineer specializing in cloud lakehouse and streaming platforms in Seattle, WA

Mid-level Data Engineer specializing in cloud lakehouse and streaming platforms

Seattle, WA5y exp
First United BankCleveland State University

Data engineer focused on building production-grade pipelines on AWS (Kafka/Kinesis/Glue/S3) through to curated serving layers in Snowflake and Delta Lake. Emphasizes automated data quality validation (PySpark + CI/CD), modular dbt transformations for analytics (customer spending, risk metrics), and operational reliability with CloudWatch and DLQs; data consumed by BI tools and ML pipelines for fraud detection and risk analytics.

View profile
Mohan Naik Megavath - Mid-level Data Engineer specializing in real-time pipelines and cloud data platforms in Remote, USA

Mid-level Data Engineer specializing in real-time pipelines and cloud data platforms

Remote, USA4y exp
TruistElmhurst University

Backend engineer with hands-on experience building secure Python/Flask services (sessions, JWT, RBAC) and optimizing PostgreSQL/SQLAlchemy performance, including custom SQL using CTEs/window functions profiled via EXPLAIN ANALYZE. Also integrates LLM features via OpenAI/Azure into backend systems and improves scalability with RabbitMQ-driven async processing, caching, and multi-tenant data isolation patterns.

View profile
KP

Mid-level Data Engineer specializing in capital markets post-trade data platforms

Whippany, NJ3y exp
BarclaysUniversity of Connecticut

Data/streaming engineer in capital markets who led an end-to-end trade settlement data product (Kafka→MongoDB→data lake) with rigorous data-quality logic and ~$175K first-year operational impact. Also built a low-latency Go-based CME market data engine feeding SOFR curve generation, using MSK on EKS with performance tuning (idempotency, compression, partitioning) to achieve sub-100ms delivery.

View profile
SS

Senior Data Analyst specializing in healthcare and financial analytics

Columbus, OH5y exp
NationwideWichita State University

Healthcare analytics candidate with hands-on experience turning messy claims data in Redshift and S3 into validated reporting tables, plus automating KPI workflows in Python. They’ve owned end-to-end operational analytics projects, including a claims delay analysis that improved processing efficiency by about 20%, and have experience driving stakeholder adoption of standardized metrics across dashboards.

View profile
MOUNIKA SAI MEKALA - Junior Data Analyst specializing in financial and operational analytics in Kansas, USA

Junior Data Analyst specializing in financial and operational analytics

Kansas, USA3y exp
KPMGUniversity of Central Missouri

Analytics professional with experience at KPMG turning messy operational and financial data from SQL Server and AWS S3 into clean reporting datasets and automated Python workflows. They combine SQL, Python, Power BI, and experimentation methods to deliver stakeholder-aligned KPI dashboards and marketing performance insights with a strong focus on data integrity and reproducibility.

View profile
MA

Junior Business & Data Analyst specializing in analytics and AI-driven insights

Seattle, WA2y exp
CarnelianUniversity of Washington

Master’s in Business Analytics candidate with hands-on project experience spanning FMCG sales analytics, insurance risk modeling, and HR attrition analysis. Demonstrates strong SQL and Python fundamentals, including advanced CTE/window-function work, reproducible modeling workflows, and Power BI dashboards that translate analysis into clear business actions.

View profile
SS

Intern AI/ML Engineer specializing in full-stack and data systems

Boston, MA1y exp
ChewyUniversity of Massachusetts Amherst

Built an LLM-powered customer segmentation agent during a Chewy internship, consolidating Snowflake data into a knowledge graph so non-technical marketing users could query customer cohorts in natural language. Stands out for combining agent/tooling design with rigorous data engineering practices, including schema audits, imputation, validation layers, and idempotent pipelines on messy large-scale datasets.

View profile
VM

Varun Mallela

Screened

Mid-level Data Analyst specializing in financial and healthcare analytics

Richardson, TX3y exp
Franklin TempletonUniversity of Colorado Boulder

Analytics professional with experience at Franklin Templeton and IQVIA India, focused on turning messy cross-system data into trusted reporting and actionable business insights. Stands out for combining SQL, Python, AWS ETL, and BI dashboards to solve data quality issues, improve investor engagement analysis, and standardize commercial reporting in financial services and pharma contexts.

View profile
SAITEJA MALLEMPUDI - Senior Data Scientist and AI/ML Engineer specializing in GenAI and cloud ML in Chicago, IL

Senior Data Scientist and AI/ML Engineer specializing in GenAI and cloud ML

Chicago, IL6y exp
BMOLewis University

ML/AI engineer with hands-on experience owning systems from experimentation through deployment and monitoring, including a Bank of Montreal project that improved timely interventions by 12%. Also brings GenAI/RAG experience with evaluation and safety guardrails, plus clinical NLP pipeline work extracting medication data from notes for patient risk prediction.

View profile
GY

George Yu

Screened

Senior Full-Stack & Mobile Engineer specializing in Node.js and React

San Francisco, CA12y exp
TaskRabbitUniversidad Tecnológica de Panamá

Backend engineer with TaskRabbit experience building and operating payment/booking services in Python/Django on AWS (ECS + Lambda) with Kafka/SQS eventing. Demonstrates strong production reliability and incident ownership in high-stakes payment flows (idempotency, strict timeouts, retries, monitoring/alerting) plus data/ETL work in AWS Glue and measurable SQL performance wins.

View profile
AE

Ashwitha E

Screened

Junior Data Scientist specializing in fraud analytics and cloud data platforms

Dallas, TX3y exp
Bank of AmericaUniversity of North Texas

Built and deployed production LLM-powered document summarization/classification systems using embeddings, vector databases (RAG-style retrieval), and automated evaluation (BERTScore/ROUGE), with a focus on monitoring and scalable cloud pipelines. Also partnered with a fraud analytics team to deliver a transaction anomaly detection solution, translating model outputs into Power BI dashboards and actionable KPIs while iterating on thresholds and alerts based on stakeholder feedback.

View profile
KK

Mid-level Generative AI Engineer specializing in LLM apps, RAG, and MLOps

Remote, United States6y exp
AccentureEastern Illinois University

LLM/GenAI engineer with US Bank experience building a production financial-document intelligence platform using LangChain/LangGraph, GPT-4, and Amazon OpenSearch. Delivered a RAG-based assistant for compliance/audit teams with grounded, cited answers, focusing on reducing hallucinations and latency, and deployed securely on AWS (SageMaker/EKS) with CI/CD and evaluation tooling (LangSmith, RAGAS).

View profile
KE

Kamal Ede

Screened

Mid-level Data Engineer specializing in cloud data platforms, Spark, and streaming pipelines

MO, USA4y exp
S&P GlobalUniversity of Central Missouri

Data/MLOps engineer (Cognizant background) who owned an AWS/Airflow/Snowflake healthcare transactions pipeline processing ~8–10M records/day and cut pipeline/data-quality incidents by ~33%. Also built and deployed a production FastAPI model-inference service on Kubernetes (Docker, HPA) with strong observability (Prometheus/Grafana), versioned endpoints, and resilient backfill/idempotent external data ingestion patterns.

View profile
OL

Mid-level Data Engineer specializing in cloud data pipelines and streaming

Charlotte, NC5y exp
Wells FargoUniversity of North Texas

Data engineer with experience at Wells Fargo and Accenture owning end-to-end production pipelines processing hundreds of millions of transactional/risk records daily. Strong focus on data quality and reliability (reconciliation checks, schema drift detection, CloudWatch alerting) plus Spark performance tuning and idempotent backfills using Delta Lake/merge logic across AWS (S3/EMR/Databricks/Redshift) and Azure (ADF/Azure DevOps/Azure Monitor).

View profile
MR

Mid-level Data Engineer specializing in AWS/Azure pipelines and streaming analytics

VA, USA5y exp
UnitedHealth GroupGeorge Mason University

Data engineer with experience across healthcare and geospatial risk systems, owning end-to-end pipelines from ingestion through serving on AWS/Azure stacks. Built HIPAA-compliant data quality gates and CDC for millions of daily claims, and also delivered a real-time wildfire risk platform with 20-minute refresh cycles and a 60% data accuracy lift. Strong in streaming (Kafka), Spark performance tuning, and production-grade orchestration/CI/CD (Airflow, Docker, Jenkins, GitHub Actions, Terraform).

View profile
SG

sumanth gunda

Screened

Mid-level Backend Software Engineer specializing in cloud data services

4y exp
Cardinal HealthArizona State University

Data engineer/backend engineer with experience in healthcare (Cardinal Health provider enrollment) and finance (Northern Trust) building and stabilizing data pipelines and REST services. Worked with APIs and Kafka at ~200k–300k records/day, improving data quality (DLQ + validation), performance (SQL/indexing), and reliability/observability (logging, alerts, consumer lag metrics), and stood up an early-stage financial data service with Jenkins-based CI/CD.

View profile

Need someone specific?

AI Search