Vetted Data Validation Professionals

Pre-screened and vetted.


Hariom Vyas

Screened

Senior Business Analyst specializing in BFSI reporting and BI

Dallas, TX · 4y exp
Goldman Sachs · University of Maryland, Baltimore County

Forward-deployed, full-stack/platform engineer who owns production features end-to-end across frontend, backend, data, and infrastructure (AWS serverless, Terraform, React). Has modernized critical fintech/payment systems (zero-downtime monolith-to-microservices with Kafka event sourcing) and productionized AI-native support workflows (LLM + RAG on Pinecone) with measurable gains in latency, incidents, CSAT, and support efficiency.

KW

Entry-level Data Scientist specializing in healthcare analytics and automation

Kansas City, KS · 1y exp
University of Kansas Medical Center · University of Michigan

QA engineer with B2B SaaS startup exposure across healthcare and data center clients, including serving as the sole QA on one engagement. Stands out for quickly ramping from no OOP background into automated testing in C# and Java, while also handling client communication, release regression reporting, and prospective client research.

AK

Mid-level Machine Learning Engineer specializing in MLOps, NLP, and production ML systems

5y exp
Comcast · University of Central Missouri

Backend/founding-engineer-style builder who designed and evolved a near-real-time customer churn prediction platform (FastAPI + AWS SageMaker/Lambda + Redis + MLflow) to enable real-time retention actions, reporting ~18% churn reduction. Demonstrates strong production engineering in secure API design, incremental migrations with data integrity safeguards, and robustness improvements in async pipelines (idempotency, DLQs, retry visibility).


Sai Dev

Screened

Mid-level AI/ML Engineer specializing in MLOps, computer vision, and NLP

Newark, CA · 4y exp
Lucid Motors · Cleveland State University

GenAI/ML engineer from Lucid Motors who built and productionized an LLM-powered RAG diagnostic assistant for manufacturing and maintenance teams, deployed on AWS with Docker/Kubernetes and MLflow. Demonstrates end-to-end ownership from retrieval/prompt design to scalability, monitoring, and workflow integration via APIs, plus production ML pipeline orchestration with Kubeflow (Spark/Kafka + TensorFlow) for predictive maintenance use cases.

AA

Mid-Level Full-Stack Python Engineer specializing in cloud APIs and data/ML platforms

Bentonville, AR · 4y exp
Walmart · University of Central Missouri

Backend engineer at Goldman Sachs who deployed internal LLM-powered utilities to summarize operational logs/tickets, with a strong emphasis on data sensitivity and reliability. Built deterministic workflows with template-based prompts, confidence checks, and rule-based fallbacks, and used monitoring plus failure-rate metrics to tune performance; also has hands-on Temporal orchestration experience for resilient async backend jobs.

HK

Mid-level Data/ML Engineer specializing in NLP, GenAI, and scalable data pipelines

5y exp
Abbott · Clarkson University

AI/ML engineer with production experience building LLM-powered document intelligence and customer support systems in healthcare/insurance, emphasizing high-accuracy RAG, long-document processing, and robust monitoring/fallback mechanisms. Also automates and scales ML lifecycle workflows using Apache Airflow and Kubeflow, and partners closely with non-technical operations stakeholders to drive adoption.

SM

Mid-level Data Scientist specializing in NLP/LLMs, time series forecasting, and MLOps

New York, NY · 6y exp
Citigroup · Kent State University

Data/ML practitioner with hands-on experience building NLP systems from prototype to production: delivered a Twitter sentiment classifier with robust preprocessing, SVM modeling, and Power BI reporting, and built entity-resolution pipelines for messy multi-source customer data (reporting ~95% improvement in unique entity identification). Also implemented semantic linking/search using SBERT embeddings with FAISS vector retrieval and domain fine-tuning (reported ~15% precision lift), and applies production workflow best practices (Airflow/Prefect, Docker, Azure ML/Databricks, Great Expectations).


Yu Liu

Screened

Senior Big Data Engineer specializing in AML/KYC compliance and cloud data platforms

New York, NY · 17y exp
Citigroup · University of Missouri

Data engineer with experience delivering an end-to-end pipeline handling ~3.5TB in a star-schema setup (fact + dimensions) and producing business-facing tables in Hive/Spark. Identified and resolved UAT-reported duplicate issues caused by joins through root-cause analysis, and also built automation to run Spark SQL metrics on weekly/monthly/quarterly cadences and distribute results to users.

NM

Mid-level Data Engineer specializing in Analytics & AI/ML

Virginia, USA · 6y exp
Sony · Fitchburg State University

Data engineer with experience at Sony and Walmart building high-volume, near-real-time analytics and ingestion systems. Has owned end-to-end pipelines from Kafka/Spark streaming through S3/Parquet and Redshift/Looker, emphasizing data quality (Great Expectations), observability (CloudWatch/Azure Monitor), and reliability (Airflow SLAs, retries, checkpointing), including measurable performance and latency improvements.

BS

Senior Data Engineer specializing in cloud lakehouse platforms and streaming analytics

Pittsburgh, PA · 8y exp
First National Bank · Texas A&M University-Corpus Christi

Data engineer focused on fraud and banking analytics who has owned end-to-end batch + streaming pipelines at very large scale (hundreds of millions of records/day). Built robust data quality/observability layers (schema validation, anomaly detection, alerting) and delivered low-latency serving via AWS Lambda/API Gateway with DynamoDB + Redis, plus external data ingestion/scraping pipelines orchestrated in Airflow with anti-bot protections.


Ruthvik Bacha

Screened

Mid-level Data Engineer specializing in financial data pipelines and reliability

North Carolina, USA · 7y exp
Wells Fargo · University of South Florida

Systems/robotics-oriented software engineer focused on real-time orchestration and reliability: built a central control layer coordinating multiple concurrent agents with safe state machines, failure isolation, and recovery. Has hands-on ROS/ROS 2 integration experience in simulation (DDS/QoS, lifecycle, nodes in Python/C++) and emphasizes observability (structured JSON logs, correlation IDs) and low-latency control-loop performance under load.


Sanjana Duvva

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

5y exp
Wells Fargo · University of North Texas

Built and deployed an AWS-based LLM/RAG ticket triage and knowledge retrieval system (Pinecone/FAISS + Step Functions + MLflow) that cut support resolution time by 20%. Demonstrates strong production focus on hallucination reduction, PII security, and low-latency orchestration, with measurable evaluation improvements (e.g., ~25% grounding accuracy gain via re-ranking) and proven collaboration with support operations stakeholders.

Akshaya Chiduruppa

Mid-level Quality Assurance Engineer specializing in AI/ML and Apple ecosystem testing

Seattle, WA · 4y exp
Apple · Missouri University of Science and Technology

QA automation engineer with end-to-end ownership of a regression suite for a warehouse loan management platform (.NET/Angular), using Selenium (Java/Cucumber/POM) and Cypress. Improved suite stability and expanded risk-based coverage (DB/API/SQL, RBAC approval workflows), catching critical financial defects like EMI calculation errors and cutting regression effort by ~50% while gating releases via GitLab CI/CD with actionable Slack reporting.

Bhanu Prakash Reddy Dakilli

Mid-level Data Engineer specializing in Azure ETL/ELT and data warehousing

Framingham, MA · 4y exp
Bank of America · New England College

Data engineer who has owned end-to-end production pipelines for customer transaction data (~2–5 GB/day) using Python/PySpark/SQL and Airflow, delivering major reliability and speed gains (70% faster reporting; 60–70% fewer data issues). Also built a daily external web-scraping system with anti-bot handling and safe, idempotent Airflow-driven backfills, plus a Python data API optimized with indexing/caching and tested for correctness.


Ishaan Nanal

Screened

Intern-level Software Engineer specializing in backend systems and AI/ML

Ithaca, NY · 1y exp
QuorAgra · Cornell University

Built and shipped an LLM-powered RAG research copilot used by 20+ users across biology, physics, and ML, cutting literature review from days to minutes. Strong focus on production reliability—iterated on chunking/retrieval/prompting, added validation and modular pipelines for debuggability, and is now containerizing and scaling the system with Docker and GCP.

DM

Mid-level Data Engineer specializing in real-time analytics and regulated domains

NC, USA · 5y exp
JPMorgan Chase · Saint Louis University

Data platform engineer focused on large-scale, real-time fraud systems, with hands-on ownership of streaming architectures using Kafka, Spark, Snowflake, and Databricks. Stands out for combining performance tuning and platform automation with LLM/RAG-based enrichment, delivering measurable gains in latency, fraud accuracy, false positives, and analyst decision speed.

AR

Mid-level Business Analyst specializing in BI, reporting, and data insights

5y exp
Coca-Cola · University of Massachusetts Boston

Healthcare analytics professional with experience at UnitedHealth Group, focused on turning messy claims, eligibility, and provider data into clean reporting datasets and Power BI dashboards. Combines SQL and Python automation with strong stakeholder alignment around KPI definitions, helping operations teams improve claim turnaround visibility and cost efficiency.

SB

Mid-level Data Analyst specializing in financial and telecom analytics

Remote, USA · 5y exp
AT&T · Lewis University

Analytics candidate with hands-on experience at AT&T building SQL/Python pipelines for churn, usage, billing, and network-performance data at multi-million-row scale. Stands out for combining strong data quality and reconciliation practices with measurable operational impact, including a 30% query runtime improvement and ~8 hours/week of reporting automation savings.

Harrishkumar Loganathan

Mid-level AI/Machine Learning Engineer specializing in FinTech and Generative AI

Remote, USA · 3y exp
Socure · Arizona State University

AI/ML engineer with hands-on ownership of enterprise LLM deployments at Freshworks, including a large-scale RAG chatbot serving 15,000+ users across six departments. Stands out for combining deep production engineering skills—AWS microservices, Kubernetes, observability, retrieval quality, and faithfulness evaluation—with strong cross-functional stakeholder leadership and prior large-scale fraud data pipeline experience at Socure.


Yunjie Liu

Screened

Junior Software Engineer specializing in bioinformatics and full-stack development

Remote · 3y exp
Baylor Genetics · Cornell University

Built and stabilized production data pipelines in clinical genomics, including integrating a qPCR step into Baylor Genetics' workflow with a focus on reliability, turnaround time, and reducing manual intervention. Also has hands-on LLM production experience, creating a Python/OpenAI-based translation evaluation pipeline that reduced manual review time by 70% and improved scoring consistency.

PD

Mid-level Full-Stack Engineer specializing in enterprise SaaS and optimization platforms

Redwood City, CA · 5y exp
C3 AI · Northeastern University

Full-stack engineer with strong enterprise delivery experience across manufacturing and semiconductor use cases, owning deployments from discovery through post-launch support. Stands out for combining traditional product engineering with applied GenAI workflows and data pipeline reliability work, including a manufacturing app that reportedly saved a Fortune 500 customer about $6M and an AI chat panel adopted by 70% of pricing analysts.


Drew Dunn

Screened

Senior AI Engineer specializing in generative AI and production ML systems

Aledo, TX · 14y exp
Elevance Health · Texas Tech University

ML/AI engineer with hands-on ownership of production computer vision, speech, and legal RAG systems. Notably improved a key-duplication CV pipeline enough to unblock commercial launch and remove specialist manual measurement, and also shipped a live Quran recitation detection feature for a product with 1M+ users.


Qichen Liu

Screened

Entry-level engineer specializing in manufacturing, embedded systems, and computer vision

Atlanta, GA · 0y exp
Runergy Solar Manufacturing · Georgia Tech

Operations-to-engineering builder who turned a real dispatcher pain point into a live fleet-tracking and routing prototype using React, FastAPI, WebSockets, and mapping/routing tools. Also building AI workflow-automation products for non-technical office users, combining deterministic validation, sandbox replay, and LLM-assisted workflow splitting. Especially interesting for teams that value scrappy zero-to-one product ownership and practical automation.


Pooja Shindd

Screened

Mid-level Full-Stack Software Engineer specializing in scalable web and AI systems

Illinois, USA · 4y exp
University of Illinois Chicago Technology Solutions · University of Illinois Chicago

Full-stack engineer who has built both a TypeScript-based HR/payroll platform and a production agentic AI support system end to end. Stands out for combining strong product judgment with deep LLM systems thinking: RAG architecture, confidence-based routing, evals, observability, and human-in-the-loop design in a greenfield environment.
