Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Data Validation Professionals

Pre-screened and vetted.

Data Validation Python SQL AWS CI/CD Git

Mohammad Kashif

Screened

Junior Data Engineer / Analyst specializing in AI/ML data infrastructure

Houston, Texas1y exp

CallAgent AIUniversity of Texas at Austin

“Built and deployed a compliance-sensitive LLM pipeline that extracts rebate logic from hospital–supplier medical contracts, using multi-layer redaction (regex/NER/dictionary), schema-validated structured outputs, and secure placeholder reinsertion. Hosted models on Amazon Bedrock to avoid retraining on sensitive data and improved both accuracy and cost by splitting the workflow into a lightweight section classifier plus a fine-tuned extraction model, orchestrated with LangChain and evaluated via layered, test-driven agent assessments.”

Agentic AI AWS BigQuery Compliance Data Modeling Data Pipelines+175

View profile

Ramiz Qudsi

Screened

Principal Data Scientist & Software Engineer specializing in space mission data systems

Boston, MA13y exp

Boston UniversityUniversity of Delaware

“Space/heliophysics ML engineer who built a PyTorch GRU model to propagate solar wind from L1 to the magnetopause with probabilistic outputs for uncertainty quantification, achieving ~25% better CRPS than standard approaches. Also developed production-grade Python ETL and an open-source telemetry processing package for a mission (LEXI), using Docker and GitHub Actions CI/CD and iterating with scientist/engineer stakeholders.”

Python MATLAB Bash SQL PyTorch Scikit-learn+75

View profile

Hrishikesh Raghunath

Screened

Mid-level Data Engineer specializing in scalable ETL, streaming analytics, and cloud data platforms

Remote, USA7y exp

Dreamline AICalifornia State University, Fullerton

“At Dreamline AI, built and productionized an AWS-based incentive intelligence platform that uses Llama-2/GPT-4 to extract eligibility rules from unstructured state policy documents into structured JSON, then processes them with Glue/PySpark and serves results via Lambda/SageMaker/API Gateway. Designed state-specific ingestion connectors plus schema validation and automated checks/alerts to handle frequent policy/format changes without breaking the pipeline, and partnered with business/analytics stakeholders to deliver interpretable eligibility decisions via explanations and dashboards.”

A/B Testing Amazon Athena Amazon CloudWatch Amazon Kinesis Amazon Redshift Amazon S3+114

View profile

Pooja Murigappa

Screened

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps in Financial Services

Austin, TX5y exp

Charles SchwabUniversity of Central Missouri

“ML/LLM engineer at Charles Schwab who built a production loan-advisor chatbot integrated with internal knowledge and loan-calculator APIs, adding strict numeric validation to prevent rate hallucinations and optimizing context to control costs. Also runs ~40 Airflow DAGs orchestrating retraining/ETL/drift monitoring with an automated Snowflake→SageMaker→auto-deploy pipeline, and uses rigorous testing plus canary rollouts tied to business metrics and compliance constraints.”

Amazon DynamoDB Apache Airflow Apache Kafka Apache Spark AWS AWS Glue+183

View profile

Supriya Mattapelly

Screened

Mid-level AI/ML Engineer specializing in GenAI agents, RAG pipelines, and MLOps

USA6y exp

UnitedHealthcareKent State University

“AI/ML engineer who built a production RAG-based internal document intelligence assistant (LangChain + Pinecone) to let employees query enterprise reports in natural language. Demonstrated hands-on pipeline orchestration with Apache Airflow and tackled real production issues like retrieval grounding and latency using tuning, caching, and token optimization, while partnering closely with non-technical business stakeholders through iterative demos.”

A/B Testing Amazon Athena Amazon CloudWatch Amazon EC2 Amazon EMR Amazon Redshift+152

View profile

NAGA SUDHEESTNA PENUMARTHI

Screened

Junior QA Automation Engineer specializing in banking and trading platforms

Bengaluru, India2y exp

BarclaysUniversity of Texas at Dallas

“QA automation engineer with Barclays digital banking experience who owned an end-to-end regression suite across UI, API, and database layers (Selenium/TestNG, REST Assured, SQL) and integrated it into CI/CD (Jenkins/GitLab). Known for preventing high-impact financial defects like duplicate transaction postings by adding backend SQL validations, negative/edge-case coverage, and converting production issues into automated regression tests; also strong in Cypress flake reduction using cy.intercept/cy.session and stable selectors.”

API Testing Agile Anomaly Detection Azure DevOps CI/CD Data Analytics+107

View profile

HarshaSree gudapati

Screened

Senior Data Engineer specializing in cloud-native data platforms for finance and healthcare

Charlotte, NC4y exp

Bank of AmericaUniversity of Cincinnati

“Data engineer/backend data services practitioner with Bank of America experience building real-time and batch transaction-monitoring pipelines and APIs (Kafka + databases, REST/GraphQL). Highlights include a reported 45% response-time improvement through performance optimizations and use of Delta Lake schema evolution plus CI/CD (GitHub Actions/Jenkins) and operational reliability patterns like CloudWatch monitoring and dead-letter queues.”

Azure Data Factory Azure Synapse Analytics AWS Amazon S3 AWS Glue Amazon Redshift+125

View profile

Prateek Pravanjan

Screened

Junior Machine Learning Engineer specializing in LLM evaluation and GenAI pipelines

Remote1y exp

MercorStevens Institute of Technology

“LLM/agent engineer who built a production LangGraph multi-agent orchestrator connecting GitHub and APM/observability signals with a chain-of-verification loop for root-cause analysis. Emphasizes pragmatic architecture (start simple with state summaries), performance tuning (async LLM calls, Docker), and rigorous evaluation (LLM-as-judge, adversarial testing, hallucination/instruction adherence metrics, tool-call tracing) while iterating with non-technical stakeholders via A/B testing.”

PyTorch Transformers NumPy Scikit-learn Model evaluation Pandas+135

View profile

SUMIT MAMTANI

Screened

Mid-level Data Scientist specializing in ML, MLOps, and customer analytics

Tempe, AZ4y exp

QlikArizona State University

“ML/NLP practitioner focused on insurance/claims analytics for a large financial firm, working with millions of fragmented structured and unstructured records. Built production-grade pipelines for entity extraction, entity resolution, and semantic search using Sentence-BERT + vector DB, including fine-tuning with contrastive learning (reported ~15% recall lift) and scalable ETL/containerized deployment on Kubernetes.”

Python Pandas NumPy Scikit-learn TensorFlow PyTorch+117

View profile

Saicharitha Yanamandala

Screened

Mid-Level Software Developer specializing in Java, Cloud, and Microservices

Chicago, IL6y exp

Capital OneChicago State University

“Backend/Python engineer who owned an end-to-end FastAPI + AWS internal natural-language document Q&A system (Textract extraction, embeddings/vector DB, LLM integration) with strong focus on reliability and latency. Hands-on with Kubernetes + GitOps (Argo CD, Helm, rolling updates/auto-rollback) and built/optimized Kafka streaming pipelines using Prometheus/Grafana. Also supported a zero-downtime on-prem to cloud migration with parallel run and gradual traffic cutover.”

API Gateway AWS AWS CloudFormation AWS Lambda Angular Bash+265

View profile

Madhav Vaddepalli

Screened

Senior Data Engineer specializing in cloud data platforms and big data pipelines

Seattle, WA8y exp

SafecoFitchburg State University

“Data engineer focused on building reliable, production-grade pipelines and external data collection systems on AWS (S3/Lambda/SQS/Glue/EMR) using PySpark/SQL, serving curated datasets to Snowflake/Redshift for finance and fraud teams. Has operated a large-scale crawler ingesting millions of records/day with anti-bot tactics, schema versioning/quarantine, and CloudWatch/Datadog monitoring, and also shipped a versioned REST API with caching and query optimization.”

Agile Amazon Athena Amazon CloudWatch Amazon DynamoDB Amazon EC2 Amazon EMR+192

View profile

Aayush Anand

Screened

Intern Full-Stack/Software Engineer specializing in web apps, cloud, and data/ML systems

New York, NY1y exp

The NorthStar GroupNYU

“Built and productionized LLM-driven content intelligence/SEO agents for a high-traffic media platform, automating tagging/summarization/metadata with FastAPI + async orchestration and strict JSON-schema outputs. Demonstrated measurable impact (40% faster publishing, +20% organic traffic in 3 months) and strong reliability practices (offline evals, shadow mode, canaries, fallbacks, idempotency, and monitoring).”

Agile Apache Hadoop Apache Hive Apache Kafka Apache Spark AWS+112

View profile

vinay maruthi

Screened

Mid-level Software Engineer specializing in LLM agents and ERP-integrated workflow automation

New York, NY4y exp

DeloitteUniversity of Central Missouri

“Built and shipped a production LLM-powered agent that automated purchasing and inventory operations by integrating with live ERP data and returning structured, machine-readable outputs usable by downstream systems. Emphasizes real-world reliability through orchestration, strict schemas/validation, confidence-based fallbacks with human handoff, and monitoring/evaluation feedback loops to reduce silent failures and make issues observable.”

Python Go Java SQL PostgreSQL MySQL+57

View profile

Sushanth Reddy

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and big data pipelines

Columbus, OH4y exp

Western Alliance BankUniversity of Missouri-Kansas City

“Data engineer focused on production-grade pipelines and data services: ingests millions of records/day into S3, performs SQL/Python quality validation and PySpark/SQL transformations, and serves curated datasets via Athena/Redshift. Has experience hardening external data collection with retries/rate-limit handling and shipping versioned internal data APIs with backward compatibility, monitoring, and CI/CD in early-stage environments.”

Python SQL R Node.js ETL Data pipelines+57

View profile

Nivedita Shainaj Nair

Screened

Mid-level ML Data Engineer specializing in MLOps and scalable healthcare data pipelines

Boston, MA5y exp

CignaNortheastern University

“Data/ML platform engineer with healthcare (Cigna) experience owning an end-to-end pipeline spanning Airflow + Debezium CDC ingestion, PySpark/SQL transformations, rigorous data quality gates, and feature-store/API serving for ML training and inference. Worked at 10+ TB scale and cites a ~30% latency reduction plus stronger reliability via idempotent design, monitoring, and backfill-safe reprocessing; also built pragmatic early-stage data pipelines at Frankenbuild Ventures.”

Agile Alerting Anomaly Detection Apache Airflow Apache Kafka Apache Spark+135

View profile

Shiva Ganduru

Screened

Senior Backend Software Engineer specializing in microservices, Kafka, and cloud-native AWS platforms

USA5y exp

ExperianWestern Illinois University

“LLM/agent engineer with production experience in the insurance claims domain, integrating OpenAI + LangChain into a claims platform to automate unstructured document extraction/classification and cut manual effort by 35%. Built reliable, fault-tolerant AWS/Kubernetes microservices with CloudWatch monitoring plus circuit breakers/retries/fallbacks, and implemented multi-step Spring Boot orchestration with schema validation, confidence gating, and human-in-the-loop handling for low-confidence cases.”

Java Python JavaScript TypeScript SQL Spring Boot+125

View profile

Nikitha Margadi

Screened

Mid-level Data Engineer specializing in cloud lakehouse, streaming, and MLOps

Texas, USA5y exp

AT&TCal State Fullerton

“Data engineer at AT&T focused on large-scale telecom (5G/IoT) data platforms, owning end-to-end pipelines from Kafka/Azure ingestion through Databricks/Delta Lake transformations to serving analytics and ML. Has operated at very high volumes (~50+ TB/day) and delivered measurable performance gains (25–30% faster processing) plus improved reliability via Airflow monitoring, robust data quality checks, and resilient external data collection patterns (rate limiting, retries, dynamic schemas).”

Python SQL PL/SQL PySpark Apache Spark Apache Kafka+114

View profile

Krishnamraju Penumatsa

Screened

Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines

Fort Worth, TX6y exp

American AirlinesUniversity of North Texas

“Data engineer currently at American Airlines who built and owned end-to-end flight operations and booking data pipelines (batch + real-time) using Azure Data Factory, Kafka, Spark/Databricks, Synapse, and Snowflake—processing hundreds of GBs/day. Strong focus on reliability and data quality (idempotency, checkpointing, retries, validation/alerts) and delivered near-real-time analytics powering Power BI dashboards; previously helped stand up an early-stage data platform at Sysco on AWS (Glue/S3/Redshift) with Airflow and Jenkins CI/CD.”

Amazon Athena Amazon DynamoDB Amazon Kinesis Amazon Redshift Amazon S3 Apache Airflow+118

View profile

Shakhzod Kushimov

Screened

Junior Business & Operations Analyst specializing in banking analytics

Baton Rouge, LA2y exp

Currency BankUniversity of Florida

“Analytics professional with Bank of America experience analyzing the digital card replacement journey at scale, transforming 20M event records across 900K clients into actionable reporting and recommendations. Stands out for combining SQL/Python data engineering, customer journey analytics, and executive-facing storytelling strong enough to get senior leadership to adopt most of their proposed app improvements.”

SQL Python Tableau Power BI Business Intelligence Dashboard Development+56

View profile

Firoz Shaik

Screened

Mid-level Data Analyst specializing in business intelligence and customer analytics

4y exp

Molina HealthcareUniversity of Missouri-Kansas City

“Healthcare-focused data analyst with hands-on experience at Molina Healthcare building SQL and Python workflows for retention and churn analytics. They combined enrollment, CRM, and claims data into Power BI reporting, automated predictive churn analysis, and tied their work to measurable outcomes including faster processing, better reporting accuracy, and reduced churn.”

SQL Python Power BI ETL Data Analysis Data Modeling+101

View profile

Anudeepreddy Maramreddy

Screened

Mid-level NetSuite Consultant specializing in ERP implementations and integrations

North Carolina, USA4y exp

Bank of AmericaUniversity of Cincinnati

“Enterprise NetSuite Financials professional with hands-on ownership of GL configuration, month-end close, reporting, and production support in a multi-subsidiary environment at Bank of America. Particularly strong in stabilizing finance operations under deadline pressure, improving close efficiency, and building audit-ready reporting and controls across NetSuite and integrated systems.”

JavaScript Workflow Automation OCR Machine Learning Predictive Analytics REST APIs+164

View profile

Yuchen Wang

Screened

Intern Software Engineer specializing in full-stack development and AI/ML

New York, NY1y exp

AdasEcoNYU

“Built and maintains an AI Finance Tracker end-to-end as a solo full-stack product owner, from Figma designs and React frontend to Flask APIs, Firestore, auth, deployment, and AI insights. Stands out for combining product instinct with pragmatic engineering decisions like pre-aggregating financial data to control LLM costs and adding OCR receipt scanning based on real user feedback.”

JavaScript TypeScript React Jest Python Java+125

View profile

Bharath Seshavarapu

Screened

Mid-level Salesforce Developer specializing in CRM automation and integrations

Dallas, TX5y exp

Human CoalitionFlorida State University

“Salesforce-focused engineer with hands-on depth across Service Cloud, OmniScript/DataRaptor, LWC, Aura, and Apex. Particularly strong in building metadata-driven workflows that let operations teams change intake flows without developer tickets, while still making disciplined architecture decisions around when custom code is actually warranted.”

Salesforce REST API CI/CD CRM Automation Stripe+95

View profile

Michael Yeung

Screened

Mid-level Software Engineer specializing in FinTech trading platforms

San Francisco, CA4y exp

Wells FargoUC Irvine

“Built and deployed internal trading tools at Wells Fargo that reduced manual production-support dependency for trader configuration workflows. Brings hands-on experience in financial systems, data quality, and production incident resolution, including building 400+ SQL validation rules and designing an internal RAG assistant for engineering documentation.”

C#SQL TypeScript JavaScript Python .NET+57

View profile

Software Engineers Data Engineers Machine Learning Engineers Data Analysts Software Developers Data Scientists Engineering Data & Analytics AI & Machine Learning Education

Need someone specific?

AI Search

Related

Need someone specific?