Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Data Quality Professionals

Pre-screened and vetted.

Data Quality Python SQL AWS CI/CD Docker

Sreelekha Vuppala

Screened

Mid-level Data Scientist specializing in Generative AI, MLOps, and cloud data platforms

USA4y exp

CitiusTechArizona State University

“GenAI/ML engineer (CitiusTech) who has deployed production RAG systems for compliance/operations document Q&A, using Pinecone + FastAPI microservices on Kubernetes with strong monitoring and guardrails. Also built a GenAI-powered incident triage/routing solution in collaboration with non-technical stakeholders, achieving 35% faster response times and 40% fewer misclassified tickets, and has hands-on orchestration experience with Airflow and AutoSys.”

A/B Testing Agile Amazon Kinesis Apache Airflow Apache Hadoop Apache Kafka+246

View profile

Revanth Goli

Screened

Senior Data & Backend Engineer specializing in cloud data pipelines and LLM/RAG systems

Morrisville, NC6y exp

Syneos HealthUniversity of Alabama at Birmingham

“Data engineer with end-to-end ownership of large-scale retail and clinical data ingestion/processing on AWS, including real-time streaming and batch pipelines. Delivered measurable outcomes: 20M daily transactions processed, latency cut from 4 hours to 5 minutes, ~70% fewer failures, and 120+ pipelines running at 99.8% reliability with full audit compliance.”

Python Pandas PySpark FastAPI LangChain SQL+97

View profile

Bhargavi Kondaveeti

Screened

Mid-level Data Engineer specializing in big data pipelines and real-time streaming

Dallas, TX6y exp

Johnson & JohnsonUniversity of North Texas

“Data engineer who has owned end-to-end production pipelines processing a few million records/day, using Python/Airflow/SQL/PySpark with Snowflake serving to BI (Power BI). Built resilient external web data collection systems (anti-bot, schema-change detection, backfills) and shipped versioned REST APIs for internal consumers, improving pipeline success rates to 99% through monitoring, retries, and idempotent design.”

Agile Amazon CloudWatch Amazon DynamoDB Amazon Redshift Amazon S3 Amazon SQS+101

View profile

Sai Vardhan Reddy

Screened

Mid-Level Data Engineer specializing in cloud data platforms and governed analytics

5y exp

OptumUniversity of Central Missouri

“Data engineer with Optum experience building end-to-end healthcare data pipelines for HL7/FHIR, processing millions of records daily across Kafka streaming and Databricks/Spark batch. Strong focus on data quality (schema enforcement/validations), reliability (Airflow monitoring/alerts), and analytics-ready serving in Snowflake powering Power BI/Tableau, with CI/CD via Git and Jenkins.”

AWS Amazon EC2 AWS Lambda AWS Glue Amazon S3 Amazon Kinesis+94

View profile

Tanvi Dasaripally

Screened

Mid-level Cloud Data Engineer specializing in Azure/AWS pipelines and medallion architecture

USA4y exp

UnitedHealth GroupSouthern Illinois University Carbondale

“Data engineer focused on reliability and data quality, owning end-to-end pipelines processing ~100k–300k records/day. Implemented robust validation and monitoring that cut reporting issues by ~30%, and built stable external data collection with anti-bot measures, backfills, and schema-change detection while maintaining backward-compatible internal data services.”

Python SQL PySpark Apache Kafka Azure Data Factory AWS+72

View profile

Sriraj Samala

Screened

Mid-level Data Analyst specializing in business analytics and BI

Dayton, OH3y exp

University of DaytonUniversity of Dayton

“Analytics professional with higher education experience at the University of Dayton, focused on turning inconsistent operational data into standardized metrics and recurring dashboards. They combine SQL, Python, and Power BI to automate reporting, improve data integrity, and reduce manual reporting by 30%, with outputs adopted in semester planning and cross-department performance tracking.”

Power BI Tableau Microsoft Excel Python Pandas NumPy+61

View profile

Vinodini Bassetti

Screened

Entry Data Scientist specializing in data engineering and automotive analytics

Bangalore, India1y exp

Tata ElxsiUniversity of Cincinnati

“Frontend-focused candidate with hands-on experience building React and TypeScript dashboards for searching, filtering, and analyzing large datasets in real time. Demonstrates practical performance tuning skills using React DevTools, memoization, debouncing, and pagination, and has also built a Mapbox-based location data dashboard with interactive markers and popups.”

Python SQL PySpark Shell Scripting Git GitHub+73

View profile

Manali Shetye

Screened

Mid-level Software Engineer specializing in AI platforms and enterprise full-stack systems

Fremont, CA5y exp

Trend MicroUniversity of Texas at Arlington

“Full-stack product engineer who has built both operational systems and enterprise AI copilots in production. They owned an AI-powered inventory platform end-to-end, driving a 45% drop in stock issues, and also shipped a Microsoft Teams-based HR/IT copilot using RAG and workflow automation that reduced repetitive support queries by roughly 30%.”

Python C#SQL JavaScript TypeScript PowerShell+149

View profile

Mohammad Kashif

Screened

Junior Data Engineer / Analyst specializing in AI/ML data infrastructure

Houston, Texas1y exp

CallAgent AIUniversity of Texas at Austin

“Built and deployed a compliance-sensitive LLM pipeline that extracts rebate logic from hospital–supplier medical contracts, using multi-layer redaction (regex/NER/dictionary), schema-validated structured outputs, and secure placeholder reinsertion. Hosted models on Amazon Bedrock to avoid retraining on sensitive data and improved both accuracy and cost by splitting the workflow into a lightweight section classifier plus a fine-tuned extraction model, orchestrated with LangChain and evaluated via layered, test-driven agent assessments.”

Agentic AI AWS BigQuery Compliance Data Modeling Data Pipelines+175

View profile

Farhath Banu

Screened

Senior Software Engineer specializing in AI-driven marketing and data platforms

Boston, MA7y exp

PostscriptShadan College of Engineering and Technology

“Backend/data engineer who builds production FastAPI microservices and AWS serverless/Glue pipelines for SMS analytics and marketing segmentation. Led a legacy batch modernization into modular services (FastAPI + Glue/Athena + ClickHouse) using shadow-mode parity checks, feature flags, and incremental rollout. Demonstrated measurable performance wins (12s to sub-second SQL; ~40% CPU reduction) and strong incident ownership with proactive schema-drift prevention.”

Python TypeScript Java C C++FastAPI+127

View profile

Saloni Patadia

Screened

Mid-level Machine Learning Engineer specializing in LLM systems and healthcare data automation

California, USA2y exp

Prime HealthcareUSC

“React performance-focused engineer who contributed performance patches back to an open-source context+reducer state helper after profiling and fixing excessive re-renders in an enterprise project management platform at Easley Dunn Productions. Also built an end-to-end LLM-driven pipeline at Prime Healthcare to normalize millions of supply-chain records, reducing defects by 80% and saving 160+ hours/month.”

LangChain LlamaIndex FAISS Vector Search Semantic Search Prompt Engineering+100

View profile

Neeraj Jawahirani

Screened

Mid-level Data & AI Engineer specializing in healthcare data pipelines and MLOps

FL, USA4y exp

HumanaFlorida State University

“Built and deployed a production LLM-powered clinical note summarization system used by care managers to speed review of 5–20 page unstructured medical records. Implemented safety-focused validation (prompt constraints, rule-based and section-level checks, human-in-the-loop) to reduce hallucinations while maintaining low latency and meeting privacy/regulatory constraints, integrating via APIs into existing clinical tools.”

Agile Amazon CloudWatch Amazon Redshift Amazon S3 Amazon SageMaker Ansible+122

View profile

Siva Manikanta Lakumarapu

Screened

Mid-level AI/ML Engineer specializing in Generative AI and NLP

Dallas, TX5y exp

Gilead SciencesUniversity of North Texas

“AI/LLM engineer with production experience building secure, scalable compliance-focused generative AI systems (GPT-3/4, BERT) including RAG over internal regulatory document bases. Has delivered end-to-end pipelines on AWS with PySpark/Airflow/Kubernetes/FastAPI, emphasizing privacy controls, monitoring, and iterative evaluation (A/B testing). Also partnered closely with bank compliance officers using prototypes to refine NLP summarization/classification and reduce document review time.”

A/B Testing Agile Amazon EC2 Amazon Redshift Amazon S3 Apache Airflow+164

View profile

Hrishikesh Raghunath

Screened

Mid-level Data Engineer specializing in scalable ETL, streaming analytics, and cloud data platforms

Remote, USA7y exp

Dreamline AICalifornia State University, Fullerton

“At Dreamline AI, built and productionized an AWS-based incentive intelligence platform that uses Llama-2/GPT-4 to extract eligibility rules from unstructured state policy documents into structured JSON, then processes them with Glue/PySpark and serves results via Lambda/SageMaker/API Gateway. Designed state-specific ingestion connectors plus schema validation and automated checks/alerts to handle frequent policy/format changes without breaking the pipeline, and partnered with business/analytics stakeholders to deliver interpretable eligibility decisions via explanations and dashboards.”

A/B Testing Amazon CloudWatch Amazon Kinesis Amazon Redshift Amazon S3 Amazon SageMaker+114

View profile

HarshaSree gudapati

Screened

Senior Data Engineer specializing in cloud-native data platforms for finance and healthcare

Charlotte, NC4y exp

Bank of AmericaUniversity of Cincinnati

“Data engineer/backend data services practitioner with Bank of America experience building real-time and batch transaction-monitoring pipelines and APIs (Kafka + databases, REST/GraphQL). Highlights include a reported 45% response-time improvement through performance optimizations and use of Delta Lake schema evolution plus CI/CD (GitHub Actions/Jenkins) and operational reliability patterns like CloudWatch monitoring and dead-letter queues.”

Azure Data Factory AWS Amazon S3 AWS Glue Amazon Redshift AWS Lambda+125

View profile

Prateek Pravanjan

Screened

Junior Machine Learning Engineer specializing in LLM evaluation and GenAI pipelines

Remote1y exp

MercorStevens Institute of Technology

“LLM/agent engineer who built a production LangGraph multi-agent orchestrator connecting GitHub and APM/observability signals with a chain-of-verification loop for root-cause analysis. Emphasizes pragmatic architecture (start simple with state summaries), performance tuning (async LLM calls, Docker), and rigorous evaluation (LLM-as-judge, adversarial testing, hallucination/instruction adherence metrics, tool-call tracing) while iterating with non-technical stakeholders via A/B testing.”

PyTorch Transformers NumPy Scikit-learn Model evaluation Pandas+135

View profile

Molli Dinesh

Screened

Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps

Remote, USA4y exp

Marsh McLennanIllinois Institute of Technology

“Built an AI-driven insurance policy summarization platform at Marsh, taking it end-to-end from messy PDF ingestion/OCR and custom extraction through LLM fine-tuning and AWS SageMaker deployment. Delivered measurable impact (25% reduction in manual review time, 99% uptime) and demonstrated strong production MLOps/LLMOps practices with Airflow/Step Functions orchestration, rigorous evaluation (ROUGE + human review), and continuous monitoring for drift, latency, and hallucinations.”

Python Pandas NumPy Scikit-learn R SQL+132

View profile

Madhav Vaddepalli

Screened

Senior Data Engineer specializing in cloud data platforms and big data pipelines

Seattle, WA8y exp

SafecoFitchburg State University

“Data engineer focused on building reliable, production-grade pipelines and external data collection systems on AWS (S3/Lambda/SQS/Glue/EMR) using PySpark/SQL, serving curated datasets to Snowflake/Redshift for finance and fraud teams. Has operated a large-scale crawler ingesting millions of records/day with anti-bot tactics, schema versioning/quarantine, and CloudWatch/Datadog monitoring, and also shipped a versioned REST API with caching and query optimization.”

Agile Amazon CloudWatch Amazon DynamoDB Amazon EC2 Amazon Redshift Amazon RDS+192

View profile

Sushanth Reddy

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and big data pipelines

Columbus, OH4y exp

Western Alliance BankUniversity of Missouri-Kansas City

“Data engineer focused on production-grade pipelines and data services: ingests millions of records/day into S3, performs SQL/Python quality validation and PySpark/SQL transformations, and serves curated datasets via Athena/Redshift. Has experience hardening external data collection with retries/rate-limit handling and shipping versioned internal data APIs with backward compatibility, monitoring, and CI/CD in early-stage environments.”

Python SQL R Node.js ETL Data pipelines+57

View profile

Nivedita Shainaj Nair

Screened

Mid-level ML Data Engineer specializing in MLOps and scalable healthcare data pipelines

Boston, MA5y exp

CignaNortheastern University

“Data/ML platform engineer with healthcare (Cigna) experience owning an end-to-end pipeline spanning Airflow + Debezium CDC ingestion, PySpark/SQL transformations, rigorous data quality gates, and feature-store/API serving for ML training and inference. Worked at 10+ TB scale and cites a ~30% latency reduction plus stronger reliability via idempotent design, monitoring, and backfill-safe reprocessing; also built pragmatic early-stage data pipelines at Frankenbuild Ventures.”

Agile Alerting Anomaly Detection Apache Airflow Apache Kafka Apache Spark+135

View profile

Shiva Ganduru

Screened

Senior Backend Software Engineer specializing in microservices, Kafka, and cloud-native AWS platforms

USA5y exp

ExperianWestern Illinois University

“LLM/agent engineer with production experience in the insurance claims domain, integrating OpenAI + LangChain into a claims platform to automate unstructured document extraction/classification and cut manual effort by 35%. Built reliable, fault-tolerant AWS/Kubernetes microservices with CloudWatch monitoring plus circuit breakers/retries/fallbacks, and implemented multi-step Spring Boot orchestration with schema validation, confidence gating, and human-in-the-loop handling for low-confidence cases.”

Java Python JavaScript TypeScript SQL Spring Boot+125

View profile

Aswani D

Screened

Mid-level Software Engineer specializing in cloud microservices and data pipelines

5y exp

Johnson & JohnsonIndiana Wesleyan University

“Data engineer/platform builder who has owned production pipelines end-to-end processing millions of records/day, with strong emphasis on data quality (quarantine workflows) and reliability (monitoring, retries, incremental loads). Also designed large-scale external data collection/crawling with anti-bot handling and backfills, and shipped versioned REST data services optimized for performance and developer usability in an early-stage environment.”

Python SQL PL/SQL WebSockets Spring Boot Spring MVC+144

View profile

Nikitha Margadi

Screened

Mid-level Data Engineer specializing in cloud lakehouse, streaming, and MLOps

Texas, USA5y exp

AT&TCal State Fullerton

“Data engineer at AT&T focused on large-scale telecom (5G/IoT) data platforms, owning end-to-end pipelines from Kafka/Azure ingestion through Databricks/Delta Lake transformations to serving analytics and ML. Has operated at very high volumes (~50+ TB/day) and delivered measurable performance gains (25–30% faster processing) plus improved reliability via Airflow monitoring, robust data quality checks, and resilient external data collection patterns (rate limiting, retries, dynamic schemas).”

Python SQL PL/SQL PySpark Apache Spark Apache Kafka+114

View profile

Sai Kumar

Screened

Mid-Level .NET Developer specializing in microservices and cloud-native FinTech/Healthcare systems

5y exp

HumanaUniversity of Central Missouri

“Backend engineer with healthcare and financial services experience (Humana, PNC) who owned production-grade, high-volume ingestion-to-API pipelines end-to-end in C#/.NET and SQL. Strong focus on data quality, handling out-of-order/partial upstream records, and improving reliability/observability via structured logging and telemetry, plus significant SQL performance tuning to reduce peak-load issues.”

.NET Agile Angular AWS AWS Lambda Azure DevOps+95

View profile

Data Engineers Software Engineers Machine Learning Engineers Data Analysts Data Scientists Software Developers Data & Analytics Engineering AI & Machine Learning Executive & Leadership

Need someone specific?

AI Search

Related

Need someone specific?