
Vetted ELT (Extract, Load, Transform) Professionals

Pre-screened and vetted.

ELT (Extract, Load, Transform), Python, ETL, SQL, AWS, Docker

Pranit Chetta

Screened

Senior Full-Stack Java Engineer specializing in cloud-native AI and enterprise platforms

Wilmington, DE | 11y exp

JPMorgan Chase | Gujarat Technological University

“Full-stack product engineer who owned a live-events digital ticketing platform end-to-end, including blockchain-based ticket validation and high-traffic booking flows. Stands out for combining Angular/React frontend work with Java/Spring Boot backend architecture, plus strong production reliability practices around concurrency control, queues, observability, and UX optimization.”

Java, C, C++, JavaScript, TypeScript, SQL, +308

AnilKumar Kanakadandila

Screened

Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps

San Francisco Bay Area, CA | 5y exp

Verizon | California State University

“Built a production RAG-based “unified assistant” that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.”

Python, Pandas, NumPy, Scikit-learn, SQL, MySQL, +105

shiva kumar kotha

Screened

Mid-level AI/ML Engineer specializing in Generative AI and healthcare data

NJ, USA | 6y exp

Johnson & Johnson | Wichita State University

“Built and deployed a production RAG-based document Q&A system on Azure OpenAI to help business teams search thousands of PDFs/Word files, using Qdrant vector search, MongoDB, and a Flask API. Demonstrates strong production engineering (streaming large-file ingestion, parallel preprocessing, monitoring/retries) plus systematic prompt/embedding/chunking experimentation to improve accuracy and reduce hallucinations, and has hands-on orchestration experience with ADF/Airflow/Databricks/Synapse.”

Analytics, API Integration, API Testing, AWS, Azure Data Factory, Azure Synapse Analytics, +158

Ananya Bojja

Screened

Mid-level AI/ML Engineer specializing in healthcare analytics and MLOps

USA | 4y exp

Cigna | University of New Hampshire

“AI/ML engineer at Cigna Healthcare building a production, HIPAA-compliant LLM-powered clinical insights platform that summarizes unstructured medical notes using a fine-tuned transformer + RAG on AWS. Demonstrates strong end-to-end MLOps and cloud optimization (distillation, Spot/Lambda/Auto Scaling) with quantified outcomes (~28% accuracy lift, ~40% less manual review, ~25% lower ops cost) and strong clinician-facing explainability via SHAP and dashboards.”

A/B Testing, Agile, API Integration, Apache Airflow, Apache Kafka, Apache Spark, +148

Joshua Spatz

Screened

Executive Technology Leader (CTO) specializing in IoT, enterprise systems, and digital transformation

Miami, FL | 18y exp

Aroma360 | University of Florida

“Founder of an LLC operating as a consulting firm providing fractional CTO services to startups, giving them parallel exposure to multiple early-stage companies. Has direct experience with MVP development, building org structures from scratch, and supporting early fundraising, and is exploring a pivot from consulting into a scalable product business while staying engaged with the VC/accelerator ecosystem.”

Agile, Angular, Asana, Automated Testing, Bash, Budgeting, +98

Neimisha Konda

Screened

Senior Data Engineer specializing in Palantir Foundry and Snowflake for regulated industries

USA | 5y exp

American Express | University of Massachusetts Boston

“Data engineer focused on high-volume transaction pipelines (2M+ per day) using Snowflake/Snowpipe, Spark/PySpark, Kafka, and Airflow, with a strong emphasis on schema/data-quality enforcement and reliability improvements. Also built a greenfield compliance-focused RAG solution, using CloudWatch monitoring and adding ingestion validation to prevent malformed OCR documents from degrading search quality.”

Snowflake, SQL, PostgreSQL, MySQL, NoSQL, Apache Spark, +109

Manasa Gunreddy

Screened

Senior Data Engineer specializing in cloud data platforms and real-time streaming

6y exp

HCA Healthcare | Wright State University

“Data engineer in healthcare (HCA) who owned end-to-end Azure-based pipelines at very large scale (50M+ daily claims/patient records). Strong focus on reliability: schema-drift fail-fast validation, quarantine layers, and Python/SQL data quality checks that reduced issues ~25%, plus performance tuning in Databricks/PySpark and versioned serving in Synapse for downstream consumers.”

AWS, AWS CloudFormation, AWS CodePipeline, AWS Glue, AWS IAM, AWS Lambda, +136

Cary Burdick

Screened

Senior Data Scientist specializing in data engineering and analytics

Chicago, IL | 6y exp

USDA | Auburn University

“Data/NLP practitioner with experience in both financial services (Truist) and government (USDA), including an NLP-driven analysis of EU regulations to anticipate US regulatory focus and a major redesign/cleaning of complex pathogen lab-test public datasets. Built production data-quality pipelines with Dagster, Pandera, and Azure Synapse, and is comfortable validating hypotheses with historical backtesting and SME-driven quality controls.”

Python, PySpark, Pandas, NumPy, R, SciPy, +53

Sreenija Karnati

Screened

Mid-level Data Analyst and Data Engineer specializing in healthcare and financial analytics

3y exp

UnitedHealth Group | University of North Texas

“Analytics professional with healthcare and operations experience who turns messy enterprise data from platforms like Teradata, GCP, SQL Server, and Snowflake into trusted reporting layers and reproducible analysis workflows. They combine SQL, Python, PySpark, Power BI, and Tableau to improve reporting accuracy and performance, including a 30% dashboard refresh improvement and 20-25% accuracy gains in healthcare reporting.”

SQL, Python, Power BI, Snowflake, Databricks, Business Intelligence, +89

Pavan Punna

Screened

Mid-level AI/ML Engineer specializing in LLMs, MLOps, and healthcare-fintech AI

Dallas, TX | 5y exp

Federal Soft Systems | Concordia University

“Built and owned a production GPT-4 RAG assistant for clinical and enterprise query resolution, taking it from initial experiment to deployment, monitoring, and iterative improvement. Their work cut resolution time from 45 minutes to under 2 minutes, achieved roughly 95% accuracy, and scaled to thousands of additional monthly queries while emphasizing safety and trust in a sensitive clinical domain.”

Python, SQL, Java, Scala, Bash, PyTorch, +124

Rushabh Thakkar

Screened

Mid-level Machine Learning Engineer specializing in NLP, computer vision, and LLMs

New York City, NY | 3y exp

Wayfair | Stevens Institute of Technology

“Wayfair ML/AI engineer who has shipped and operated production LLM systems for both internal analytics and customer-facing assistants. Stands out for combining strong RAG/retrieval engineering with production-grade platform work—improving trust, reducing latency by ~30%, and cutting ad hoc reporting demand by ~50%.”

Machine Learning, Deep Learning, Natural Language Processing, Computer Vision, Large Language Models, Python, +168

Siva Harini Sri Janaki Raman

Screened

Mid-level Data Engineer specializing in cloud data platforms

Dallas, TX | 3y exp

CVS Health | Texas Tech University

“Built an AI-powered internal support assistant at CVS Health using GPT-4, LangChain, and Pinecone, applying RAG, validation, and monitoring to reduce repetitive support tickets while protecting sensitive healthcare data. Stands out for a pragmatic approach to AI engineering: using multi-agent and LLM workflows to accelerate development while keeping systems constrained, observable, and production-friendly.”

Python, SQL, R, AWS, Amazon S3, AWS Glue, +110

PHANINDRA KETHAMUKKALA

Screened

Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI

USA | 4y exp

GE HealthCare | Franklin University

“LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.”

A/B Testing, Agile, AI Agents, Amazon Athena, Apache Kafka, Apache Spark, +170

PAVAN VARMA PENMETHSA

Screened

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

New York City, NY | 6y exp

Avanade | University of North Texas

“Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.”

Machine Learning, Generative AI, Large Language Models (LLMs), Prompt Engineering, Retrieval-Augmented Generation (RAG), Embeddings, +131

Krishna Kandlakunta

Screened

Mid-level Data Scientist specializing in MLOps, LLM/RAG applications, and deep learning

United States | 5y exp

Citigroup | University of North Texas

“Built and deployed a production compliance automation RAG system (at Citi) that generates citation-backed, schema-validated risk summaries for regulatory document review. Emphasizes regulated-environment reliability with retrieval-only grounding, abstention, confidence thresholds, and immutable audit logging, plus orchestration using LangChain/LangGraph and Airflow. Reported ~60% reduction in compliance review effort while maintaining high precision and traceability.”

A/B Testing, Agile, Anomaly Detection, Apache Hadoop, Apache Hive, Apache Kafka, +167

Naga Yanala

Screened

Mid-level Data Engineer specializing in cloud data pipelines and analytics platforms

Texas, USA | 5y exp

Molina Healthcare | Southeast Missouri State University

“Data engineer with healthcare and enterprise experience (Molina Healthcare, Dell Technologies) building and operating high-volume batch + streaming pipelines across AWS and Azure. Strong focus on data quality (schema validation, fail-fast checks), reliability (monitoring/alerts, retries), and performance tuning (Spark/partitioning), with measurable runtime reduction and improved downstream trust.”

Python, SQL, PySpark, Bash, ETL, Data pipelines, +85

Sai Kavyusha Ponnagant

Screened

Mid-level Data Engineer specializing in cloud data pipelines and financial services warehousing

Chicago, IL | 4y exp

Charles Schwab | DePaul University

“Data engineer (Charles Schwab) who took ownership of an unstable, ambiguous nightly financial data pipeline and rebuilt it into a reliable, incremental AWS Glue/Airflow/Redshift system feeding Power BI. Created a custom Python data-quality framework with hard-stop gating and schema drift detection, improving integrity (99.9%), cutting runtime (~20%), and reducing incidents/tickets (35% fewer schema-related dashboard incidents; 30% fewer investigations).”

Python, SQL, Amazon S3, AWS Glue, Amazon Redshift, Amazon Athena, +73

Yaswanth Thota Thota

Screened

Mid-level Data Analyst specializing in financial risk and healthcare analytics

AZ, USA | 4y exp

Wells Fargo | Arizona State University

“AI/ML engineer focused on real-time, production-grade LLM systems, with a robotics-adjacent mindset around latency/accuracy tradeoffs and modular pipelines. Built a scalable RAG-based assistant orchestrated as microservices on Kubernetes with Kafka async messaging, ONNX/quantization optimizations, and monitoring (Prometheus/Grafana), citing a ~35% hallucination reduction; has also experimented with ROS Noetic/Gazebo to understand ROS concepts.”

A/B Testing, Agile, Amazon Redshift, Apache Airflow, Apache Kafka, Azure Monitor, +117

UMESH KAMISETTY

Screened

Mid-level Data Engineer specializing in cloud lakehouse and streaming platforms

Seattle, WA | 5y exp

First United Bank | Cleveland State University

“Data engineer focused on building production-grade pipelines on AWS (Kafka/Kinesis/Glue/S3) through to curated serving layers in Snowflake and Delta Lake. Emphasizes automated data quality validation (PySpark + CI/CD), modular dbt transformations for analytics (customer spending, risk metrics), and operational reliability with CloudWatch and DLQs; data consumed by BI tools and ML pipelines for fraud detection and risk analytics.”

Python, PySpark, SQL, Shell Scripting, AWS, Amazon S3, +146

Harshitha Parupalli

Screened

Mid-level Data Engineer specializing in multi-cloud real-time and batch data pipelines

Jersey City, NJ | 4y exp

Elevance Health | NJIT

“Data engineer with healthcare domain experience who owned 100M+ record pipelines end-to-end (Kafka/Kinesis/ADF → PySpark/dbt validation → Spark SQL transforms → Snowflake/Power BI serving). Built production-grade reliability practices (Airflow orchestration, CloudWatch/Grafana monitoring, pytest + contract/regression tests, idempotent ingestion/backfills) and delivered measurable improvements: 35% lower latency and 40% better query performance.”

Python, SQL, Shell Scripting, R, Scala, Java, +160

Mohan Naik Megavath

Screened

Mid-level Data Engineer specializing in real-time pipelines and cloud data platforms

Remote, USA | 4y exp

Truist | Elmhurst University

“Backend engineer with hands-on experience building secure Python/Flask services (sessions, JWT, RBAC) and optimizing PostgreSQL/SQLAlchemy performance, including custom SQL using CTEs/window functions profiled via EXPLAIN ANALYZE. Also integrates LLM features via OpenAI/Azure into backend systems and improves scalability with RabbitMQ-driven async processing, caching, and multi-tenant data isolation patterns.”

Amazon Athena, Amazon DynamoDB, Amazon EC2, Amazon Redshift, Amazon S3, AngularJS, +137

MOUNIKA SAI MEKALA

Screened

Junior Data Analyst specializing in financial and operational analytics

Kansas, USA | 3y exp

KPMG | University of Central Missouri

“Analytics professional with experience at KPMG turning messy operational and financial data from SQL Server and AWS S3 into clean reporting datasets and automated Python workflows. They combine SQL, Python, Power BI, and experimentation methods to deliver stakeholder-aligned KPI dashboards and marketing performance insights with a strong focus on data integrity and reproducibility.”

SQL, Python, Pandas, NumPy, SciPy, R, +103

Dharanidharan Loganathan

Screened

Senior Python Developer specializing in data engineering, MLOps, and cloud platforms

Dallas, TX | 13y exp

CBRE | Anna University

“Backend/data engineer with production experience building secure Django/DRF APIs (JWT RS256 + rotating refresh tokens), background processing with Celery, and strong reliability practices (timeouts, retries/backoff, structured logging, audit trails). Has delivered AWS solutions spanning Lambda + ECS with IaC/CI-CD and built Glue/PySpark ETL pipelines with schema evolution and data-quality quarantine patterns; also modernized a legacy SAS pipeline to Python/PySpark with parallel-run parity validation and phased rollout.”

Python, C#, C++, Go, Java, JavaScript, +170

Anuj Shah

Screened

Senior Data Analyst specializing in cloud data platforms, experimentation, and predictive analytics

GA, USA | 9y exp

UnitedHealth Group | Northwestern Polytechnic University

“Healthcare data/ML practitioner with experience at UnitedHealth Group building production ETL and streaming pipelines (Python, BigQuery, Kafka) that unify EHR, IoT device, and lab data for patient risk prediction. Also implemented embedding-based semantic search/linking for noisy clinical notes via domain adaptation and rigorous validation with clinical stakeholders; previously built churn prediction at DirecTV using XGBoost.”

Python, SQL, R, Apache Spark, PySpark, Apache Kafka, +111

