Pre-screened and vetted.
Mid-level Data Engineer specializing in Analytics & AI/ML
“Data engineer with experience at Sony and Walmart building high-volume, near-real-time analytics and ingestion systems. Has owned end-to-end pipelines from Kafka/Spark streaming through S3/Parquet and Redshift/Looker, emphasizing data quality (Great Expectations), observability (CloudWatch/Azure Monitor), and reliability (Airflow SLAs, retries, checkpointing), and has delivered measurable performance and latency improvements.”
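For context, a minimal sketch of the Kafka-to-Parquet streaming pattern this profile describes; the broker, topic, paths, and schema handling are assumed placeholders, not the candidate's actual pipeline.

```python
# Minimal Spark Structured Streaming sketch: read from Kafka, land Parquet on S3.
# Broker address, topic, and bucket paths are hypothetical.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("kafka-to-parquet").getOrCreate()

events = (
    spark.readStream.format("kafka")
    .option("kafka.bootstrap.servers", "broker:9092")
    .option("subscribe", "events")
    .load()
    .selectExpr("CAST(value AS STRING) AS payload", "timestamp")
)

query = (
    events.writeStream.format("parquet")
    .option("path", "s3a://example-bucket/events/")
    .option("checkpointLocation", "s3a://example-bucket/checkpoints/events/")
    .trigger(processingTime="1 minute")
    .start()
)
```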
Mid-level Data Engineer specializing in Azure ETL/ELT and data warehousing
“Data engineer who has owned end-to-end production pipelines for customer transaction data (~2–5 GB/day) using Python/PySpark/SQL and Airflow, delivering major reliability and speed gains (70% faster reporting; 60–70% fewer data issues). Also built a daily external web-scraping system with anti-bot handling and safe, idempotent Airflow-driven backfills, plus a Python data API optimized with indexing/caching and tested for correctness.”
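A minimal sketch of the safe, idempotent Airflow backfill pattern mentioned above, assuming Airflow 2.x and a partition-overwrite design; the DAG name, path, and schedule are hypothetical.

```python
# Hypothetical sketch of an idempotent, backfill-safe Airflow task: each run
# rebuilds only its own logical-date partition, so re-running a date is safe.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator


def load_partition(ds: str, **_):
    # 'ds' is the logical date; overwrite exactly that partition so retries
    # and backfills never duplicate data.
    target = f"s3://example-bucket/transactions/dt={ds}/"  # placeholder path
    print(f"Rebuilding partition {target}")
    # extract(ds) -> transform -> overwrite partition at `target`


with DAG(
    dag_id="daily_transactions_backfill",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=True,          # enables historical backfills
    max_active_runs=4,     # bound how many backfill runs execute in parallel
) as dag:
    PythonOperator(
        task_id="load_partition",
        python_callable=load_partition,
        retries=2,
    )
```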
Mid-level Data Engineer specializing in real-time analytics and regulated domains
“Data platform engineer focused on large-scale, real-time fraud systems, with hands-on ownership of streaming architectures using Kafka, Spark, Snowflake, and Databricks. Stands out for combining performance tuning and platform automation with LLM/RAG-based enrichment, delivering measurable improvements in latency, fraud-detection accuracy, false-positive rates, and analyst decision speed.”
Mid-level Data Engineer specializing in scalable pipelines, Spark, and cloud data warehousing
“Backend/data platform engineer who recently owned an end-to-end large-scale financial data platform delivering real-time decision support for finance and operations. Has hands-on experience modernizing legacy batch pipelines into AWS cloud-native ELT with parallel-run cutovers, strong data quality controls (dbt-style tests, reconciliation), and measurable improvements in runtime, cost, and SLA compliance. Also builds scalable, secure FastAPI microservices using Docker, ALB-based horizontal scaling, Redis caching, and managed auth with Cognito/Supabase plus Postgres RLS.”
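A minimal sketch of the Redis read-through caching pattern on a FastAPI endpoint described above; the route, key scheme, and TTL are assumptions, not the candidate's actual service.

```python
# Hypothetical read-through cache: check Redis first, fall back to the database,
# then cache the result with a short TTL. Names and TTLs are illustrative.
import json

import redis
from fastapi import FastAPI

app = FastAPI()
cache = redis.Redis(host="localhost", port=6379, decode_responses=True)


def fetch_metrics_from_db(account_id: str) -> dict:
    # Placeholder for the real Postgres query (RLS-scoped in the described setup).
    return {"account_id": account_id, "balance": 0}


@app.get("/accounts/{account_id}/metrics")
def account_metrics(account_id: str) -> dict:
    key = f"metrics:{account_id}"
    cached = cache.get(key)
    if cached:
        return json.loads(cached)             # cache hit
    result = fetch_metrics_from_db(account_id)
    cache.setex(key, 60, json.dumps(result))  # cache for 60 seconds
    return result
```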
“Built and productionized an LLM-powered PDF document Q&A system to eliminate manual searching through long documents, focusing on scalability and answer reliability. Implemented semantic chunking (using headings/paragraphs/tables), overlap, and preprocessing/quality checks to reduce hallucinations, and orchestrated the end-to-end pipeline with Airflow using retries, alerts, and parallel tasks.”
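A minimal sketch of heading-aware chunking with overlap, roughly as this profile describes; the split rule and size thresholds are assumptions, and the real system's table handling and quality checks are not shown.

```python
# Hypothetical semantic chunker: split on headings so chunks follow document
# structure, then window within each section with overlap to preserve context.
import re


def chunk_by_headings(text: str, max_chars: int = 1200, overlap: int = 200) -> list[str]:
    # Split on markdown-style headings (kept at the start of the next section).
    sections = re.split(r"\n(?=#{1,3} )", text)
    chunks: list[str] = []
    for section in sections:
        start = 0
        while start < len(section):
            end = min(start + max_chars, len(section))
            chunks.append(section[start:end])
            if end == len(section):
                break
            start = end - overlap  # overlap keeps context across chunk boundaries
    return chunks
```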
Mid-level Data Engineer specializing in real-time pipelines and cloud analytics
“Researcher from the University of South Dakota who built a production medical RAG system that helps interpret model predictions by retrieving relevant clinical notes and medical literature, overcoming retrieval-accuracy and imaging-dataset challenges through semantic chunking and metadata-driven indexing. Also has hands-on orchestration experience with Airflow and Azure Data Factory, plus a pragmatic approach to LLM evaluation and stakeholder-driven iteration.”
Senior Data Engineer specializing in Azure Lakehouse, Databricks/Spark, and Snowflake
“Data engineer/platform builder with experience across PwC and Liberty Mutual delivering high-volume, production-grade pipelines and real-time data services. Has owned end-to-end streaming + batch architectures on AWS and Azure, including web scraping systems, with quantified reliability gains (99.9% availability, 90%+ error reduction, 30% latency reduction) and strong observability and CI/CD practices.”
Mid-level AI/ML Data Scientist specializing in NLP, computer vision, and risk analytics
“ML/AI engineer with Capital One experience building production-grade customer segmentation and fraud detection systems combining NLP (transformers) and anomaly detection. Strong MLOps and orchestration background (PySpark ETL, MLflow, Airflow, Docker/Kubernetes, Azure ML) with real-time monitoring/alerting and performance optimizations like quantization and caching, plus proven ability to deliver business-facing insights through Power BI/Tableau for marketing stakeholders.”
Mid-level Data Engineer specializing in cloud data pipelines and real-time streaming
“Data engineer with PNC Bank experience owning high-volume financial transaction pipelines end-to-end (Kafka/REST ingestion through Spark/Glue transformations to Redshift serving) for risk and fraud analytics. Built strong reliability and data quality practices (Great Expectations, reconciliation, Airflow alerting, idempotent retries, incremental/windowed processing), reporting 40% ingestion efficiency gains and ~99.9% data accuracy.”
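A minimal sketch of the watermark-driven incremental/windowed processing this profile describes, assuming a control-table watermark and upsert-style writes; the record shape and keys are hypothetical.

```python
# Hypothetical incremental load: each run processes only records newer than the
# last committed watermark, so retries and reruns stay idempotent.
from datetime import datetime, timezone


def load_watermark() -> datetime:
    # In the described setup this would come from a control table; hardcoded here.
    return datetime(2024, 1, 1, tzinfo=timezone.utc)


def save_watermark(ts: datetime) -> None:
    print(f"Committing new watermark: {ts.isoformat()}")


def run_incremental_window(records: list[dict]) -> None:
    watermark = load_watermark()
    new_rows = [r for r in records if r["event_time"] > watermark]
    if not new_rows:
        return
    # Upsert new_rows keyed on a natural key (e.g. transaction_id) so that
    # re-processing the same window never duplicates data.
    save_watermark(max(r["event_time"] for r in new_rows))
```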
Mid-level Business Analyst specializing in healthcare and data analytics
“Analytics candidate with hands-on experience at BCBS building HIPAA-compliant SQL/Snowflake/Tableau pipelines across fragmented legacy healthcare systems. Stands out for turning a 5-day claims reporting process into a near real-time 10-minute dashboard and for pairing strong data engineering discipline with reproducible Python-based churn modeling that drove measurable retention outcomes.”
Mid-level Business Data Analyst specializing in healthcare analytics
“Analytics-focused candidate with strong SQL, Excel, Python, and Tableau skills who supports payroll-, compensation-, and finance-adjacent processes through rigorous data validation and reconciliation. Stands out for uncovering a duplicate-record mapping issue that exposed roughly $250K in revenue leakage and for building repeatable controls, dashboards, and automated checks to improve reporting accuracy.”
Executive technology leader specializing in model risk and regulatory technology
“Candidate is pursuing a CTO role and has helped multiple startups turn early technology concepts into concrete, real-world technical requirements. They cite a systems science and mathematics background, along with experience at JPMorgan Chase, and appear strongest in technical strategy, fleshing out early-stage concepts, and identifying strong people to help teams succeed.”
Mid-level AI/ML Engineer specializing in LLM agents and workflow automation
“AI/LLM engineer with strong healthcare domain depth who has shipped production-grade agents for care coordination and clinical workflow automation. Stands out for combining Knowledge Graph RAG, LangGraph orchestration, and rigorous eval/guardrail systems to improve reliability in high-stakes environments, with measurable gains in review time, hallucination reduction, latency, and clinician adoption.”
Senior Data Engineer specializing in cloud data platforms and regulated analytics
“Data engineer at Capital One building AWS-based real-time and batch pipelines and backend data services for financial/fraud use cases. Has owned end-to-end pipelines processing millions of records/day, implemented dbt/Great Expectations quality gates, and tuned Redshift/Snowflake workloads (cutting query latency ~22–25% and reducing pipeline failures ~30–40%) while supporting 15+ downstream consumers.”
Mid-level Data Engineer specializing in cloud ETL pipelines (Azure, AWS, GCP)
“Data engineer/backend developer who owned end-to-end pipelines and external data collection systems, including API ingestion and large-scale web scraping. Worked at ~50M records/month scale, improving processing speed by 20% and reducing reporting errors by 15%, and shipped a Rust-based internal data API with versioning, caching, and strong validation/observability practices.”
Mid-level Data Engineer specializing in lakehouse ETL and analytics engineering
“Data engineer with strong end-to-end ownership of production lakehouse pipelines (Snowflake + Databricks + Airflow + dbt + Great Expectations), handling 8M+ records/month and 500K+ daily CDC updates. Delivered measurable reliability and efficiency gains (41% cost reduction, freshness improved from 4h to 30m, 35% fewer downstream incidents) and has experience building a lakehouse platform from scratch across 12 source systems.”
Junior Data Engineer specializing in cloud ETL and big data platforms
“Data engineer focused on transit/transportation datasets, building Spark-based pipelines that ingest from Oracle/APIs, apply PySpark data-quality fixes, and publish star-schema fact tables to Azure Data Lake. Experienced troubleshooting complex Spark failures (using checkpointing to manage long lineage) and operating Airflow-driven backfills and GitLab CI deployments for production DAGs.”
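A minimal sketch of using DataFrame checkpointing to cut long Spark lineage during iterative data-quality fixes, the technique this profile mentions; the paths, filter rules, and iteration count are illustrative.

```python
# Hypothetical sketch: periodically materialize and truncate lineage so the
# query plan stays small on long chains of transformations.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("transit-dq-fixes").getOrCreate()
spark.sparkContext.setCheckpointDir("/tmp/spark-checkpoints")

df = spark.read.parquet("/data/raw_trips")  # stand-in for the Oracle/API extracts

for step in range(12):
    # Each pass applies another quality rule; the lineage keeps growing.
    df = df.filter("trip_duration_sec > 0")
    if step % 4 == 3:
        # Checkpoint to storage and cut the lineage graph, avoiding the
        # long-lineage failures described above.
        df = df.checkpoint(eager=True)

df.write.mode("overwrite").parquet("/data/fact_trips")
```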
Mid-level AI/ML & GenAI Engineer specializing in LLMs, RAG, and MLOps
“LLM/agent engineer with production experience in healthcare claims automation, delivering large operational impact (cut case handling from ~8–10 minutes to ~3 minutes, ~2,000 staff hours saved/month at ~3,000 claims/month). Built resilient Azure-based deployments (Azure DevOps CI/CD, Docker/FastAPI, Redis caching, autoscaling, observability) and improved reliability via safety/evaluation frameworks that reduced hallucinations by 32%.”
Senior Business Analytics Consultant specializing in BI, data engineering, and predictive analytics
“Healthcare analytics candidate with hands-on experience turning messy claims, enrollment, and reference data into trusted SQL reporting layers and reproducible Python workflows. They emphasize metric standardization, stakeholder alignment, and operational impact, including ~40% reduction in manual reporting effort and improved forecasting/resource prioritization through high-risk patient segmentation.”
Mid-level Data Analyst specializing in banking and product analytics
“Analytics engineer/data analyst with Bank of America experience turning fragmented financial data across SQL Server, PostgreSQL, Kafka, and flat files into trusted Snowflake/dbt reporting models. Stands out for unifying disputed business definitions like churn and payment success rate, automating manual analysis in Python, and pairing strong data quality rigor with stakeholder adoption through self-service dashboards.”
Mid-level AI/ML Engineer specializing in healthcare and financial ML systems
“ML/AI engineer with hands-on experience shipping both predictive healthcare models and clinical GenAI assistants into production. They combine strong MLOps depth across Azure and AWS with healthcare-specific safety thinking, including PHI guardrails, retrieval grounding, and production monitoring, and they also built internal Python tooling for fraud ML workflows at Capital One.”
Mid-level Data Analyst specializing in financial services analytics
Mid-level Business Analyst specializing in financial data and supply chain analytics
Mid-level AI/ML Engineer specializing in cloud MLOps and real-time ML pipelines