Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Azure Synapse Analytics Professionals

Pre-screened and vetted.

Azure Synapse Analytics Python SQL Amazon S3 Power BI AWS

Lalithya Manasa Patri

Screened

Senior Data Engineer specializing in cloud ETL and real-time streaming pipelines

Austin, TX5y exp

eBayTexas Tech University

“Data engineer with eBay experience owning end-to-end pipelines for real-time order and user behavior analytics at 10M+ records/day. Strong in PySpark/SQL transformations, Airflow reliability patterns, and production observability (CloudWatch), with measurable outcomes including improved data quality and 30–40% query performance gains. Also built Python data APIs for analytics/ML consumers with versioning and backward compatibility.”

Python SQL Java Scala R Apache Spark+97

View profile

Travoy Spelling

Screened

Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP

Texarkana, TX10y exp

TredenceUniversity of Texas at Austin

“ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).”

A/B Testing API Development AWS AWS Lambda AWS Step Functions Azure Data Factory+247

View profile

Saiteja Gaddam

Screened

Mid-Level Data Engineer specializing in cloud data platforms and streaming analytics

3y exp

IntuitUniversity at Buffalo

“Data engineer (Intuit) who owned an end-to-end telemetry and subscription analytics platform processing ~22M events/day, built on Kinesis/S3/Glue/Spark/Airflow/Redshift. Strong focus on reliability and data quality (schema drift controls, quarantine layers, idempotent reruns) and performance tuning, achieving a reporting latency reduction from ~15 minutes to under 4 minutes while enabling revenue and churn analytics for business teams.”

Scala Hibernate JDBC JSON HTML CSS+120

View profile

sai venkata

Screened

Senior Data Engineer specializing in cloud lakehouse and real-time streaming pipelines

Texas, USA6y exp

CVS HealthUniversity of Central Missouri

“Senior data engineer with experience in both healthcare (CVS Health) and financial services (Bank of America), building large-scale Azure lakehouse pipelines (30+ EHR sources, ~5TB) and real-time streaming services (Event Hubs/Kafka) for patient vitals. Strong focus on reliability and data quality (Great Expectations, monitoring/alerting, schema drift automation), with measurable outcomes like 50% runtime reduction and 99%+ uptime for regulatory reporting pipelines.”

Python SQL Scala Java Shell Scripting Apache Spark+117

View profile

jahnavi Vasala

Screened

Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines

San Diego, CA6y exp

IntuitCleveland State University

“Data engineer with Intuit experience owning end-to-end, high-volume financial data pipelines (API/S3 ingestion, Airflow orchestration, Spark/PySpark + SQL transforms, Snowflake marts). Strong focus on reliability and data quality—achieved 99.8% SLA and cut discrepancies by 35% using Great Expectations, reconciliation, schema versioning, and automated backfills; also built near real-time Kafka/API data services with CI/CD and observability.”

Python SQL PySpark Scala Shell scripting Apache Spark+87

View profile

Biplob Bidari

Screened

Senior Data Engineer specializing in FinTech analytics and ML data platforms

USA5y exp

Goldman SachsUniversity of the Cumberlands

“ML/AI engineer with Goldman Sachs experience building production fraud detection and RAG-based trading insights systems end-to-end. Stands out for combining real-time ML infrastructure, GenAI retrieval systems, and compliance-aware design, with measurable impact including nearly 25% false-positive reduction and improved analyst productivity.”

Python Pandas NumPy PySpark SQL Bash+139

View profile

Palak Siroya

Screened

Senior Site Reliability Engineer specializing in Azure cloud reliability and data analytics

Renton, WA10y exp

MicrosoftCentral Washington University

“AppSec-focused customer advisor with hands-on experience integrating SAST/DAST/SCA into production CI/CD (Azure DevOps) and designing secure agent/scanning deployments in AWS (least-privilege IAM, private subnets, VPC endpoints). Demonstrates strong incident troubleshooting using logs/metrics/traces to diagnose load-related failures (timeouts/retry storms) and drive durable fixes, while tailoring risk/tradeoff communication across engineering, security, and leadership stakeholders.”

Automation Azure Data Factory Azure DevOps Azure SQL Database CI/CD C+125

View profile

Praveen Nutulapati

Screened

Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems

New York, NY6y exp

JPMorgan ChaseUniversity of Central Missouri

“Built and deployed a production multi-agent RAG system at JPMorgan Chase to automate regulated credit analysis and compliance clause discovery across large internal policy/document libraries. Implemented LangGraph-based supervisor orchestration with structured state management (Azure OpenAI) to support long-running, resumable workflows, plus hybrid retrieval + re-ranking and guardrails for reliability. Strong at evaluation/observability (trace logging, LLM-judge, HITL) and at communicating results to non-technical stakeholders via Power BI embeds and Streamlit prototypes.”

A/B Testing Agile Amazon Bedrock Amazon EC2 Amazon EMR Amazon RDS+184

View profile

Rahul Reddy

Screened

Senior Data Engineer specializing in cloud data platforms and big data pipelines

New York, NY6y exp

CVS HealthSouthern Arkansas University

“Data engineer with healthcare (CVS Health) experience who migrated production PySpark workloads to native BigQuery SQL and built a Great Expectations-based validation microservice on GKE (Flask + REST) integrated into Cloud Composer. Has operated high-volume pipelines (~300–400GB/day) and designed external vendor ingestion on AWS (Lambda/Step Functions/Glue) with schema-drift detection, alerting, and backfill-safe controls to protect downstream Snowflake/BigQuery tables.”

Python Java SQL MySQL PostgreSQL Apache Hive+118

View profile

Bhanu Chander

Screened

Senior Data Engineer specializing in cloud data platforms and real-time pipelines

New York, NY6y exp

DisneyIndiana Wesleyan University

“Data engineer focused on reliability and observability, building end-to-end pipelines processing millions of records/day from sources like S3 and Kafka. Has hands-on experience with Airflow-based data quality automation, PySpark/Databricks transformations, and shipping versioned Python REST APIs deployed via Docker/Kubernetes with CI/CD (Jenkins) and monitoring (CloudWatch/Azure Logs).”

Python SQL Scala C#JavaScript Java+140

View profile

Vishnu Varma

Screened

Senior AI/ML Engineer specializing in LLMs, GenAI, and MLOps

Milpitas, California8y exp

DatabricksCampbellsville University

“AI/ML engineer (Cognizant) who built a production, real-time credit card fraud detection platform combining deep-learning anomaly detection with an LLM-based explanation layer. Strong focus on regulated deployment: addressed class imbalance and feature drift, and added guardrails (SHAP/structured inputs, fine-tuning on analyst reports, rule-based validation) to keep explanations accurate and compliant. Orchestrated the full pipeline with Airflow + Databricks/Spark and used MLflow/Prometheus plus A/B and shadow deployments for measurable reliability.”

Python SQL PySpark Bash TensorFlow PyTorch+106

View profile

Rishitha Reddy K

Screened

Mid-level Data Scientist specializing in risk, forecasting, and segmentation across finance and healthcare

McLean, Virginia5y exp

Capital OneUniversity of Cincinnati

“Data/ML engineer with experience across pharma (Dr. Reddy Laboratories) and financial services (Cincinnati Financial, Capital One), building production NLP and entity-resolution systems that connect messy unstructured text with enterprise SQL data. Delivered semantic search with BERT + vector DB and domain fine-tuning (reported ~35% relevance lift), and builds robust pipelines using Airflow/dbt/Spark with strong validation, monitoring, and stakeholder-aligned rollout practices.”

Python R SQL Scala Java Scikit-learn+139

View profile