
Vetted Data & Analytics Professionals

Pre-screened and vetted.


Sowjanya Ande

Screened

Mid-level Business Analyst specializing in finance, insurance, and data analytics

Rhode Island, USA · 4y exp

Liberty Mutual · Wilmington University

“Business/data analyst with experience at KPMG and Liberty Mutual, focused on financial reporting, data quality, and analytics automation. Has built SQL and Python workflows for large transaction datasets, reduced manual reporting effort by 15+ hours per week, and translated ambiguous business questions into standardized KPIs and Power BI dashboards used for decision-making.”

Requirements Gathering, Process Improvement, Data Analysis, Python, R, SQL (+115 more)

SAITEJA MALLEMPUDI

Screened

Senior Data Scientist and AI/ML Engineer specializing in GenAI and cloud ML

Chicago, IL · 6y exp

BMO · Lewis University

“ML/AI engineer with hands-on experience owning systems from experimentation through deployment and monitoring, including a Bank of Montreal project that improved timely interventions by 12%. Also brings GenAI/RAG experience with evaluation and safety guardrails, plus clinical NLP pipeline work extracting medication data from notes for patient risk prediction.”

Python, SQL, PySpark, Scala, Bash, Shell Scripting (+153 more)

Ashwitha E

Screened

Junior Data Scientist specializing in fraud analytics and cloud data platforms

Dallas, TX · 3y exp

Bank of America · University of North Texas

“Built and deployed production LLM-powered document summarization/classification systems using embeddings, vector databases (RAG-style retrieval), and automated evaluation (BERTScore/ROUGE), with a focus on monitoring and scalable cloud pipelines. Also partnered with a fraud analytics team to deliver a transaction anomaly detection solution, translating model outputs into Power BI dashboards and actionable KPIs while iterating on thresholds and alerts based on stakeholder feedback.”

Python, SQL, R, Machine Learning, Predictive Modeling, Feature Engineering (+105 more)

Anuj Shah

Screened

Senior Data Analyst specializing in cloud data platforms, experimentation, and predictive analytics

GA, USA · 9y exp

UnitedHealth Group · Northwestern Polytechnic University

“Healthcare data/ML practitioner with experience at UnitedHealth Group building production ETL and streaming pipelines (Python, BigQuery, Kafka) that unify EHR, IoT device, and lab data for patient risk prediction. Also implemented embedding-based semantic search/linking for noisy clinical notes via domain adaptation and rigorous validation with clinical stakeholders; previously built churn prediction at DirecTV using XGBoost.”

Python, SQL, R, Apache Spark, PySpark, Apache Kafka (+111 more)

Kamal Ede

Screened

Mid-level Data Engineer specializing in cloud data platforms, Spark, and streaming pipelines

MO, USA · 4y exp

S&P Global · University of Central Missouri

“Data/MLOps engineer (Cognizant background) who owned an AWS/Airflow/Snowflake healthcare transactions pipeline processing ~8–10M records/day and cut pipeline/data-quality incidents by ~33%. Also built and deployed a production FastAPI model-inference service on Kubernetes (Docker, HPA) with strong observability (Prometheus/Grafana), versioned endpoints, and resilient backfill/idempotent external data ingestion patterns.”

Python, PySpark, SQL, Scala, Batch Processing, Data Transformation (+119 more)

Ansh Krishna

Screened

Intern Data Scientist specializing in ML systems and LLM-powered analytics

Noida, India · 1y exp

Data Security Council of India · USC

“Built an autonomous decision analytics LLM agent for end-to-end tabular binary classification, using RAG (FAISS) to retain context across multi-step queries. Deployed as a FastAPI service with production-style reliability features (schema-aware validation, fallbacks, retries, structured outputs) plus offline/online evaluation and monitoring to reduce analysis time and improve consistency versus stateless approaches.”

A/B Testing, Artificial Intelligence, Backend Development, C++, Cloud Computing, Data Structures and Algorithms (+76 more)

Omkarreddy Lakkireddy

Screened

Mid-level Data Engineer specializing in cloud data pipelines and streaming

Charlotte, NC · 5y exp

Wells Fargo · University of North Texas

“Data engineer with experience at Wells Fargo and Accenture owning end-to-end production pipelines processing hundreds of millions of transactional/risk records daily. Strong focus on data quality and reliability (reconciliation checks, schema drift detection, CloudWatch alerting) plus Spark performance tuning and idempotent backfills using Delta Lake/merge logic across AWS (S3/EMR/Databricks/Redshift) and Azure (ADF/Azure DevOps/Azure Monitor).”

AWS, Amazon S3, AWS Glue, Amazon Redshift, AWS IAM, AWS Lambda (+89 more)

Mukesh Rajmohan

Screened

Mid-level Data Engineer specializing in AWS/Azure pipelines and streaming analytics

VA, USA · 5y exp

UnitedHealth Group · George Mason University

“Data engineer with experience across healthcare and geospatial risk systems, owning end-to-end pipelines from ingestion through serving on AWS/Azure stacks. Built HIPAA-compliant data quality gates and CDC for millions of daily claims, and also delivered a real-time wildfire risk platform with 20-minute refresh cycles and a 60% data accuracy lift. Strong in streaming (Kafka), Spark performance tuning, and production-grade orchestration/CI/CD (Airflow, Docker, Jenkins, GitHub Actions, Terraform).”

Python, SQL, Java, AWS, Amazon S3, AWS Lambda (+95 more)

Akhil Reddy Edla

Screened

Senior Data Engineer specializing in cloud data platforms and automated data quality

Houston, TX · 4y exp

CenterPoint Energy · University of Central Missouri

“Data engineer at CenterPoint Energy who built and operated multiple production-grade GCP data systems: a daily Snowflake→BigQuery replication framework (150+ tables) with Monte Carlo/Atlan-driven observability and schema-drift protection, plus a FastAPI metrics service for pipeline health. Demonstrated measurable impact (40% faster dashboard queries, 70% less manual refresh work, zero data loss) and strong operational rigor (scaling Cloud Run jobs, SAP SLT reconciliation, quarantine patterns, CI/CD via GitHub Actions + Terraform).”

Apache Airflow, Apache Kafka, Apache Spark, API Development, AWS, AWS Glue (+116 more)

SASIREKHA GULIPALLI

Screened

Mid-level Data Analyst specializing in procurement, supply chain analytics, and applied machine learning

Alpharetta, GA · 4y exp

Motrex · Georgia State University

“Strategic sourcing professional specializing in seasonal apparel supply chains, combining Coupa/JD Edwards analytics with Excel/Python modeling and Power BI dashboards to drive cost reduction and OTIF gains. Notable for rapid mitigation of a 10-day factory delay affecting 12 holiday SKUs (preserved 95% of revenue) and for automating PO workflows to cut cycle time by 4.2 days and improve OTIF by 15%.”

A/B Testing, Amazon EC2, Amazon S3, Bash, BigQuery, Classification (+113 more)

Aniket Janrao

Screened

Junior Data Scientist specializing in healthcare ML and clinical NLP/LLMs

Houma, LA · 2y exp

Objective Medical Systems LLC · University at Buffalo

“Healthcare-focused LLM engineer who has built two production clinical applications: an automated structured clinical report generator from physician-patient conversations and a RAG-based chatbot for retrieving patient history (procedures, allergies, etc.). Demonstrates strong applied RAG expertise (overlapping chunking, entity dependency graphs, temporal filtering, graph RAG) to reduce hallucinations/omissions and partners closely with clinicians to automate hospital workflows.”

BERT, C++, Data preprocessing, Data visualization, Deep learning, Docker (+125 more)

Abdul Mohammed

Screened

Mid-level Data Analyst specializing in healthcare and financial analytics

USA · 3y exp

Cardinal Health · Indiana Tech

“Built and productionized an LLM-powered clinical documentation and insights pipeline at Cardinal Health using LangChain + GPT-4 with RAG to summarize long clinical notes, extract medication/dosage entities, and generate structured SQL-ready outputs for downstream analytics. Emphasizes clinical reliability via labeled benchmarking (precision/recall/F1), shadow deployments, clinician human-in-the-loop review, and ongoing monitoring/orchestration with Airflow, Lambda, S3, Postgres, and Power BI.”

SQL, Python, R, HTML, JSON, Microsoft Excel (+105 more)

Sri Charan Raju Karampudi

Screened

Mid-level Data Engineer specializing in cloud ETL and financial data platforms

Virginia, USA · 3y exp

Capital One · Avila University

“Data engineer with experience at Capital One and HSBC building and operating GCP-based data platforms. Led an end-to-end Oracle-to-BigQuery migration processing ~200–300GB/day using Dataflow/Beam, Airflow, Dataproc/PySpark, and Looker, achieving ~99.5% pipeline success and ~30% fewer data quality issues. Strong in production reliability, schema drift handling for external APIs, and BigQuery performance/serving patterns (materialized views, authorized views, versioned datasets).”

ETL, Java, Spring Framework, Apache Airflow, SQL, Snowflake (+102 more)

Sukesh Anamaneni

Screened

Senior Business Analyst specializing in AI and commercial banking analytics

Detroit, MI · 5y exp

UnitedHealth Group · Walsh College

“Analytics candidate with hands-on experience supporting a workforce system transformation from symplr to Oracle Fusion Time and Labor, using SQL and Python to turn operational HR, attendance, and payroll data into reporting-ready datasets. They emphasize performance optimization, reusable analytics pipelines, and metric consistency across dashboards, with project work focused on overtime reduction, workforce efficiency, and retention trends by department.”

Fraud Detection, Agile, Scrum, Requirements Gathering, SQL, Power BI (+85 more)

Ram Jayesh Parekh

Screened

Junior Data Analyst specializing in ML, NLP, and cloud data pipelines

New York City, NY · 3y exp

Cambium Assessment · NYU

“Built and deployed a GenAI-powered PhD career intelligence platform at NYU that maps academic backgrounds to career paths and converts long academic CVs into job-ready resumes. Stands out for treating LLM systems as structured production pipelines—combining NLP extraction, embeddings, orchestration, and AWS deployment—to improve recommendation quality and cut resume preparation time by 70%.”

A/B Testing, Agentic AI, Anomaly Detection, Azure Machine Learning, Bitbucket, C++ (+140 more)

Dhriti Kanchan

Screened

Mid-level Data Analyst specializing in healthcare and financial analytics

Texas, USA · 5y exp

McKesson · Northeastern University

“Analytics-focused candidate with hands-on experience turning messy CRM, e-commerce, payments, and support data into trusted reporting datasets using SQL and Python. They have owned end-to-end churn and retention analytics work, including RFM-based segmentation, dashboard delivery, and metric standardization across sales, marketing, and finance.”

SQL, Python, Pandas, NumPy, Matplotlib, Seaborn (+103 more)

Keerthana Priya

Screened

Mid-level Data Analytics & ML Engineer specializing in NLP, LLMs, and cloud data platforms

Dallas, TX · 5y exp

Mattel · Kennesaw State University

“At KPMG, built and productionized a secure RAG-based LLM assistant that lets business and risk stakeholders query data warehouses in natural language, reducing dependence on data engineers for ad-hoc analysis. Demonstrates strong production rigor (Airflow orchestration, CI/CD, containerization), retrieval/embedding tuning (rechunking, semantic abstraction for structured data), and reliability controls (confidence thresholds, refusal behavior, monitoring and canary evals).”

SQL, Python, R, PySpark, Apache Spark, Pandas (+123 more)

Sravan Kumar Jajam

Screened

Mid-level Data Scientist / ML Engineer specializing in streaming ML systems for healthcare and IoT

Urbandale, IA · 4y exp

John Deere · Auburn University at Montgomery

“ML/GenAI engineer with production experience building an LLM-powered governance layer that summarizes verified drift/performance signals into validation reports and release notes, designed for regulated environments with de-identification and non-blocking fallbacks. Strong Airflow-based orchestration background across healthcare and finance, integrating Databricks/Spark and MLflow for scalable retraining/monitoring. Demonstrated ability to partner with non-technical healthcare operations teams to deliver actionable risk-scoring outputs via dashboards and automated reporting.”

Python, R, SQL, Bash, Pandas, NumPy (+127 more)

Hanish Kukkala

Screened

Mid-level Data Scientist specializing in Generative AI and NLP

USA · 6y exp

CVS Health · University of Central Missouri

“ML/GenAI engineer with recent CVS Health experience building a production RAG system over unstructured financial/research documents using LangChain, FAISS, and Pinecone, plus LoRA/PEFT fine-tuning of GPT/LLaMA for domain-aware summarization. Demonstrates strong applied MLOps and data engineering skills (Airflow/Prefect, Docker/Kubernetes, CI/CD, MLflow) and measurable impact (sub-second retrieval, ~40% better context retrieval, ~25% entity matching improvement).”

A/B Testing, Apache Hadoop, Apache Hive, Apache Kafka, Apache Spark, AWS (+170 more)

Sailaja Lokasani

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and healthcare analytics

Dallas, TX · 5y exp

Lightbeam Health Solutions · Syracuse University

“Healthcare-focused data engineer/ML practitioner with experience at Lightbeam Health Solutions and Humana building production entity-resolution and semantic similarity pipelines across EMR, lab, and claims data. Uses NLP/ML (spaCy, scikit-learn, BioBERT/LightGBM) plus Snowflake/Airflow and vector search (Pinecone) to improve linkage accuracy (reported 90%) and semantic match quality (reported +12–15%), while reducing manual cleanup by 40%+.”

Apache Airflow, AWS, AWS Glue, AWS Lambda, Agile, C++ (+134 more)

Mayur Komaravelly

Screened

Senior Data Analyst specializing in data pipelines, web scraping, and legal data enrichment

Illinois, USA · 5y exp

The Hartford · Indiana Wesleyan University

“Data engineer focused on reliable, scalable analytics pipelines and external data collection. Has owned end-to-end pipelines processing 5–10M records/day, serving Snowflake data marts to Power BI/Tableau, and reports ~99% reliability through strong validation/monitoring. Also shipped versioned REST APIs for curated data with query optimization and caching.”

Apache Airflow, Apache Kafka, Apache Spark, Ansible, API Design, AWS Glue (+140 more)

Sai Nekkanti

Screened

Mid-level Data Scientist / ML Engineer specializing in secure GenAI and financial compliance

Mount Laurel, NJ · 4y exp

MetLife · Rowan University

“Built a production "sentinel insight engine" to tame information overload from millions of product reviews and support transcripts, combining Azure OpenAI (GPT-3.5) zero-shot classification with a fine-tuned T5 summarizer to generate weekly actionable product insights. Demonstrated strong MLOps/production engineering by adding drift monitoring with embedding-based detection, integrating REST with legacy SOAP/queue-based CRM via FastAPI middleware, and scaling reliably on Kubernetes with HPA.”

SDLC, Agile, Waterfall, Python, C, C++ (+155 more)

Revanth Goli

Screened

Senior Data & Backend Engineer specializing in cloud data pipelines and LLM/RAG systems

Morrisville, NC · 6y exp

Syneos Health · University of Alabama at Birmingham

“Data engineer with end-to-end ownership of large-scale retail and clinical data ingestion/processing on AWS, including real-time streaming and batch pipelines. Delivered measurable outcomes: 20M daily transactions processed, latency cut from 4 hours to 5 minutes, ~70% fewer failures, and 120+ pipelines running at 99.8% reliability with full audit compliance.”

Python, Pandas, PySpark, FastAPI, LangChain, SQL (+97 more)

Hadi Jaffery

Screened

Junior Data Engineer specializing in Snowflake and investment data platforms

Boston, MA · 3y exp

Liberty Mutual · University of Maryland, College Park

“Private markets/private credit data engineer owning core Snowflake/AWS data infrastructure (S3 → ActiveBatch → Snowflake) with automated iceDQ quality checks and curated datasets for internal Power BI/React reporting. Drove major reliability and delivery improvements, including cutting DB CI/CD deploy time 50% and reducing downstream table errors by 90%+, and also built an internal React/FastAPI app to visualize the team’s data infrastructure in an ambiguous early-stage environment.”

AWS, AWS Lambda, CI/CD, C, C++, Data Engineering (+84 more)

