Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Apache Hive Professionals

Pre-screened and vetted.

Apache Hive Python SQL Apache Spark Docker AWS

Sri Charan Raju Karampudi

Screened

Mid-level Data Engineer specializing in cloud ETL and financial data platforms

Virginia, USA3y exp

Capital OneAvila University

“Data engineer with experience at Capital One and HSBC building and operating GCP-based data platforms. Led an end-to-end Oracle-to-BigQuery migration processing ~200–300GB/day using Dataflow/Beam, Airflow, Dataproc/PySpark, and Looker, achieving ~99.5% pipeline success and ~30% fewer data quality issues. Strong in production reliability, schema drift handling for external APIs, and BigQuery performance/serving patterns (materialized views, authorized views, versioned datasets).”

ETL Java Spring Framework Apache Airflow SQL Snowflake+102

View profile

Sharath Bandi

Screened

Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal generation

Saint Louis, Missouri4y exp

LSEGAvila University

“Open-source JavaScript contributor focused on performance and maintainability in data visualization libraries—refactored legacy ES5 into modular ES6, added tests/docs, and delivered ~30% faster load times with positive community adoption. Also optimized a React dashboard (~40% load-time reduction) and took ownership in an ambiguous AI product initiative by setting milestones, standing up an initial ML pipeline, and shipping a prototype in ~6 weeks that became the basis for production.”

A/B Testing Apache Airflow Apache Hadoop Apache Hive Apache Kafka Apache Spark+225

View profile

Himangshu Das

Screened

Staff Software Engineer / Technical Architect specializing in cloud data platforms and GenAI agents

Menlo Park, CA10y exp

PromethiumUniversity of Illinois Urbana-Champaign

“Small-team builder of Promethium’s “Mantra” next-gen agentic text-to-SQL engine, using vector DB + LangGraph tooling and SQL validation/evaluation to improve query accuracy. Experienced in diagnosing production LLM workflow failures via LangSmith traces and in running hands-on developer workshops and pre-sales POCs with live debugging and real customer data.”

AI Agents Alerting Amazon CloudWatch Amazon DynamoDB Amazon EC2 Amazon ECS+107

View profile

Hanish Kukkala

Screened

Mid-level Data Scientist specializing in Generative AI and NLP

USA6y exp

CVS HealthUniversity of Central Missouri

“ML/GenAI engineer with recent CVS Health experience building a production RAG system over unstructured financial/research documents using LangChain, FAISS, and Pinecone, plus LoRA/PEFT fine-tuning of GPT/LLaMA for domain-aware summarization. Demonstrates strong applied MLOps and data engineering skills (Airflow/Prefect, Docker/Kubernetes, CI/CD, MLflow) and measurable impact (sub-second retrieval, ~40% better context retrieval, ~25% entity matching improvement).”

A/B Testing Apache Hadoop Apache Hive Apache Kafka Apache Spark AWS+170

View profile

Sreelekha Vuppala

Screened

Mid-level Data Scientist specializing in Generative AI, MLOps, and cloud data platforms

USA4y exp

CitiusTechArizona State University

“GenAI/ML engineer (CitiusTech) who has deployed production RAG systems for compliance/operations document Q&A, using Pinecone + FastAPI microservices on Kubernetes with strong monitoring and guardrails. Also built a GenAI-powered incident triage/routing solution in collaboration with non-technical stakeholders, achieving 35% faster response times and 40% fewer misclassified tickets, and has hands-on orchestration experience with Airflow and AutoSys.”

A/B Testing Agile Amazon Kinesis Apache Airflow Apache Hadoop Apache Kafka+246

View profile

Harini Vinu

Screened

Intern Software Engineer specializing in cloud, big data, and test automation

New York, United States1y exp

QualitestNYU

“Internship experience at Qualitest building and deploying an LLM-powered test automation system that reduced manual test creation and improved efficiency (~40%). Demonstrates strong production engineering for LLM systems (timeouts/retries/monitoring/caching, prompt optimization, batching) and has scaled workflows to 100+ concurrent jobs; also has orchestration experience with AWS Step Functions and Kubernetes.”

Amazon CloudWatch Amazon DynamoDB Amazon Kinesis Amazon S3 Amazon SQS Amazon API Gateway+149

View profile

Sai Nekkanti

Screened

Mid-level Data Scientist / ML Engineer specializing in secure GenAI and financial compliance

Mount Laurel, NJ4y exp

MetLifeRowan University

“Built a production "sentinel insight engine" to tame information overload from millions of product reviews and support transcripts, combining Azure OpenAI (GPT-3.5) zero-shot classification with a fine-tuned T5 summarizer to generate weekly actionable product insights. Demonstrated strong MLOps/production engineering by adding drift monitoring with embedding-based detection, integrating REST with legacy SOAP/queue-based CRM via FastAPI middleware, and scaling reliably on Kubernetes with HPA.”

SDLC Agile Waterfall Python C C+++155

View profile

Bhargavi Kondaveeti

Screened

Mid-level Data Engineer specializing in big data pipelines and real-time streaming

Dallas, TX6y exp

Johnson & JohnsonUniversity of North Texas

“Data engineer who has owned end-to-end production pipelines processing a few million records/day, using Python/Airflow/SQL/PySpark with Snowflake serving to BI (Power BI). Built resilient external web data collection systems (anti-bot, schema-change detection, backfills) and shipped versioned REST APIs for internal consumers, improving pipeline success rates to 99% through monitoring, retries, and idempotent design.”

Agile Amazon CloudWatch Amazon DynamoDB Amazon Redshift Amazon S3 Amazon SQS+101

View profile

Sai Vardhan Reddy

Screened

Mid-Level Data Engineer specializing in cloud data platforms and governed analytics

5y exp

OptumUniversity of Central Missouri

“Data engineer with Optum experience building end-to-end healthcare data pipelines for HL7/FHIR, processing millions of records daily across Kafka streaming and Databricks/Spark batch. Strong focus on data quality (schema enforcement/validations), reliability (Airflow monitoring/alerts), and analytics-ready serving in Snowflake powering Power BI/Tableau, with CI/CD via Git and Jenkins.”

AWS Amazon EC2 AWS Lambda AWS Glue Amazon S3 Amazon Kinesis+94

View profile

sanketh koritikanti

Screened

Mid-level Full-Stack Python Developer specializing in cloud, data engineering, and AI/ML

Washington, USA4y exp

Fannie MaeSt. Francis College

“Full stack Python developer who actively integrates AI coding assistants into day-to-day engineering work, including code generation, debugging, testing, and documentation. Has also coordinated multi-agent workflows across backend, frontend, testing, and code review, showing an applied, productivity-focused approach to AI-enabled software delivery.”

Python SQL JavaScript Scala HTML CSS+103

View profile

Avantik Tiwari

Screened

Junior Data Scientist / Big Data Engineer specializing in ML, LLMs, and analytics platforms

Tempe, Arizona3y exp

Arizona State UniversityArizona State University

“Backend/data platform engineer who led a major redesign of a hybrid streaming+batch analytics platform processing 10+ TB/day (Airflow/Hive/BigQuery) with strong data-quality automation. Also built a production RAG PDF assistant with concrete mitigations for hallucinations and prompt injection (re-ranking, grounding, verifier step) and has deep experience executing low-risk migrations (dual-write, blue-green, rapid rollback) and implementing JWT-based row-level security.”

Python SQL Rust Java JavaScript MySQL+112

View profile

Gowthami chilukuru

Screened

Mid-Level Full-Stack Software Engineer specializing in healthcare, cloud, and data platforms

Sunnyvale, CA5y exp

Intuitive SurgicalStevens Institute of Technology

“Backend/platform engineer who owned a real-time customer analytics microservice stack in Python/FastAPI with Kafka streaming into PostgreSQL, including schema enforcement (Avro) and high-throughput optimizations. Strong Kubernetes + GitOps practitioner (EKS/GKE, Helm, Argo CD) who has handled CI/CD reliability issues with automated pre-deploy checks and rollbacks, and supported major migrations (on-prem to AWS; VM to EKS) with blue-green cutover planning.”

Python R Java C JavaScript TypeScript+200

View profile

Narayanaroyal Marisetty

Screened

Mid-level Data Scientist/ML Engineer specializing in healthcare AI and MLOps

USA4y exp

CVS HealthUniversity at Buffalo

“Designed and deployed an enterprise LLM-powered clinical/pharmacy policy knowledge assistant at CVS Health, replacing manual searches across PDFs/Word/SharePoint with a HIPAA-compliant RAG system. Built end-to-end ingestion and orchestration (Airflow + Azure ML/Data Lake + vector index) with PHI masking, versioned re-embedding, and production monitoring (Prometheus/Grafana), and partnered closely with clinicians/compliance to ensure policy-grounded, auditable answers.”

A/B Testing Apache Airflow Apache Hadoop Apache Hive Apache Kafka Apache Spark+132

View profile

kesav boob

Screened

Mid-Level Full-Stack Java Engineer specializing in microservices and cloud

San Francisco, California5y exp

Dell TechnologiesCal State LA

“Full-stack developer who built an end-to-end Hotel Management System using React and Spring Boot with MongoDB and AWS. Has hands-on experience debugging API/data-fetching issues with Postman and validating results against the database, plus exposure to handling large data workloads with chunking and monitoring via Grafana/Tabula.”

Java SQL C C++C#Python+129

View profile

Megan Montag

Screened

Senior Talent Acquisition & HR Change/Program Leader specializing in TA operations and process optimization

Greenwood Village, CO9y exp

SpectrumUniversity of Colorado Boulder

“Talent Acquisition/Recruiting Operations leader with 10+ years in TA and 6–7 years leading ops teams (up to 5 direct reports). Drove an enterprise-wide, global redesign of EchoStar’s recruiting and selection process, focusing on automation and process streamlining, and has led major HR/TA system implementations (Workday, Dayforce, iCIMS) across ATS/HRIS, onboarding, and payroll.”

Strategic Planning Stakeholder Management Change Management Program Management Project Management Data Analytics+82

View profile

Rakesh Kolagani

Screened

Mid-level AI/ML Engineer specializing in MLOps and LLM-powered applications

Mountain View, CA5y exp

IntuitUniversity of Central Missouri

“AI/ML engineer with production experience building a RAG-based internal analytics assistant (Databricks + ADF ingestion, Pinecone vector store, LangChain orchestration) deployed via Docker on AWS SageMaker with CI/CD and MLflow. Strong focus on real-world constraints—latency/cost optimization (LoRA ~60% compute reduction), hallucination control with citation grounding, and enterprise security/governance. Previously at Intuit, delivered an interpretable churn prediction system (PySpark/Databricks, Airflow/Azure ML) that improved retention targeting ~12%.”

A/B Testing Amazon S3 Apache Airflow AWS Glue AWS Lambda AWS Step Functions+126

View profile

Pooja Murigappa

Screened

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps in Financial Services

Austin, TX5y exp

Charles SchwabUniversity of Central Missouri

“ML/LLM engineer at Charles Schwab who built a production loan-advisor chatbot integrated with internal knowledge and loan-calculator APIs, adding strict numeric validation to prevent rate hallucinations and optimizing context to control costs. Also runs ~40 Airflow DAGs orchestrating retraining/ETL/drift monitoring with an automated Snowflake→SageMaker→auto-deploy pipeline, and uses rigorous testing plus canary rollouts tied to business metrics and compliance constraints.”

Amazon DynamoDB Apache Airflow Apache Kafka Apache Spark AWS AWS Glue+183

View profile

Nandini Kalita

Screened

Senior Data Scientist / ML Engineer specializing in NLP, anomaly detection, and cloud ML platforms

Remote, CA10y exp

EmotionallNMIMS University

“ML/NLP practitioner who built customer-feedback topic modeling (NMF + TF-IDF) to diagnose chatbot-to-agent handovers and drove product/ops changes that reduced operational costs by 20%. Also developed LSTM-based intent recognition using Word2Vec/GloVe embeddings for semantic linking, and deployed an LSTM autoencoder for fraud anomaly detection that cut false positives by 25% while capturing 15% more fraud in A/B testing.”

A/B Testing Agile Anomaly Detection AWS BigQuery Bitbucket+116

View profile

Supriya Mattapelly

Screened

Mid-level AI/ML Engineer specializing in GenAI agents, RAG pipelines, and MLOps

USA6y exp

UnitedHealthcareKent State University

“AI/ML engineer who built a production RAG-based internal document intelligence assistant (LangChain + Pinecone) to let employees query enterprise reports in natural language. Demonstrated hands-on pipeline orchestration with Apache Airflow and tackled real production issues like retrieval grounding and latency using tuning, caching, and token optimization, while partnering closely with non-technical business stakeholders through iterative demos.”

A/B Testing Amazon CloudWatch Amazon EC2 Amazon Redshift Amazon S3 Apache Airflow+152

View profile

HarshaSree gudapati

Screened

Senior Data Engineer specializing in cloud-native data platforms for finance and healthcare

Charlotte, NC4y exp

Bank of AmericaUniversity of Cincinnati

“Data engineer/backend data services practitioner with Bank of America experience building real-time and batch transaction-monitoring pipelines and APIs (Kafka + databases, REST/GraphQL). Highlights include a reported 45% response-time improvement through performance optimizations and use of Delta Lake schema evolution plus CI/CD (GitHub Actions/Jenkins) and operational reliability patterns like CloudWatch monitoring and dead-letter queues.”

Azure Data Factory AWS Amazon S3 AWS Glue Amazon Redshift AWS Lambda+125

View profile

Madhav Vaddepalli

Screened

Senior Data Engineer specializing in cloud data platforms and big data pipelines

Seattle, WA8y exp

SafecoFitchburg State University

“Data engineer focused on building reliable, production-grade pipelines and external data collection systems on AWS (S3/Lambda/SQS/Glue/EMR) using PySpark/SQL, serving curated datasets to Snowflake/Redshift for finance and fraud teams. Has operated a large-scale crawler ingesting millions of records/day with anti-bot tactics, schema versioning/quarantine, and CloudWatch/Datadog monitoring, and also shipped a versioned REST API with caching and query optimization.”

Agile Amazon CloudWatch Amazon DynamoDB Amazon EC2 Amazon Redshift Amazon RDS+192

View profile

Aayush Anand

Screened

Intern Full-Stack/Software Engineer specializing in web apps, cloud, and data/ML systems

New York, NY1y exp

The NorthStar GroupNYU

“Built and productionized LLM-driven content intelligence/SEO agents for a high-traffic media platform, automating tagging/summarization/metadata with FastAPI + async orchestration and strict JSON-schema outputs. Demonstrated measurable impact (40% faster publishing, +20% organic traffic in 3 months) and strong reliability practices (offline evals, shadow mode, canaries, fallbacks, idempotency, and monitoring).”

Agile Apache Hadoop Apache Hive Apache Kafka Apache Spark AWS+112

View profile

Molli Dinesh

Screened

Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps

Remote, USA4y exp

Marsh McLennanIllinois Institute of Technology

“Built an AI-driven insurance policy summarization platform at Marsh, taking it end-to-end from messy PDF ingestion/OCR and custom extraction through LLM fine-tuning and AWS SageMaker deployment. Delivered measurable impact (25% reduction in manual review time, 99% uptime) and demonstrated strong production MLOps/LLMOps practices with Airflow/Step Functions orchestration, rigorous evaluation (ROUGE + human review), and continuous monitoring for drift, latency, and hallucinations.”

Python Pandas NumPy Scikit-learn R SQL+132

View profile

Yuvraj Singh Chauhan

Screened

Entry-level AI/ML Engineer specializing in LLMs, RAG, and DevOps automation

Bangalore, India1y exp

RapidFortThapar Institute of Engineering and Technology

“Built and owned a production-scale AI-driven software release/version intelligence platform orchestrated via GitHub Actions that tracks 1000+ upstream repositories and automatically generates SLA-bound JIRA upgrade tickets for hardened container images. Replaced brittle regex/PEP440 parsing with an LLM-based semantic filtering layer plus deterministic validation to handle noisy/inconsistent GitHub tags at scale, with monitoring for coverage, latency, and correctness validated against upstream ground truth.”

API Integration Bash Computer Vision C C++Data Analytics+71

View profile

Data Engineers Machine Learning Engineers Software Engineers Data Scientists Data Analysts Software Developers Data & Analytics AI & Machine Learning Engineering Executive & Leadership

Need someone specific?

AI Search

Related

Need someone specific?