Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache Spark Python Docker SQL AWS CI/CD

PHANINDRA KETHAMUKKALA

Screened

Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI

USA4y exp

GE HealthCareFranklin University

“LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.”

A/B Testing Agile AI Agents Apache Kafka Apache Spark AWS Glue+170

View profile

Aditya Sairam

Screened

Mid-Level Software Engineer specializing in cloud data platforms and AI search

Troy, MI6y exp

Robotics Technologies LLCCleveland State University

“Open-source JavaScript contributor focused on data visualization, extending Chart.js/React with custom plugins for real-time streaming dashboards. Designed an end-to-end telemetry pipeline using Apache Kafka and Azure Cosmos DB, optimizing partitioning, batching, caching, and client throttling to keep latency low and support thousands of concurrent users. Demonstrates strong ownership in fast-changing environments, including building full-stack AI applications and ingestion/ETL pipelines at Robotics Technologies LLC.”

Apache Kafka AWS AWS Lambda Azure Functions C#Cloud Computing+89

View profile

PAVAN VARMA PENMETHSA

Screened

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

New York City, NY6y exp

AvanadeUniversity of North Texas

“Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.”

Machine Learning Generative AI Large Language Models (LLMs)Prompt Engineering Retrieval-Augmented Generation (RAG)Embeddings+131

View profile

Vigneshwaran Moorthi

Screened

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and Clinical AI

Chicago, Illinois4y exp

OptumIllinois Institute of Technology

“Built and productionized a HIPAA-compliant LLM+RAG Clinical AI assistant at Optum, fine-tuning GPT/LLaMA on de-identified patient notes and integrating FAISS/Pinecone for sub-second retrieval; reported to cut diagnosis time by ~20 minutes per case. Experienced in orchestrating ML pipelines (Airflow, AWS Step Functions, Azure Data Factory) and in reliability techniques for LLM systems (grounding, citations, confidence filters, monitoring) while partnering closely with clinicians and compliance teams.”

A/B Testing Amazon CloudWatch Amazon EC2 Amazon Redshift Amazon S3 Apache Airflow+138

View profile

Krishnakaanth Reddy Yeduguru

Screened

Mid-level AI/ML Engineer specializing in LLMs, NLP, and MLOps

Texas, USA4y exp

McKessonUniversity of Texas at Arlington

“AI/ML engineer with healthcare domain depth who led a HIPAA-compliant, production LLM system at McKesson to automate clinical document understanding—extracting entities, summarizing provider notes, and supporting authorization decisions. Hands-on across Spark/Python ETL, Hugging Face + LoRA/QLoRA fine-tuning, RAG, and cloud-native MLOps (Airflow/Kubernetes/Step Functions, MLflow, blue-green on EKS/GKE), with explicit work on PHI handling and hallucination reduction.”

Python C++SQL Bash TensorFlow PyTorch+129

View profile

BHARATH BHOOTHPUR

Screened

Mid-level Data Analyst specializing in healthcare and finance analytics

New Jersey, USA5y exp

Omada HealthRowan University

“Built an end-to-end Alexa smart-home IoT application controlling a Wi-Fi bulb, including ESP32 firmware (MQTT) and an AWS serverless backend (IoT Core/Device Shadow, Lambda, DynamoDB) with a REST API. Demonstrates strong real-time scalability patterns (streaming ingestion, stateless processing, partition-key design) and full-stack delivery with Spring Boot + React (JWT auth, CORS, data-heavy dashboards).”

Python SQL R NumPy Pandas Matplotlib+113

View profile

Sushma Puchakayala

Screened

Mid-level Data Analyst specializing in AI/ML and advanced analytics

USA3y exp

AccentureMurray State University

“Accenture data/ML practitioner who deployed a retail churn prediction and BERT-based sentiment analysis system to production, integrating behavioral + feedback data and operationalizing it with ETL automation, orchestration, and CI/CD. Experienced managing 2TB+ multi-source data, monitoring drift in Databricks, and translating results into Power BI dashboards for marketing teams (including K-means customer segmentation).”

Python Pandas NumPy Matplotlib Scikit-learn Seaborn+122

View profile

Kumud Sharma

Screened

Mid-level Full-Stack Software Engineer specializing in cloud-native microservices and AI integrations

USA6y exp

IntuitIndiana University

“Backend engineer who has delivered large, measurable performance wins (10x throughput, 67% latency reduction) by combining Flask microservices, Redis caching, and AWS autoscaling/observability. Has hands-on depth in SQLAlchemy/Postgres optimization and production scaling pitfalls (cache consistency, connection exhaustion), plus experience deploying real-time ML inference (XGBoost) on AWS Lambda and building secure multi-tenant Kubernetes isolation.”

Python Java JavaScript TypeScript C#C+++192

View profile

Gordon Ng

Screened

Mid-Level Software Engineer specializing in AI/ML and distributed systems

Brooklyn, NY3y exp

OptumBoston University

“Software engineer with production experience building a serverless monolith and multi-layer video pipeline at easyML, plus hands-on integration of multiple LLM providers (Grok/Claude/OpenAI) into a full-stack app. Interested in robotics via computer vision (OpenCV/OpenMMLab), with a strong real-time systems mindset around SLOs, latency, determinism, and reliability; also has low-level OS experience writing a keyboard device driver.”

Apache Kafka AWS AWS Lambda CI/CD Cloud Computing C+++77

View profile

ASHWINKUMAR PACHIPALA

Screened

Mid-level Full-Stack Java Developer specializing in cloud-native microservices

USA4y exp

Epic SystemsWebster University

“Full-stack Java developer with IBM and Epic Systems experience modernizing legacy enterprise apps into microservices and delivering customer-facing healthcare claims workflows at very high scale (2M+ transactions/day). Strong blend of product engineering (APIs + React/TypeScript UI) and production operations on AWS, including performance incident remediation via query optimization, indexing, and autoscaling.”

Java Python C#Spring Boot Spring MVC Flask+136

View profile

Mallikarjuna Reddy Gayam

Screened

Mid-level AI/ML Engineer specializing in enterprise ML, MLOps, and Generative AI

Springfield, Missouri5y exp

O'Reilly Auto PartsSaint Louis University

“ML/LLM engineer who has shipped production RAG systems (LangChain + HF Transformers + FAISS) with hybrid retrieval and cross-encoder re-ranking, deployed via FastAPI/Docker/Kubernetes and monitored with MLflow. Also partnered with wealth advisors at Edward Jones to deliver a client retention model with SHAP-driven explanations and a dashboard that improved trust, adoption, and reduced high-value client churn.”

Python SQL R Java Scala Machine Learning+112

View profile

Naga Venkata Padala

Screened

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and real-time fraud detection

4y exp

U.S. BankUniversity of Massachusetts Dartmouth

“GenAI/ML engineer who has shipped production agentic systems in highly regulated and high-throughput environments, including an AWS Bedrock-based fraud/compliance workflow at U.S. Bank with PII redaction and hallucination detection that cut investigation time by 50%+. Also built and evaluated RAG and recommendation systems at Target, using RAGAS-driven testing, hybrid retrieval with re-ranking, and SHAP explainability dashboards to align model behavior with merchandising business KPIs.”

AWS AWS CloudFormation AWS Glue AWS Lambda AI agents Apache Airflow+143

View profile

Krishna Kandlakunta

Screened

Mid-level Data Scientist specializing in MLOps, LLM/RAG applications, and deep learning

United States5y exp

CitigroupUniversity of North Texas

“Built and deployed a production compliance automation RAG system (at Citi) that generates citation-backed, schema-validated risk summaries for regulatory document review. Emphasizes regulated-environment reliability with retrieval-only grounding, abstention, confidence thresholds, and immutable audit logging, plus orchestration using LangChain/LangGraph and Airflow. Reported ~60% reduction in compliance review effort while maintaining high precision and traceability.”

A/B Testing Agile Anomaly Detection Apache Hadoop Apache Hive Apache Kafka+167

View profile

Ram Kottala

Screened

Mid-level Data & GenAI Engineer specializing in lakehouse, streaming, and RAG platforms

Michigan, USA5y exp

FordWebster University

“Built a production internal LLM-powered knowledge assistant using a RAG architecture (Python, LLM APIs, cloud services) that answers employee questions with sourced, grounded responses from internal documents. Demonstrates strong practical depth in retrieval tuning (chunking/metadata filters), orchestration with LangChain, and production reliability practices (latency optimization, automated embedding refresh, evaluation metrics, logging/monitoring) while partnering closely with non-technical operations teams.”

Python PySpark Scala Java R SQL+173

View profile

Naga Yanala

Screened

Mid-level Data Engineer specializing in cloud data pipelines and analytics platforms

Texas, USA5y exp

Molina HealthcareSoutheast Missouri State University

“Data engineer with healthcare and enterprise experience (Molina Healthcare, Dell Technologies) building and operating high-volume batch + streaming pipelines across AWS and Azure. Strong focus on data quality (schema validation, fail-fast checks), reliability (monitoring/alerts, retries), and performance tuning (Spark/partitioning), with measurable runtime reduction and improved downstream trust.”

Python SQL PySpark Bash ETL Data pipelines+85

View profile

Sai Kavyusha Ponnagant

Screened

Mid-level Data Engineer specializing in cloud data pipelines and financial services warehousing

Chicago, IL4y exp

Charles SchwabDePaul University

“Data engineer (Charles Schwab) who took ownership of an unstable, ambiguous nightly financial data pipeline and rebuilt it into a reliable, incremental AWS Glue/Airflow/Redshift system feeding Power BI. Created a custom Python data-quality framework with hard-stop gating and schema drift detection, improving integrity (99.9%), cutting runtime (~20%), and reducing incidents/tickets (35% fewer schema-related dashboard incidents; 30% fewer investigations).”

Python SQL Amazon S3 AWS Glue Amazon Redshift AWS Lambda+73

View profile

Jaideep bommidi

Screened

Senior ML Engineer & Data Scientist specializing in LLM agents, retrieval/ranking, and MLOps

Denton, TX8y exp

Webster BankUniversity of North Texas

“Machine Learning Engineer currently at Webster Bank building an enterprise-scale LLM agent for Temenos Journey Manager/Maestro, using RAG-style multi-stage retrieval with FAISS/Pinecone, hybrid dense+sparse search, and LoRA fine-tuning optimized via NDCG/MAP and A/B testing. Previously handled messy incident/telemetry data at Deuta Werke GmbH with deterministic + fuzzy entity resolution, and has strong production data engineering experience across Spark/Hadoop and Python ETL systems.”

A/B Testing Agile Amazon EC2 Amazon EKS Amazon ECS Amazon Kinesis+181

View profile

ESHWANTH D. G

Screened

Mid-level Robotics Software Engineer specializing in autonomous perception and sensor fusion

CA, USA4y exp

HoneywellUniversity at Buffalo

“Robotics engineer with Honeywell and Tata Motors experience deploying ROS/ROS2 autonomous mobile robot fleets into live factory environments, integrating sensors, safety PLCs, and on-prem services. Known for solving end-to-end latency and stability issues (including network spikes under load) using gRPC, Docker, and improved diagnostics—cutting diagnosis time from hours to minutes and achieving sub-150 ms control response.”

Python C++C SQL Bash Git+118

View profile

Esha Gangam

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

USA4y exp

DeloitteUniversity at Albany

“GenAI/ML engineer from Deloitte who built and shipped a production RAG-based internal search assistant for support teams, delivering quantified operational gains (20% effort reduction, 35% faster manual lookup). Experienced in enterprise-grade LLM reliability (grounding/hallucination control), compliance/security constraints, and rapid release cycles using CI/CD, MLflow, and orchestration tools (Airflow, Databricks Jobs, LangChain).”

Python PySpark SQL Data Processing Feature Engineering Data Validation+89

View profile

UMESH KAMISETTY

Screened

Mid-level Data Engineer specializing in cloud lakehouse and streaming platforms

Seattle, WA5y exp

First United BankCleveland State University

“Data engineer focused on building production-grade pipelines on AWS (Kafka/Kinesis/Glue/S3) through to curated serving layers in Snowflake and Delta Lake. Emphasizes automated data quality validation (PySpark + CI/CD), modular dbt transformations for analytics (customer spending, risk metrics), and operational reliability with CloudWatch and DLQs; data consumed by BI tools and ML pipelines for fraud detection and risk analytics.”

Python PySpark SQL Shell Scripting AWS Amazon S3+146

View profile

Harshitha Parupalli

Screened

Mid-level Data Engineer specializing in multi-cloud real-time and batch data pipelines

Jersey City, NJ4y exp

Elevance HealthNJIT

“Data engineer with healthcare domain experience who owned 100M+ record pipelines end-to-end (Kafka/Kinesis/ADF → PySpark/dbt validation → Spark SQL transforms → Snowflake/Power BI serving). Built production-grade reliability practices (Airflow orchestration, CloudWatch/Grafana monitoring, pytest + contract/regression tests, idempotent ingestion/backfills) and delivered measurable improvements: 35% lower latency and 40% better query performance.”

Python SQL Shell Scripting R Scala Java+160

View profile

Sri Harsha patallapalli

Screened

Mid-level Machine Learning & Data Infrastructure Engineer specializing in MLOps on AWS

Boston, MA5y exp

Dextr.aiNortheastern University

“Built and deployed a fine-tuned Qwen 2.5 14B model into production at Dextr.ai as the backbone for hotel-operations agentic workflows, running on AWS EKS with Triton and TensorRT-LLM. Demonstrates strong cost-aware LLM engineering (QLoRA, FP8/BF16 on H100) plus rigorous benchmarking/observability (Prometheus, LangSmith) with reported sub-30ms TTNT. Previously handled long-running ETL orchestration with Airflow at GE Healthcare and Lowe's.”

Python Java C++SQL JavaScript Bash+113

View profile

Ti Wu

Screened

Junior Full-Stack Developer specializing in web apps and reinforcement learning

Hsinchu, Taiwan1y exp

Industrial Technology Research InstituteUniversity of Wisconsin–Madison

“Built an AI basketball shooting coach that analyzes player form against NBA players and recruited 30+ beta users via Reddit to drive iterative UI/workflow improvements. Also has internship experience building an administrative server and coordinating API/database compatibility with another client server, emphasizing communication and integration quality.”

Python Java C C++JavaScript TypeScript+95

View profile

Preetham Reddy Konuganti

Screened

Junior Full-Stack Engineer specializing in AI applications and scalable web platforms

San Jose, CA2y exp

Cognia SecurityArizona State University

“Full-stack engineer with customer-facing delivery experience who built and deployed a multi-platform social media automation product (Next.js/Node/MongoDB) and optimized it using BullMQ/Redis background jobs, retries, and rate limiting for reliable posting at scale. Also delivered an AI-powered false-positive analysis service in a cybersecurity context, resolving production pipeline stalls via log-driven debugging, parallelization, caching, and LLM guardrails.”

Agile Amazon EC2 Amazon S3 Amazon SNS Amazon SQS Angular+126

View profile

Software Engineers Machine Learning Engineers Data Scientists Data Engineers Software Developers AI Engineers Engineering AI & Machine Learning Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?