Vetted PySpark Professionals

Pre-screened and vetted.

PySpark Python Docker SQL CI/CD AWS

Nora Jaf

Senior AI/ML Engineer specializing in Generative AI and LLMOps

Washington, DC10y exp

Clarion Tech

A/B Testing Agile Apache Kafka Argo CD Audit Logging AWS+147

View profile

Pranava Reddy Kothapally

Screened ReferencesStrong rec.

Junior Data Engineer specializing in Azure, CRM data pipelines, and marketing personalization

Hyderabad, India2y exp

TechwaveCleveland State University

“LLM/AI engineer who has deployed production RAG conversational analytics and Text-to-SQL systems over Snowflake and curated data marts, emphasizing enterprise-grade guardrails for accuracy, security, and cost. Notable for a structured approach to reducing hallucinations (curated metric/table registry, SQL validation, RBAC, and citation-backed responses) and for building resilient, observable multi-step agent workflows using LangChain/LlamaIndex and Airflow.”

Agile API Integration Audit Logging Azure Data Factory Azure DevOps Batch Processing+168

View profile

Ashish Shah

Screened

Mid-level Data Engineer / Software Engineer specializing in streaming and cloud data platforms

Arlington, TX3y exp

The University of Texas at ArlingtonUniversity of Texas at Arlington

“Backend engineer with deep Kafka/FastAPI microservices experience who redesigned a notification pipeline to cut end-to-end latency from ~5s to ~3s (including custom partition assignment and consumer tuning). Led a high-stakes ClickUp-to-Oracle migration of 1M+ records using idempotent ETL, reconciliation, and shadow deployment to achieve >99% integrity with zero downtime, and has hands-on production security implementation with Django/DRF (JWT + RBAC).”

Python TypeScript JavaScript Java SQL Django+100

View profile

Santhi Sampath Gamidi

Screened

Mid-level AI Engineer and Data Scientist specializing in LLM agents and RAG systems

Palo Alto, CA5y exp

LemmataUniversity at Buffalo

“Built a production-grade LLM evaluation and regression system that stress-tests models across hundreds of iterations, combining LLM-as-judge, semantic similarity, statistical metrics, and rule-based checks, with results delivered via stakeholder-friendly HTML reports and dashboards. Experienced orchestrating multi-agent RAG workflows using LangChain/LangGraph and event-driven GenAI pipelines in n8n integrating OCR, speech-to-text, and external APIs, with strong emphasis on reliability, observability, and explainable failures.”

A/B Testing Apache Hadoop Apache Hive Apache Kafka Apache Spark AWS Glue+149

View profile

Gautam Agrawal

Screened

Mid-Level Software Engineer specializing in backend systems, cloud, and applied LLM/NLP

IN, USA4y exp

Project 990Indiana University Bloomington

“Applied LLMs to classify long nonprofit mission statements into 8 segments without labeled data, using an ensemble of clustering/embedding methods plus zero-shot RoBERTa/BART and a Tree-of-Thought prompting pipeline with LLM-as-judge evaluation (Gemma). Also built LangChain/LlamaIndex agentic RAG workflows including a text-to-SQL data analysis assistant grounded on DB schema with retries and performance optimizations on an HPC cluster.”

Python Java C#JavaScript TypeScript HTML+121

View profile

Sree Sai Preetham Nandamuri

Screened

Mid-level Data Scientist specializing in Generative AI and LLMOps

Dover, USA4y exp

Visual TechnologiesUniversity of Houston

“Built a production-grade, semi-automated document recognition and classification system for large volumes of scanned PDFs, starting from little/no labeled data and handling highly variable scan quality. Deployed on AWS using SageMaker + Docker and orchestrated on EKS with a microservices design that scales CPU-heavy OCR separately from GPU inference, with strong reliability controls (validation, fallbacks, retries, readiness probes).”

A/B Testing API Gateway AWS AWS Lambda BERT CI/CD+124

View profile

Sasaunk Vanamali

Screened

Mid-Level Full-Stack Software Engineer specializing in cloud-native apps and ML services

Bowling Green, OH4y exp

Senecio Software IncBowling Green State University

“Software engineer who deployed and stabilized a real-time analytics platform at Senecio Software, focusing on production reliability, observability, and performance under load. Experienced debugging issues spanning distributed services and networking (e.g., tracing timeouts to packet loss from misconfiguration) and extending Python (FastAPI/Django) APIs for customer-specific analytics features in a configurable, maintainable way.”

Java Python PHP JavaScript TypeScript SQL+139

View profile

Ajith Kumar

Screened

Mid-level AI Data Engineer specializing in GenAI, RAG, and cloud data pipelines

Irving, TX5y exp

Mouri TechGeorge Mason University

“LLM/agentic AI builder who deployed a production ITSM automation agent on Google ADK integrating ServiceNow and FreshService, with strong safety guardrails (human-approval gating and runbook-only command execution) and rigorous evaluation (500 synthetic tickets; 80%+ false-positive reduction). Also partnered with finance to deliver an AI agent that automated invoice/SOW retrieval and monthly reporting to account managers, reducing manual back-and-forth.”

Python R SQL C#.NET Angular+124

View profile

Lakshmi Swathi Sreedhar

Screened

Mid-level AI Engineer specializing in Generative AI and LLM systems

Grand Ledge, MI3y exp

ChainSysUniversity of Michigan-Dearborn

“Built and deployed a production-grade, multi-agent Text-to-SQL assistant that lets non-technical stakeholders query large enterprise databases in natural language. Uses Pinecone-based schema retrieval + LLM reasoning (Gemini/Claude/GPT) with a dedicated validation agent (schema/syntax checks and safe dry runs) to reduce hallucinations and improve reliability, while optimizing latency and cost via async execution and embedding caching.”

A/B Testing Agile API Integration Apache Airflow Azure Data Factory Azure Machine Learning+172

View profile

Vengalarao Pachava

Screened

Junior AI Data Engineer specializing in Azure Databricks lakehouse and GenAI RAG systems

Irving, TX2y exp

Cloud Rack SystemsIllinois Institute of Technology

“Backend/applied AI engineer from Cloud Rack Systems who built production GenAI/RAG and data platforms on Azure/Databricks at enterprise scale (2.5M records/day). Known for making LLM systems behave like deterministic services via strict retrieval contracts, citation-based validation, and strong observability—shipping a knowledge assistant used daily by 50+ users while driving hallucinations near zero and materially improving latency and cost.”

Agile Algorithms API Integration Audit Logging AWS AWS Glue+197

View profile

Nikhitha Todeti

Screened

Mid-level AI Engineer specializing in ML, LLM applications, and data automation

Atlanta, GA4y exp

Exus Renewables North AmericaGeorgia State University

“Data/ML practitioner who has built a production RAG-based knowledge assistant integrated into Microsoft 365/internal dashboards to help employees query internal documents in plain English. Experienced orchestrating and hardening ETL pipelines with Airflow and Azure Data Factory (validation, retries, monitoring) and running end-to-end model evaluation and production performance tracking via Power BI.”

A/B Testing Agile API Integration Azure Data Factory Classification Clustering+81

View profile

Presha Nakrani

Screened

Intern Software Developer specializing in ML, NLP, and data engineering

India1y exp

Karmanye TechUniversity of Texas at Dallas

“Robotics competition (ABU Robocon) team member who programmed two robots for a rugby-style game, integrating IoT sensors and real-time decision-making. Implemented low-latency, secure inter-robot communication by moving from Bluetooth to ESP8266/NodeMCU WiFi (with Bluetooth as backup) and used OpenCV plus CNN training workflows for vision-related tasks; no practical ROS/ROS2 experience.”

Python C Java HTML CSS JavaScript+71

View profile

Neel Thiru

Screened

Mid-level Data Analyst specializing in analytics engineering and financial services

3y exp

Lipdub AiSeneca Polytechnic

“Data-driven growth and partnerships professional with experience leading an analytics/reporting vendor rollout end-to-end (vendor selection via stakeholder interviews and PoC, then negotiating scope/pricing/support and tracking adoption/efficiency/accuracy KPIs). At PC Financial, built regression and segmentation models to optimize multi-channel targeting (in-app/email/push), driving +15% campaign engagement and +10% PC Optimum offer loads, and ran behavior-triggered lifecycle experiments that lifted upsell conversion by 20%.”

PostgreSQL MySQL Tableau Power BI Python Pandas+54

View profile

Manas Agarwal

Screened

Junior Full-Stack Software Engineer specializing in Python APIs, React, and cloud AI integrations

Superior, CO2y exp

VertexOneUniversity of New Haven

“Customer-facing software engineer who builds and deploys practical AI/RAG solutions (e.g., an AI assistant for searching billing PDFs) by deeply understanding support workflows and iterating with users. Demonstrates strong production instincts—quickly stabilizing peak-traffic API timeouts with caching/background jobs, then implementing durable fixes with proper monitoring and maintainable code practices.”

Python Java JavaScript TypeScript PHP SQL+158

View profile

Sriram Krishna

Screened

Mid-Level Software Engineer specializing in AI/ML and cloud-native platforms

Redmond, WA5y exp

Quadrant TechnologiesSeattle University

“Backend/AI engineer who has built production LLM orchestration and agentic workflow systems in Python/FastAPI on Kubernetes across AWS/Azure. Demonstrated strong reliability engineering by debugging a real-world memory retention issue that caused latency spikes/timeouts, and strong data/performance chops with a PostgreSQL optimization that cut query latency from ~1.2s to ~15ms. Targets roles building scalable, guardrailed AI-driven workflow automation with robust observability and human-in-the-loop controls.”

Python C#Java JavaScript TypeScript SQL+145

View profile

Karthik Patralapati

Screened

Mid-level AI/ML Software Engineer specializing in GPU-optimized LLM inference and cloud microservices

Seattle, WA5y exp

DVR SoftekSan José State University

“Built and deployed a production RAG-based multilingual analytics assistant for healthcare operations, enabling non-technical teams to query claims/EHR and risk metrics with grounded explanations. Demonstrates strong end-to-end LLM system engineering (retrieval tuning, re-ranking, hallucination controls, verification layers) plus workflow orchestration (Airflow/Composer/Step Functions) and stakeholder-driven iteration via prototypes and dashboards.”

Python Pandas NumPy PySpark C C+++197

View profile

Gomathy Selvamuthiah

Screened

Junior Data/AI Engineer specializing in MLOps, real-time pipelines, and LLM applications

Portland, US2y exp

SBD TechnologiesNortheastern University

“Built an LLM-driven MLOps agent at SBD Technologies that automated an EV-charging prediction workflow end-to-end, integrating with real-time Kafka/FastAPI systems supporting 120K+ chargers at 99.99% event delivery. Addressed frequent schema drift by implementing SQLAlchemy/Flyway validation (60% reduction in drift issues) and deployed as Kubernetes microservices with GitHub Actions CI/CD; also has Airflow-based ingestion/crawling experience into Snowflake and stakeholder-facing delivery via a Fleetcharge PWA.”

Python Java C C++FastAPI Node.js+99

View profile

Bhavana Polakala

Screened

Intern Data Scientist specializing in GenAI agents, RAG, and ML platforms

Chicago, IL3y exp

Immerso.aiIllinois Institute of Technology

“LLM/agent systems builder who deployed a production hybrid router for immerso.ai that dynamically selects retrieval vs reasoning vs generative pathways, achieving an 82% factual-accuracy lift. Deep hands-on experience optimizing local Mistral 7B inference (4–5 bit GGUF quantization, KV-cache reuse) and building reliable RAG/agent workflows with LangChain/LangGraph/AutoGen across GCP Cloud Run and AWS (ECS/Lambda).”

AJAX Apache Tomcat BigQuery Bootstrap C++CI/CD+153

View profile

Vamsi Krishna

Screened

Senior Machine Learning Engineer specializing in MLOps and Generative AI

Austin, TX7y exp

Tungsten AutomationUniversity of Central Missouri

“Built and deployed a production generative-AI copilot at Tungsten that automates invoice/form extraction template creation, reducing weeks of manual model-building work. Combines fine-tuned LLMs (PyTorch/HuggingFace) with OpenCV layout grounding to reduce hallucinations, and runs an end-to-end Kubeflow-based MLOps pipeline with drift monitoring, canary releases, and automated retraining.”

A/B Testing Amazon DynamoDB Amazon EC2 Amazon EKS Amazon Redshift Amazon RDS+111

View profile

Sampath Achalla

Screened

Mid-level Python Full-Stack Engineer specializing in AI microservices and cloud data platforms

USA3y exp

DoJaGaIllinois Institute of Technology

“Backend-leaning full-stack engineer in fintech/payments who shipped an end-to-end Stripe payments + webhook system for a financial microservices platform, emphasizing ledger accuracy via idempotency, transactional writes, retries, and DLQs. Also delivered a real-time React/TypeScript payment status dashboard informed by user interviews, and improved production performance by 35% p95 latency through PostgreSQL tuning and Redis caching on AWS.”

Python SQL Django Flask FastAPI SQLAlchemy+178

View profile

Dinal Dholiya

Screened

Mid-level Full-Stack Engineer specializing in AI-powered and cloud-native systems

Remote4y exp

ZentraisUniversity at Buffalo

“Product-minded engineer who has owned features end-to-end, including a full onboarding redesign that lifted completion ~25% and a production LLM/RAG report-generation system with strong guardrails (schema-constrained JSON, confidence gating, logging) and an automated eval/regression loop built from real user queries. Also built a scalable research data pipeline ingesting messy PDFs/JSON/CSVs with normalization, idempotent reruns, observability, and cost/latency tradeoffs.”

TypeScript JavaScript Python Go SQL C+++91

View profile

Saideep Reddy Talusani

Screened

Mid-level Backend Engineer specializing in Python APIs and cloud-native services

Texas, USA5y exp

Verveba TelecomNorthern Arizona University

“Data engineer with experience at Morgan Stanley and Star Health owning production-grade lakehouse pipelines for credit risk and healthcare datasets. Built Azure/Databricks/Delta/Snowflake-based platforms processing millions of records per day with strong data quality, observability (Monte Carlo/Azure Monitor), and reliability practices, plus experience delivering curated data services with performance tuning and backward-compatible versioning.”

Audit Logging Containerization Docker FastAPI Flask JavaScript+99

View profile

Sam Sharif

Screened

Senior Full-Stack Engineer specializing in React and Python

Drexel Hill, Pennsylvania9y exp

Tech PrysmTemple University

“Backend/data engineer focused on production AWS systems: builds multi-tenant FastAPI services on ECS behind API Gateway/ALB with serverless orchestration (Lambda, SQS, Step Functions) and strong reliability practices (JWT/JWKS auth, idempotency, backoff retries, structured logging). Also delivers AWS Glue/PySpark ETL pipelines with schema/data-quality controls and has modernized legacy analytics logic into Python with parity validation; improved a key dashboard SQL query from ~12–25s to ~2–3s.”

React JavaScript TypeScript Vue.js Bootstrap Tailwind CSS+80

View profile

Alejandro Alemany

Screened

Senior Full-Stack AI/ML Engineer specializing in MLOps and GenAI

Belmont, Michigan10y exp

AvaSureCapitol Technology University

“Senior backend/data engineer who has built and maintained HIPAA-compliant, real-time clinical FastAPI services on AWS, orchestrating ML/LLM and vector DB calls with strong reliability patterns (auth, timeouts/retries, graceful degradation, idempotency). Also delivered AWS IaC/CI-CD (Terraform/Helm/GitHub Actions) across EKS/Lambda/SageMaker and built Glue/Spark ETL with schema evolution and data quality controls, plus demonstrated large SQL performance wins (15 min to <9 sec) and hands-on incident ownership.”

Angular API Design Authentication Authorization AWS Azure Blob Storage+197

View profile

Machine Learning Engineers Data Scientists Software Engineers Data Engineers AI Engineers Data Analysts AI & Machine Learning Engineering Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?