Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache Spark Python Docker SQL AWS CI/CD

Chaitanya Prasad Reddy Narala

Screened

Mid-level AI/ML Engineer specializing in FinTech risk and fraud systems

USA4y exp

ServiceNowSaint Louis University

“Senior AI/ML engineer focused on production LLM systems, combining RAG, fine-tuning, distributed training, and AI safety to ship scalable real-time moderation and conversational AI platforms. Stands out for pairing deep AWS/Kubernetes MLOps expertise with measurable impact: 40% lower latency/cost, 30-50% fewer hallucinations, and major reliability gains through observability and automation.”

Python Java SQL R Scikit-learn XGBoost+139

View profile

Manish Challa

Screened

Mid-level AI/ML Engineer specializing in Generative AI and financial services

OR, USA5y exp

JPMorgan ChaseSeattle University

“ML/AI engineer with hands-on experience shipping regulated financial AI systems at JPMC and Capgemini, spanning credit risk, fraud detection, and generative AI assistants. Stands out for combining modern LLM/RAG architectures with strong MLOps, real-time infrastructure, and explainability/compliance practices, while delivering measurable business impact in latency, accuracy, cost, and risk reduction.”

Python SQL Java PyTorch TensorFlow Keras+134

View profile

Sachin Komati

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG, and healthcare ML

Florida, USA5y exp

BlackRockFlorida International University

“Built an end-to-end GenAI/RAG platform for financial compliance and research at BlackRock, focused on safe, auditable answers in a highly regulated environment. Combines strong LLM engineering depth with production platform skills and delivered clear business impact, including reducing research/compliance turnaround from hours to seconds, improving retrieval relevance by 22%, and cutting inference costs by 75%.”

SDLC Agile MLOps Cross-Functional Collaboration Machine Learning Deep Learning+134

View profile

Mohammad Gouse Ali Shaik

Screened

Mid-level Software Development Engineer specializing in cloud-native AI/ML systems

California, USA4y exp

ServiceNowCal State Long Beach

“AI/ML-focused engineer with practical experience building RAG-based and multi-agent systems, including architectures for retrieval, reasoning, context processing, and response generation. Stands out for combining LLM productivity gains with disciplined software engineering practices like validation, monitoring, and reproducibility.”

Agile Scrum Kanban SDLC Python TypeScript+136

View profile

Yashwanth Reddy

Screened

Mid-level Machine Learning Engineer specializing in MLOps and NLP

4y exp

Goldman SachsAvila University

“ML engineer with production experience at Goldman Sachs and Medtronic, focused on real-time AI systems in fraud detection and healthcare. Brings a rare mix of backend ML infrastructure, MLOps, and product-minded UX thinking, including dashboard and API design that made complex model outputs usable for analysts and clinical users.”

Machine Learning MLOps Natural Language Processing Predictive Modeling Automation Model Monitoring+70

View profile

Hariom Vyas

Screened

Senior Business Analyst specializing in BFSI reporting and BI

Dallas, TX4y exp

Goldman SachsUniversity of Maryland, Baltimore County

“Forward-deployed, full-stack/platform engineer who owns production features end-to-end across frontend, backend, data, and infrastructure (AWS serverless, Terraform, React). Has modernized critical fintech/payment systems (zero-downtime monolith-to-microservices with Kafka event sourcing) and productionized AI-native support workflows (LLM + RAG on Pinecone) with measurable gains in latency, incidents, CSAT, and support efficiency.”

SQL Python Power BI Tableau Business Intelligence Reporting+280

View profile

Akanksha Kummari

Screened

Mid-level Machine Learning Engineer specializing in MLOps, NLP, and production ML systems

5y exp

ComcastUniversity of Central Missouri

“Backend/founding-engineer-style builder who designed and evolved a near-real-time customer churn prediction platform (FastAPI + AWS SageMaker/Lambda + Redis + MLflow) to enable real-time retention actions, reporting ~18% churn reduction. Demonstrates strong production engineering in secure API design, incremental migrations with data integrity safeguards, and robustness improvements in async pipelines (idempotency, DLQs, retry visibility).”

Python SQL R Bash JavaScript Machine Learning+128

View profile

Mihika Prasad Gaonkar

Screened

Entry-Level Software Engineer specializing in ML and backend systems

Remote1y exp

Easley-Dunn ProductionsUSC

“Built and deployed a production LLM-based real-time stance detection system for social media, fine-tuning LLaMA 3.1 on A100s with DeepSpeed ZeRO/FSDP and iteratively refining data to handle sarcasm and context-dependent meaning. Also has Kubernetes operations experience (Kafka/Logstash/Elasticsearch observability pipeline) and delivered an OCR automation project during a Worley India internship that saved 20+ hours/week for on-site energy safety stakeholders.”

Python Java MySQL PostgreSQL MongoDB Django+85

View profile

Dheeraj Vajjarapu

Screened

Mid-level AI/ML Engineer specializing in MLOps, NLP/LLMs, and computer vision

Remote, USA4y exp

BarclaysYeshiva University

“Built and shipped a production LLM/RAG risk-case summarization and triage system used by fraud/compliance analysts, with strong grounding controls (evidence-cited outputs and refusal on low confidence). Demonstrates end-to-end ownership across retrieval quality, Airflow-orchestrated indexing pipelines, and compliance-grade privacy (PII redaction, RBAC, encrypted redacted logging, and auditable prompt/model versioning) plus a tight feedback loop with non-technical domain experts.”

Python SQL Bash Machine Learning Deep Learning Scikit-learn+124

View profile

Sai Dev

Screened

Mid-level AI/ML Engineer specializing in MLOps, computer vision, and NLP

Newark, CA4y exp

Lucid MotorsCleveland State University

“GenAI/ML engineer from Lucid Motors who built and productionized an LLM-powered RAG diagnostic assistant for manufacturing and maintenance teams, deployed on AWS with Docker/Kubernetes and MLflow. Demonstrates end-to-end ownership from retrieval/prompt design to scalability, monitoring, and workflow integration via APIs, plus production ML pipeline orchestration with Kubeflow (Spark/Kafka + TensorFlow) for predictive maintenance use cases.”

Python C++R SQL Scala TensorFlow+121

View profile

Vasavi Mittapalli

Screened

Senior Data Scientist specializing in GenAI, LLMs and RAG

Dallas, TX5y exp

Texas InstrumentsTrine University

“Built and deployed a production LLM-powered RAG assistant for semiconductor manufacturing failure analysis, reducing engineer triage effort by grounding outputs in retrieved evidence and gating responses with SPC + ML signals (LSTM anomaly scores, XGBoost probabilities). Experienced with LangChain/LangGraph to ship reliable, observable multi-step agents with branching/fallback logic, and evaluates impact using both technical metrics and business KPIs like mean time to triage and downtime reduction.”

A/B Testing Agile Amazon DynamoDB Amazon EC2 Amazon Kinesis Amazon Redshift+195

View profile

Jaswanth Vakkala

Screened

Mid-level Generative AI Engineer specializing in enterprise RAG and multimodal NLP

Iselin, NJ5y exp

Wells FargoSt. Francis College

“Built and deployed a production LLM/RAG chatbot at Wells Fargo for securely querying regulated financial and compliance documents, emphasizing low hallucination rates, explainability, and strict governance. Experienced with LangChain multi-agent orchestration plus Airflow/Prefect pipelines for ingestion, embeddings, evaluation, and retraining, and partnered closely with compliance/operations to drive adoption through demos and feedback-driven retrieval rules.”

A/B Testing Anomaly Detection Apache Hadoop Apache Hive Apache Spark AWS+224

View profile

Sahar Zargarzadeh

Screened

Junior AI/Backend Software Engineer specializing in ML and scalable systems

Dallas, TX2y exp

PMGUniversity of Maryland, College Park

“Backend engineer with strong AWS/CI/CD experience (multi-repo deployments, Lambda + core app, immutable ECR and image promotion) and a published master’s thesis building an ML framework for Solar PV energy prediction and CO2 reduction impact modeling using ensemble and meta-learning approaches benchmarked against SAM.”

Python Node.js Terraform Java R JavaScript+99

View profile

Yuan Xu

Screened

Junior Machine Learning Engineer specializing in multimodal AI and audio deepfakes detection

Berkeley, California3y exp

Scam AICarnegie Mellon University

“Internship experience building production-oriented AI systems, including a real-time voice scam/spoof detector (RawNet + AASIST) hardened for noisy audio via aggressive augmentation and Zoom-based noise simulation, evaluated with EER on clean and wild datasets. Also built an LLM-driven UI automation agent using Playwright for apps like Linear/Notion with modular tool design, unit tests, and replayable scripted scenarios, and has AWS Step Functions experience orchestrating Lambda/Cognito workflows.”

Python C C++Java Linux SQL+78

View profile

Bernard Griffin

Screened

Senior Data Scientist / ML Engineer specializing in cloud ML pipelines and GenAI

Baltimore, MD17y exp

IntelIllinois Institute of Technology

“ML/NLP practitioner with experience building a transformer-failure prediction system that combines sensor signals with unstructured maintenance comments using LLM-based extraction and similarity validation. Strong emphasis on production readiness—data leakage controls, SQL-driven data quality tiers, and rigorous bias/fairness validation (including contract/spec evaluation across diverse company profiles).”

A/B Testing Amazon Bedrock Amazon EC2 Amazon Kinesis Amazon Redshift Amazon S3+130

View profile

Yu Liu

Screened

Senior Big Data Engineer specializing in AML/KYC compliance and cloud data platforms

New York, NY17y exp

CitigroupUniversity of Missouri

“Data engineer with experience delivering an end-to-end pipeline handling ~3.5TB in a star-schema setup (fact + dimensions) and producing business-facing tables in Hive/Spark. Identified and resolved UAT-reported duplicate issues caused by joins through root-cause analysis, and also built automation to run Spark SQL metrics on weekly/monthly/quarterly cadences and distribute results to users.”

Python JavaScript Shell Scripting SQL MySQL PostgreSQL+110

View profile

Nafeezuddin Mohammed

Screened

Mid-level Data Engineer specializing in Analytics & AI/ML

Virginia, USA6y exp

SonyFitchburg State University

“Data engineer with experience at Sony and Walmart building high-volume, near-real-time analytics and ingestion systems. Has owned end-to-end pipelines from Kafka/Spark streaming through S3/Parquet and Redshift/Looker, emphasizing data quality (Great Expectations), observability (CloudWatch/Azure Monitor), and reliability (Airflow SLAs, retries, checkpointing), including measurable performance and latency improvements.”

Agile Amazon CloudWatch Amazon Redshift Amazon S3 Anomaly Detection Apache Airflow+124

View profile

Bhavya Sree Ganja

Screened

Senior Data Engineer specializing in cloud lakehouse platforms and streaming analytics

Pittsburgh, PA8y exp

First National BankTexas A&M University-Corpus Christi

“Data engineer focused on fraud and banking analytics who has owned end-to-end batch + streaming pipelines at very large scale (hundreds of millions of records/day). Built robust data quality/observability layers (schema validation, anomaly detection, alerting) and delivered low-latency serving via AWS Lambda/API Gateway with DynamoDB + Redis, plus external data ingestion/scraping pipelines orchestrated in Airflow with anti-bot protections.”

Agile Amazon API Gateway Amazon CloudWatch Amazon DynamoDB Amazon EC2 Amazon Kinesis+210

View profile

Ajay Madhusudhan Thumala

Screened

Junior Software Engineer specializing in data engineering and LLM applications

Irvine, CA1y exp

GeisingerUC Irvine

“Computer science engineer and master’s graduate who independently built a mechatronics-heavy capstone prototype: a smartphone concept for deafblind users using micro-actuator arrays for braille reading. Also has platform engineering experience at Quantiphi, deploying webhooks to Kubernetes and implementing GitOps-based CI/CD using AWS CodeCommit/CodeBuild and ECR.”

API Development API Gateway AWS Bash C C+++206

View profile

Nikhil Soni

Screened

Junior AI/ML Engineer specializing in LLM systems and retrieval-augmented generation

New York, NY2y exp

Quant AI ResearchNYU

“Built and deployed a production LLM-powered market intelligence and decision-support platform for noisy, real-time financial data, using a high-throughput embedding + vector DB RAG architecture to reduce hallucinations while keeping latency and cost low. Operated it at scale with GPU-backed inference (continuous batching/quantization), FastAPI on Kubernetes, and Airflow-orchestrated ingestion/embedding/retraining workflows, with strong schema-based reliability and monitoring.”

Python SQL C C++Java HTML+120

View profile

Manasa Mangipudi

Screened

Mid-level Machine Learning Engineer specializing in NLP and computer vision

3y exp

Columbia UniversityRutgers University–New Brunswick

“AI/ML engineer with production experience building an LLM-powered resume-to-job matching and feedback product using RAG, with a strong focus on latency, hallucination control, and scalable deployment. Experienced orchestrating ML inference and backend services on Kubernetes and applying rigorous evaluation/guardrail practices; also partnered with business/product stakeholders at Walmart to improve an NLP-based supplier support system.”

Python Java R SQL C++MATLAB+106

View profile

Bhanu Prakash Reddy Dakilli

Screened

Mid-level Data Engineer specializing in Azure ETL/ELT and data warehousing

Framingham, MA4y exp

Bank of AmericaNew England College

“Data engineer who has owned end-to-end production pipelines for customer transaction data (~2–5 GB/day) using Python/PySpark/SQL and Airflow, delivering major reliability and speed gains (70% faster reporting; 60–70% fewer data issues). Also built a daily external web-scraping system with anti-bot handling and safe, idempotent Airflow-driven backfills, plus a Python data API optimized with indexing/caching and tested for correctness.”

Python SQL PySpark Apache Spark Java Power BI+97

View profile

Aditya Jhaveri

Screened

Mid-level Software Engineer specializing in AI, big data, and distributed systems

Jersey City, NJ3y exp

New York UniversityNYU

“Software Developer at NYU (GEMSS) focused on scaling and optimizing a data-heavy asset management web app, including migrating/optimizing data access via Google Sheets API and Firestore. Previously an SDE at Sainapse working on Spring Boot microservices POCs (Kafka, Hadoop at 2B+ record scale). Built an end-to-end Apple Wallet coupon generation/redemption system using PassKit + Google Apps Script with measurable ops impact (40% efficiency gain).”

Agile Algorithms Anomaly Detection Apache Hadoop Apache Hive Apache Kafka+124

View profile

Software Engineers Machine Learning Engineers Data Scientists Data Engineers Software Developers AI Engineers Engineering AI & Machine Learning Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?