Vetted Apache Spark Professionals

Pre-screened and vetted.

Chaitanya Prasad Reddy Narala

Mid-level AI/ML Engineer specializing in FinTech risk and fraud systems

USA | 4y exp
ServiceNow | Saint Louis University

Senior AI/ML engineer focused on production LLM systems, combining RAG, fine-tuning, distributed training, and AI safety to ship scalable real-time moderation and conversational AI platforms. Stands out for pairing deep AWS/Kubernetes MLOps expertise with measurable impact: 40% lower latency/cost, 30-50% fewer hallucinations, and major reliability gains through observability and automation.

Manish Challa

Screened

Mid-level AI/ML Engineer specializing in Generative AI and financial services

OR, USA | 5y exp
JPMorgan Chase | Seattle University

ML/AI engineer with hands-on experience shipping regulated financial AI systems at JPMC and Capgemini, spanning credit risk, fraud detection, and generative AI assistants. Stands out for combining modern LLM/RAG architectures with strong MLOps, real-time infrastructure, and explainability/compliance practices, while delivering measurable business impact in latency, accuracy, cost, and risk reduction.

Sachin Komati

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG, and healthcare ML

Florida, USA | 5y exp
BlackRock | Florida International University

Built an end-to-end GenAI/RAG platform for financial compliance and research at BlackRock, focused on safe, auditable answers in a highly regulated environment. Combines strong LLM engineering depth with production platform skills and delivered clear business impact, including reducing research/compliance turnaround from hours to seconds, improving retrieval relevance by 22%, and cutting inference costs by 75%.

MG

Mid-level Software Development Engineer specializing in cloud-native AI/ML systems

California, USA | 4y exp
ServiceNow | Cal State Long Beach

AI/ML-focused engineer with practical experience building RAG-based and multi-agent systems, including architectures for retrieval, reasoning, context processing, and response generation. Stands out for combining LLM productivity gains with disciplined software engineering practices like validation, monitoring, and reproducibility.

YR

Mid-level Machine Learning Engineer specializing in MLOps and NLP

4y exp
Goldman Sachs | Avila University

ML engineer with production experience at Goldman Sachs and Medtronic, focused on real-time AI systems in fraud detection and healthcare. Brings a rare mix of backend ML infrastructure, MLOps, and product-minded UX thinking, including dashboard and API design that made complex model outputs usable for analysts and clinical users.

Hariom Vyas

Screened

Senior Business Analyst specializing in BFSI reporting and BI

Dallas, TX | 4y exp
Goldman Sachs | University of Maryland, Baltimore County

Forward-deployed, full-stack/platform engineer who owns production features end-to-end across frontend, backend, data, and infrastructure (AWS serverless, Terraform, React). Has modernized critical fintech/payment systems (zero-downtime monolith-to-microservices with Kafka event sourcing) and productionized AI-native support workflows (LLM + RAG on Pinecone) with measurable gains in latency, incidents, CSAT, and support efficiency.

AK

Mid-level Machine Learning Engineer specializing in MLOps, NLP, and production ML systems

5y exp
Comcast | University of Central Missouri

Backend/founding-engineer-style builder who designed and evolved a near-real-time customer churn prediction platform (FastAPI + AWS SageMaker/Lambda + Redis + MLflow) to enable real-time retention actions, reporting ~18% churn reduction. Demonstrates strong production engineering in secure API design, incremental migrations with data integrity safeguards, and robustness improvements in async pipelines (idempotency, DLQs, retry visibility).

MP

Entry-Level Software Engineer specializing in ML and backend systems

Remote | 1y exp
Easley-Dunn Productions | USC

Built and deployed a production LLM-based real-time stance detection system for social media, fine-tuning LLaMA 3.1 on A100s with DeepSpeed ZeRO/FSDP and iteratively refining data to handle sarcasm and context-dependent meaning. Also has Kubernetes operations experience (Kafka/Logstash/Elasticsearch observability pipeline) and delivered an OCR automation project during a Worley India internship that saved 20+ hours/week for on-site energy safety stakeholders.

DV

Mid-level AI/ML Engineer specializing in MLOps, NLP/LLMs, and computer vision

Remote, USA | 4y exp
Barclays | Yeshiva University

Built and shipped a production LLM/RAG risk-case summarization and triage system used by fraud/compliance analysts, with strong grounding controls (evidence-cited outputs and refusal on low confidence). Demonstrates end-to-end ownership across retrieval quality, Airflow-orchestrated indexing pipelines, and compliance-grade privacy (PII redaction, RBAC, encrypted redacted logging, and auditable prompt/model versioning) plus a tight feedback loop with non-technical domain experts.

Sai Dev

Screened

Mid-level AI/ML Engineer specializing in MLOps, computer vision, and NLP

Newark, CA | 4y exp
Lucid Motors | Cleveland State University

GenAI/ML engineer from Lucid Motors who built and productionized an LLM-powered RAG diagnostic assistant for manufacturing and maintenance teams, deployed on AWS with Docker/Kubernetes and MLflow. Demonstrates end-to-end ownership from retrieval/prompt design to scalability, monitoring, and workflow integration via APIs, plus production ML pipeline orchestration with Kubeflow (Spark/Kafka + TensorFlow) for predictive maintenance use cases.

VM

Senior Data Scientist specializing in GenAI, LLMs and RAG

Dallas, TX | 5y exp
Texas Instruments | Trine University

Built and deployed a production LLM-powered RAG assistant for semiconductor manufacturing failure analysis, reducing engineer triage effort by grounding outputs in retrieved evidence and gating responses with SPC + ML signals (LSTM anomaly scores, XGBoost probabilities). Experienced with LangChain/LangGraph to ship reliable, observable multi-step agents with branching/fallback logic, and evaluates impact using both technical metrics and business KPIs like mean time to triage and downtime reduction.

JV

Mid-level Generative AI Engineer specializing in enterprise RAG and multimodal NLP

Iselin, NJ | 5y exp
Wells Fargo | St. Francis College

Built and deployed a production LLM/RAG chatbot at Wells Fargo for securely querying regulated financial and compliance documents, emphasizing low hallucination rates, explainability, and strict governance. Experienced with LangChain multi-agent orchestration plus Airflow/Prefect pipelines for ingestion, embeddings, evaluation, and retraining, and partnered closely with compliance/operations to drive adoption through demos and feedback-driven retrieval rules.

SZ

Junior AI/Backend Software Engineer specializing in ML and scalable systems

Dallas, TX | 2y exp
PMG | University of Maryland, College Park

Backend engineer with strong AWS/CI/CD experience (multi-repo deployments, Lambda + core app, immutable ECR and image promotion) and a published master’s thesis building an ML framework for Solar PV energy prediction and CO2 reduction impact modeling using ensemble and meta-learning approaches benchmarked against SAM.

Yuan Xu

Screened

Junior Machine Learning Engineer specializing in multimodal AI and audio deepfake detection

Berkeley, California | 3y exp
Scam AI | Carnegie Mellon University

Internship experience building production-oriented AI systems, including a real-time voice scam/spoof detector (RawNet + AASIST) hardened for noisy audio via aggressive augmentation and Zoom-based noise simulation, evaluated with EER on clean and wild datasets. Also built an LLM-driven UI automation agent using Playwright for apps like Linear/Notion with modular tool design, unit tests, and replayable scripted scenarios, and has AWS Step Functions experience orchestrating Lambda/Cognito workflows.

BG

Senior Data Scientist / ML Engineer specializing in cloud ML pipelines and GenAI

Baltimore, MD | 17y exp
Intel | Illinois Institute of Technology

ML/NLP practitioner with experience building a transformer-failure prediction system that combines sensor signals with unstructured maintenance comments using LLM-based extraction and similarity validation. Strong emphasis on production readiness—data leakage controls, SQL-driven data quality tiers, and rigorous bias/fairness validation (including contract/spec evaluation across diverse company profiles).

Yu Liu

Screened

Senior Big Data Engineer specializing in AML/KYC compliance and cloud data platforms

New York, NY | 17y exp
Citigroup | University of Missouri

Data engineer with experience delivering an end-to-end pipeline handling ~3.5TB in a star-schema setup (fact + dimensions) and producing business-facing tables in Hive/Spark. Identified and resolved UAT-reported duplicate issues caused by joins through root-cause analysis, and also built automation to run Spark SQL metrics on weekly/monthly/quarterly cadences and distribute results to users.

NM

Mid-level Data Engineer specializing in Analytics & AI/ML

Virginia, USA | 6y exp
Sony | Fitchburg State University

Data engineer with experience at Sony and Walmart building high-volume, near-real-time analytics and ingestion systems. Has owned end-to-end pipelines from Kafka/Spark streaming through S3/Parquet and Redshift/Looker, emphasizing data quality (Great Expectations), observability (CloudWatch/Azure Monitor), and reliability (Airflow SLAs, retries, checkpointing), including measurable performance and latency improvements.

BS

Senior Data Engineer specializing in cloud lakehouse platforms and streaming analytics

Pittsburgh, PA | 8y exp
First National Bank | Texas A&M University-Corpus Christi

Data engineer focused on fraud and banking analytics who has owned end-to-end batch + streaming pipelines at very large scale (hundreds of millions of records/day). Built robust data quality/observability layers (schema validation, anomaly detection, alerting) and delivered low-latency serving via AWS Lambda/API Gateway with DynamoDB + Redis, plus external data ingestion/scraping pipelines orchestrated in Airflow with anti-bot protections.

Ajay Madhusudhan Thumala

Junior Software Engineer specializing in data engineering and LLM applications

Irvine, CA | 1y exp
Geisinger | UC Irvine

Computer science engineer and master’s graduate who independently built a mechatronics-heavy capstone prototype: a smartphone concept for deafblind users using micro-actuator arrays for braille reading. Also has platform engineering experience at Quantiphi, deploying webhooks to Kubernetes and implementing GitOps-based CI/CD using AWS CodeCommit/CodeBuild and ECR.

Nikhil Soni

Screened

Junior AI/ML Engineer specializing in LLM systems and retrieval-augmented generation

New York, NY | 2y exp
Quant AI Research | NYU

Built and deployed a production LLM-powered market intelligence and decision-support platform for noisy, real-time financial data, using a high-throughput embedding + vector DB RAG architecture to reduce hallucinations while keeping latency and cost low. Operated it at scale with GPU-backed inference (continuous batching/quantization), FastAPI on Kubernetes, and Airflow-orchestrated ingestion/embedding/retraining workflows, with strong schema-based reliability and monitoring.

Manasa Mangipudi

Mid-level Machine Learning Engineer specializing in NLP and computer vision

3y exp
Columbia University | Rutgers University–New Brunswick

AI/ML engineer with production experience building an LLM-powered resume-to-job matching and feedback product using RAG, with a strong focus on latency, hallucination control, and scalable deployment. Experienced orchestrating ML inference and backend services on Kubernetes and applying rigorous evaluation/guardrail practices; also partnered with business/product stakeholders at Walmart to improve an NLP-based supplier support system.

Bhanu Prakash Reddy Dakilli

Mid-level Data Engineer specializing in Azure ETL/ELT and data warehousing

Framingham, MA | 4y exp
Bank of America | New England College

Data engineer who has owned end-to-end production pipelines for customer transaction data (~2–5 GB/day) using Python/PySpark/SQL and Airflow, delivering major reliability and speed gains (70% faster reporting; 60–70% fewer data issues). Also built a daily external web-scraping system with anti-bot handling and safe, idempotent Airflow-driven backfills, plus a Python data API optimized with indexing/caching and tested for correctness.

Aditya Jhaveri

Mid-level Software Engineer specializing in AI, big data, and distributed systems

Jersey City, NJ | 3y exp
New York University | NYU

Software Developer at NYU (GEMSS) focused on scaling and optimizing a data-heavy asset management web app, including migrating/optimizing data access via Google Sheets API and Firestore. Previously an SDE at Sainapse working on Spring Boot microservices POCs (Kafka, Hadoop at 2B+ record scale). Built an end-to-end Apple Wallet coupon generation/redemption system using PassKit + Google Apps Script with measurable ops impact (40% efficiency gain).
