Vetted Apache Spark Professionals

Pre-screened and vetted.

MV

Mid-level Applied AI Engineer specializing in LLM agents and RAG systems

Houston, TX5y exp
Neptune TechnologiesNortheastern University
View profile
MK

Senior Java Full-Stack Developer specializing in microservices and cloud (AWS)

Plano, TX9y exp
Beal BankFitchburg State University
View profile
VM

Mid AI/ML Engineer specializing in NLP and generative AI

Saint Louis, MO3y exp
EpsilonSaint Louis University
View profile
NH

Senior AI/ML Engineer specializing in LLMs, NLP, and enterprise SaaS

Dallas, TX7y exp
PuzzleHRNorth American University
View profile
DS

Junior Full-Stack Engineer specializing in mobile apps and backend systems

Scottsdale, AZ2y exp
BLUSVN TechnologiesArizona State University
View profile
SN

Mid AI/ML Engineer specializing in LLMs, MLOps, and FinTech analytics

India, India3y exp
Eudaimonic Inc.Northeastern University
View profile
SS

Mid-level AI/ML Engineer specializing in fraud detection and enterprise ML systems

Oklahoma City, OK6y exp
MidFirst Bank
View profile
Pranava Reddy Kothapally - Junior Data Engineer specializing in Azure, CRM data pipelines, and marketing personalization in Hyderabad, India

Pranava Reddy Kothapally

Screened ReferencesStrong rec.

Junior Data Engineer specializing in Azure, CRM data pipelines, and marketing personalization

Hyderabad, India2y exp
TechwaveCleveland State University

LLM/AI engineer who has deployed production RAG conversational analytics and Text-to-SQL systems over Snowflake and curated data marts, emphasizing enterprise-grade guardrails for accuracy, security, and cost. Notable for a structured approach to reducing hallucinations (curated metric/table registry, SQL validation, RBAC, and citation-backed responses) and for building resilient, observable multi-step agent workflows using LangChain/LlamaIndex and Airflow.

View profile
VP

Vikesh Patel

Screened ReferencesStrong rec.

Senior AI/ML Engineer & Data Scientist specializing in LLMs, RAG, and MLOps

Eagan, MN8y exp
Intertech, Inc.Metropolitan State University

ML/NLP practitioner who has delivered production systems in regulated domains, including a healthcare compliance pipeline using RAG (GPT-4/Claude) plus TF-IDF retrieval that increased document review throughput 4.5x. Also has hands-on experience improving fraud detection data quality via entity resolution (Levenshtein, Dedupe.py) validated with A/B testing, and building scalable, monitored workflows with Airflow, CI/CD, and AWS SageMaker.

View profile
AS

Adithya Sharma

Screened ReferencesModerate rec.

Mid-level AI/ML Engineer specializing in MLOps, NLP, and Generative AI

Remote, USA5y exp
EncoraUniversity of Michigan-Dearborn

Built and deployed a production LLM-powered text-to-SQL/document intelligence chatbot on AWS that lets non-technical business users query complex enterprise databases in plain English. Demonstrates deep practical expertise in schema-aware prompting, embeddings-based schema retrieval, SQL safety/validation guardrails, and rigorous offline/online evaluation with human-in-the-loop approvals for risky queries.

View profile
AS

Ashish Shah

Screened

Mid-level Data Engineer / Software Engineer specializing in streaming and cloud data platforms

Arlington, TX3y exp
The University of Texas at ArlingtonUniversity of Texas at Arlington

Backend engineer with deep Kafka/FastAPI microservices experience who redesigned a notification pipeline to cut end-to-end latency from ~5s to ~3s (including custom partition assignment and consumer tuning). Led a high-stakes ClickUp-to-Oracle migration of 1M+ records using idempotent ETL, reconciliation, and shadow deployment to achieve >99% integrity with zero downtime, and has hands-on production security implementation with Django/DRF (JWT + RBAC).

View profile
SS

Mid-level AI Engineer and Data Scientist specializing in LLM agents and RAG systems

Palo Alto, CA5y exp
LemmataUniversity at Buffalo

Built a production-grade LLM evaluation and regression system that stress-tests models across hundreds of iterations, combining LLM-as-judge, semantic similarity, statistical metrics, and rule-based checks, with results delivered via stakeholder-friendly HTML reports and dashboards. Experienced orchestrating multi-agent RAG workflows using LangChain/LangGraph and event-driven GenAI pipelines in n8n integrating OCR, speech-to-text, and external APIs, with strong emphasis on reliability, observability, and explainable failures.

View profile
AG

Junior Data Analyst specializing in marketing analytics and machine learning

Dallas, Texas1y exp
Maverick Digital TechnologiesUniversity of Texas at Arlington

Built and deployed a production LLM-assisted recommendation and insights platform that unifies structured, semi-structured, and unstructured data via a modular ingestion pipeline, canonical schemas, embeddings, and late-fusion modeling. Experienced in operationalizing ML/LLM systems with Airflow and Kubernetes (Dockerized services, autoscaling, rolling updates) and emphasizes reliability through layered testing, guardrails, monitoring, and A/B experimentation while partnering closely with non-technical stakeholders.

View profile
VS

VIJAY SAGI

Screened

Mid-level Data Engineer specializing in cloud-native batch and streaming pipelines

Prosper, TX5y exp
ACL DigitalTrine University

Data/ML platform engineer with ~6 years in financial services and enterprise data platforms, building regulated fraud/credit-risk pipelines on AWS (Airflow, EMR/Spark, MLflow) and an Azure lakehouse ingesting 50+ sources and serving ~100M records/day. Also led an early-stage deployment of a RAG-based internal AI search tool using AWS Bedrock and LangChain with automated evaluation to validate LLM accuracy.

View profile
Gautam Agrawal - Mid-Level Software Engineer specializing in backend systems, cloud, and applied LLM/NLP in IN, USA

Mid-Level Software Engineer specializing in backend systems, cloud, and applied LLM/NLP

IN, USA4y exp
Project 990Indiana University Bloomington

Applied LLMs to classify long nonprofit mission statements into 8 segments without labeled data, using an ensemble of clustering/embedding methods plus zero-shot RoBERTa/BART and a Tree-of-Thought prompting pipeline with LLM-as-judge evaluation (Gemma). Also built LangChain/LlamaIndex agentic RAG workflows including a text-to-SQL data analysis assistant grounded on DB schema with retries and performance optimizations on an HPC cluster.

View profile
Vengalarao Pachava - Junior AI Data Engineer specializing in Azure Databricks lakehouse and GenAI RAG systems in Irving, TX

Junior AI Data Engineer specializing in Azure Databricks lakehouse and GenAI RAG systems

Irving, TX2y exp
Cloud Rack SystemsIllinois Institute of Technology

Backend/applied AI engineer from Cloud Rack Systems who built production GenAI/RAG and data platforms on Azure/Databricks at enterprise scale (2.5M records/day). Known for making LLM systems behave like deterministic services via strict retrieval contracts, citation-based validation, and strong observability—shipping a knowledge assistant used daily by 50+ users while driving hallucinations near zero and materially improving latency and cost.

View profile
BZ

Bill Zoheb

Screened

Senior AI Engineer specializing in LLMs, RAG, and production ML systems

New York, NY8y exp
HKA EnterprisesUtica University

Built GynAI, an end-to-end maternal clinical decision support platform for OB/GYN practices and hospitals in North America, combining predictive ML with RAG-based LLM explainability. The candidate emphasizes real production ownership across experimentation, deployment, monitoring, and iteration, with reported impact including fewer delayed interventions in high-risk pregnancies and a 15-20% reduction in false positives.

View profile
VB

Mid-level AI Engineer specializing in NLP, computer vision, and healthcare analytics

Dartmouth, US3y exp
Integrated MonitoringUniversity of Massachusetts Dartmouth

Data scientist who has built production LLM agents (GPT-4o + LangChain + RAG) to automate analyst-style ad hoc CSV analysis with guardrails and GPT-as-a-judge evaluation. Also delivered an explainable healthcare NLP system for ICD code classification by collaborating closely with clinicians, using a hybrid rule-based decision tree + BERT model to reach 97% accuracy and cut manual review time.

View profile
RM

Senior Data Scientist / AI Engineer specializing in LLMs, RAG, and production ML

New York, NY5y exp
Bluesap SolutionsDePaul University

Data science professional who has built a production RAG-based LLM question-answering system ("Flash Query") to deliver fast, accurate answers over large document collections, focusing on retrieval quality and grounded responses. Also collaborates with non-technical retail/jewelry stakeholders to turn business questions into predictive models and dashboards for decision-making.

View profile
Sriram Krishna - Mid-Level Software Engineer specializing in AI/ML and cloud-native platforms in Redmond, WA

Mid-Level Software Engineer specializing in AI/ML and cloud-native platforms

Redmond, WA5y exp
Quadrant TechnologiesSeattle University

Backend/AI engineer who has built production LLM orchestration and agentic workflow systems in Python/FastAPI on Kubernetes across AWS/Azure. Demonstrated strong reliability engineering by debugging a real-world memory retention issue that caused latency spikes/timeouts, and strong data/performance chops with a PostgreSQL optimization that cut query latency from ~1.2s to ~15ms. Targets roles building scalable, guardrailed AI-driven workflow automation with robust observability and human-in-the-loop controls.

View profile
Homak Patel - Junior Software Engineer specializing in Agentic AI and Data Systems

Homak Patel

Screened

Junior Software Engineer specializing in Agentic AI and Data Systems

2y exp
EasyBee AINorth Carolina State University

Forward Deployed Engineer at EasyBee AI who productionized a self-storage customer’s multi-agent LLM system end-to-end—rebuilding it with LangGraph/CrewAI, integrating with real property management + CRM systems via an MCP server, and adding observability/guardrails for reliable daily use. Experienced in live troubleshooting of agentic workflows, developer demos/workshops (including an open-source project, MerryQuery), and partnering with sales to close deals through customer-specific technical demos and fast integration feedback loops.

View profile
Annie Suzan Edwin Arokiaraj Caroline - Junior Full-Stack Software Engineer specializing in distributed systems and data pipelines in Tempe, AZ

Junior Full-Stack Software Engineer specializing in distributed systems and data pipelines

Tempe, AZ1y exp
Arizona State UniversityArizona State University

Backend engineer with hands-on experience building distributed data and API platforms (Kafka + Neo4j on Kubernetes), including processing 3M+ NYC taxi trip records and achieving sub-second graph analytics queries. Strong focus on reliability and performance in Python/FastAPI systems—async refactors, caching, timeouts/retries, feature-flagged rollouts, and JWT/RBAC security for production services.

View profile

Need someone specific?

AI Search