Vetted Apache Spark Professionals

Pre-screened and vetted.

PremKumar Gandla - Mid-level AI/ML Engineer specializing in MLOps, NLP, and scalable model deployment in Texas, USA

Mid-level AI/ML Engineer specializing in MLOps, NLP, and scalable model deployment

Texas, USA4y exp
BlackbaudSouthern Arkansas University

Built and deployed a production autonomous AI data analyst agent (LangChain + GPT + Streamlit on AWS) that turns natural-language questions into validated SQL, visualizations, and insights, cutting manual analysis time by ~50%. Emphasizes reliability and MLOps: schema-aware validation/guardrails to prevent hallucinations, scalable large-data processing, and Azure DevOps CI/CD + MLflow for automated deployment and experiment tracking.

View profile
Hari Billa - Mid-level Data Scientist specializing in machine learning, NLP, and healthcare AI in USA

Hari Billa

Screened

Mid-level Data Scientist specializing in machine learning, NLP, and healthcare AI

USA3y exp
HCA HealthcareSouthern Arkansas University

Senior data scientist with hands-on ownership of production ML and GenAI systems across enterprise churn, clinical Q&A, and real-time fraud detection. Stands out for combining strong MLOps discipline with measurable business impact, including $2M+ retained revenue, 10K TPS low-latency fraud infrastructure, and a clinician-reviewed RAG system that improved retrieval accuracy by ~38%.

View profile
Ruturaj Dixit - Junior Data Scientist specializing in AI/ML and product analytics in New York, NY

Ruturaj Dixit

Screened

Junior Data Scientist specializing in AI/ML and product analytics

New York, NY2y exp
Pace UniversityPace University

Applied ML/data scientist who has owned backend-heavy AI systems end-to-end, including a market-signal platform on FastAPI/AWS and rapid MVP delivery in medical computer vision. Particularly interesting for teams needing someone who can combine model development, backend APIs, production debugging, and pragmatic low-latency architecture decisions.

View profile
SREYAS GANGJI - Mid-level Software Engineer specializing in AI/ML backend systems in Chicago, IL

SREYAS GANGJI

Screened

Mid-level Software Engineer specializing in AI/ML backend systems

Chicago, IL4y exp
ZSDePaul University

AI/data engineer at ZS Associates focused on production-grade agentic systems, FastAPI microservices, and cloud-native ETL/RAG pipelines at significant scale. They’ve built multi-agent validation and diagnostic workflows inspired by their Copilot/KUBEPILOT AI work, supporting 500K+ records per day while improving ML inference performance by ~30% and cutting manual troubleshooting by 60%.

View profile
Nidhip Patel - Mid-level Software Engineer specializing in AI/ML and full-stack development in United States

Nidhip Patel

Screened

Mid-level Software Engineer specializing in AI/ML and full-stack development

United States3y exp
UnumWebster University

Backend Java engineer with strong platform/DevOps experience: modernized an insurance claims legacy monolith into DDD-aligned microservices, deployed containerized services on Kubernetes with Jenkins CI/CD and static analysis gates, and implemented GitOps using ArgoCD. Also led major AWS migration planning with dependency mapping and network monitoring to uncover hidden dependencies, and built Kafka-based real-time event streaming with schema-registry-driven evolution.

View profile
SR

Mid-level AI/ML Engineer specializing in RAG systems and Python cloud backends

USA4y exp
CignaSoutheast Missouri State University

Frontend engineer with hands-on experience building AI-powered document search and analytics products, including RAG-based knowledge retrieval interfaces with citations, filters, and document previews. Stands out for combining React/TypeScript architecture with production performance tuning using profiling tools, memoization, lazy loading, and debounced data flows to keep complex, document-heavy UIs responsive.

View profile
SA

SAM AUSTIN

Screened

Senior Full-Stack Engineer specializing in Python, cloud, and SaaS platforms

California, USA8y exp
TackleCalifornia State University, Sacramento

Lead full-stack engineer with strong Python backend depth and hands-on React/TypeScript experience, working in startup-like B2B SaaS environments serving enterprise software customers such as CrowdStrike, HashiCorp, and VMware. Stands out for redesigning tightly coupled systems into modular AWS-based microservices and building configurable integration platforms that improved enterprise customer onboarding and marketplace workflow scalability.

View profile
KG

Mid-level Generative AI Engineer specializing in LLM agents and RAG

Chesterfield, MO4y exp
Reinsurance Group of AmericaUniversity of Central Missouri

GenAI/LLM engineer who built and deployed a production RAG system for enterprise document search and decision support, cutting manual lookup time by 40%+. Experienced with LangChain/LangGraph agent orchestration plus Airflow/Prefect for ingestion and incremental reindexing, with a strong focus on reliability (testing, observability) and stakeholder-driven metrics.

View profile
MS

Mid-level Full-Stack Software Engineer specializing in Java/Spring Boot, React, and cloud

United States4y exp
SubaruCentral Michigan University

Backend/platform engineer who built real-time connected-vehicle telemetry analytics at Subaru, spanning Kafka streaming, Python/FastAPI ETL, and low-latency WebSocket delivery (minutes to <2s). Strong Kubernetes + GitOps practitioner across AWS EKS and Azure AKS (Helm, ArgoCD, Jenkins/GitLab) and has led major on-prem-to-cloud migrations for financial microservices using Terraform and AWS DMS with measurable cost and reliability gains.

View profile
YS

Yash Sanap

Screened

Junior Data Scientist specializing in ML, geospatial analytics, and LLM applications

Virginia Beach, VA2y exp
City of Virginia BeachGeorge Mason University

Built and deployed a production AI “term explainer” agent that adapts explanations to beginner/intermediate/expert users by combining multi-step LLM reasoning with grounded Wikipedia retrieval. Owns end-to-end agent orchestration (smolagents/Python), reliability patterns (fallback across LLM providers, retries, guardrails), and observability/metrics-driven evaluation; also partnered with a non-technical researcher to deliver a plain-language research assistant agent.

View profile
VK

Vaishnavi K

Screened

Mid-level AI/ML Engineer specializing in GenAI, MLOps, and anomaly detection

USA5y exp
TCSUniversity of New Haven

LLM/MLOps engineer who has shipped a production RAG-based technical documentation assistant (FastAPI) cutting manual review by 45%, with deep hands-on retrieval optimization in Pinecone/LangChain (HNSW, hybrid + multi-query search, caching). Also brings healthcare domain experience—building Airflow-orchestrated EHR pipelines and delivering FDA-auditability-friendly predictive maintenance solutions using SHAP/LIME explainability surfaced in Power BI.

View profile
TP

Thilak P

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and big data pipelines

5y exp
W. R. BerkleySacred Heart University

Backend/data engineer who builds Python (FastAPI) data-processing API services for internal analytics/reporting, emphasizing modular architecture, async performance tuning, and reliability patterns (health checks, retries, observability). Also migrated legacy on-prem ETL pipelines to Azure using ADF/Data Lake/Functions and implemented a near-real-time ingestion flow with Event Hubs plus watermarking to handle late events and deduplication.

View profile
AJ

Aman Jain

Screened

Mid-level Software Engineer specializing in cloud-native data pipelines and ML platforms

Boston, MA4y exp
Community Dreams FoundationBoston University

Backend engineer who has owned end-to-end delivery of Python/FastAPI microservices for real-time data processing and alerting, including performance tuning (Postgres optimization, caching, async processing). Strong DevOps/GitOps background: Docker + Kubernetes deployments with GitHub Actions CI/CD and ArgoCD-driven GitOps, plus experience supporting phased on-prem to AWS migrations and building Kafka-based streaming pipelines.

View profile
YP

Mid-level AI Engineer specializing in LLMs, RAG, and data engineering

Boston, MA5y exp
Humanitarians.AINortheastern University

AI Engineer Co-Op at Northeastern University who built a production Patient Persona Chat Bot to help nursing students practice clinical interactions, fine-tuning Llama 3 and integrating a LangChain + Pinecone RAG pipeline deployed on Amazon Bedrock. Emphasizes clinical accuracy and reliability with guardrails, retrieval filtering, and continuous evaluation, and also brings strong data engineering/orchestration experience (Airflow, EMR/PySpark, ADF, dbt, Databricks, Snowflake).

View profile
NB

Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions

Tampa, FL4y exp
LumenUniversity of South Florida

AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail Maven dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.

View profile
SB

Senior DevOps/Platform Engineer specializing in Kubernetes and cloud infrastructure

12y exp
DXC TechnologyVinayaka Missions University

DevOps/Infrastructure engineer with hands-on production experience building Jenkins CI/CD pipelines that provision Kubernetes infrastructure and process data into a MapR cluster. Uses Terraform to provision AWS resources (EC2, S3, VPC, subnets) with remote state in S3, separate environment state files, and code review/validation practices; targeting $135k base.

View profile
Vidit Naik - Junior AI/ML & Full-Stack Engineer specializing in LLMs and RAG systems in San Francisco, CA

Vidit Naik

Screened

Junior AI/ML & Full-Stack Engineer specializing in LLMs and RAG systems

San Francisco, CA2y exp
Checksum AIUC Riverside

Forward-deployed engineer who built a production AI drone-control chatbot that lets users fly a drone via natural language while viewing a real-time feed. Implemented RAG over drone SDK documentation (vector DB + top-k retrieval) and LoRA fine-tuning, with a focus on latency, token efficiency, and cost reduction, and regularly works with non-technical clients to integrate and explain AI system architecture.

View profile
Santhoshi Priya Sunchu - Mid-level Data Scientist specializing in NLP and predictive modeling in Massachusetts, USA

Mid-level Data Scientist specializing in NLP and predictive modeling

Massachusetts, USA5y exp
Blue Cross Blue Shield of MassachusettsUniversity of Massachusetts Dartmouth

AI/ML practitioner in healthcare/insurance (Blue Cross Blue Shield) who built and deployed a production NLP system to classify patient risk from unstructured clinical notes. Experienced in end-to-end pipeline orchestration (Airflow, AWS Step Functions/Lambda/SageMaker) and real-time optimization (BERT to DistilBERT on AWS GPUs), with strong clinician collaboration to drive adoption.

View profile
Snehitha Penumaka - Mid-level AI/ML Engineer specializing in predictive modeling and cloud ML pipelines in Dallas, TX

Mid-level AI/ML Engineer specializing in predictive modeling and cloud ML pipelines

Dallas, TX3y exp
Cambard LLCUniversity of Texas at Dallas

LLM engineer/data engineer who has deployed production RAG systems for internal-document Q&A, building end-to-end ingestion, embedding, vector search, and FastAPI serving while actively reducing hallucinations and latency through rigorous retrieval tuning and caching. Also experienced in orchestrating cloud data pipelines (Airflow, AWS Glue, Azure Data Factory) and partnering with non-technical business teams to deliver AI solutions like automated document review.

View profile
Jitesh Kumar S - Junior Machine Learning Engineer specializing in NLP, computer vision, and MLOps in Lafayette, IN

Junior Machine Learning Engineer specializing in NLP, computer vision, and MLOps

Lafayette, IN3y exp
YaarcubesUniversity of Maryland, College Park

ML/LLM engineer with Meta experience building production AI systems for near real-time user-report classification and summarization under strict latency (<250ms), safety, cost, and privacy constraints. Has hands-on MLOps/orchestration experience (Airflow, Spark, MLflow, Kubernetes, Docker, GitHub Actions) plus observability (Prometheus/Grafana) and applies rigorous evaluation, staged rollouts, and A/B testing to keep agent workflows reliable in production.

View profile
AK

Mid-level AI/ML Engineer specializing in production ML, RAG systems, and MLOps

KS, USA4y exp
Black & VeatchUniversity of Central Missouri

Built and shipped a widely adopted, production-grade RAG internal search assistant that unified scattered engineering knowledge, deployed as a FastAPI service on Kubernetes with FAISS + LangChain. Demonstrates deep practical expertise in retrieval tuning (chunking, hybrid search, re-ranking) and in making LLM workflows reliable in production via guardrails, monitoring, and evaluation, plus strong cross-functional delivery with non-technical operations teams.

View profile
KB

Keerthi Basam

Screened

Mid-level Software Engineer specializing in AI/ML for FinTech and Healthcare

United States4y exp
IBMWright State University

Built and deployed an end-to-end fintech product, FinSight, for bank statement analysis and financial Q&A using a production-style RAG architecture. Stands out for combining FastAPI, OpenAI embeddings, FAISS, hybrid SQL/vector retrieval, and practical reliability work like chunking optimization, validation, and low-latency performance tuning.

View profile
DK

Mid-level AI/ML Engineer specializing in applied AI for banking and healthcare

Kentwood, MI5y exp
Fifth Third BankUniversity of Central Missouri

Built end-to-end AI products across fintech and healthcare, including a real-time loan risk prediction system and a patient feedback insights platform. Stands out for combining full-stack delivery, production ML/MLOps on AWS, and pragmatic human-in-the-loop safeguards; reported a 22% improvement in prediction accuracy.

View profile
MB

Senior AI Engineer specializing in machine learning, IoT, and data platforms

Winterville, NC16y exp
FreelanceEast Carolina University

Backend/cloud engineer who built an AWS serverless IoT system that computes Bluetooth beacon locations from telemetry using heavy scientific Python (NumPy/SciPy/pandas) packaged as Dockerized Lambda, integrated with Java microservices and scheduled batch orchestration. Has deep AWS delivery experience (CI/CD with Code* tools, CloudFormation, cost controls) and has led high-severity incident response including CloudTrail forensics and infrastructure recovery after a compromised-keys crypto-mining attack.

View profile

Need someone specific?

AI Search