Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Apache Hadoop Professionals

Pre-screened and vetted.

Apache Hadoop Python SQL Docker AWS Apache Spark

Sai Harshitha Sivalingala

Screened

Junior Full-Stack Software Engineer specializing in cloud-native systems and ML tooling

United States2y exp

Veterinary Diagnostic Laboratory at Iowa State UniversityIowa State University

“New-grad backend engineer who built a real-time genome analysis pipeline, replacing a slow batch system with an event-driven distributed architecture in Python/Redis and a React progress dashboard. Reports ~6x improvement and cutting analysis time from days to hours with zero data loss under peak load, emphasizing reliability patterns like retries and idempotency plus API security (JWT/RBAC/HTTPS).”

AWS Caching CI/CD Cloud Computing C++CSS+85

View profile

Bhargavi Karuku

Screened

Mid-level AI Engineer specializing in ML, NLP, and Generative AI

Atlanta, GA4y exp

CGIUniversity of New Haven

“AI/LLM engineer with production experience building an LLM-powered investment recommendation system using RAG and chatbots, deployed via Docker/CI/CD and scaled on Kubernetes. Demonstrated measurable performance wins (sub-200ms latency) through QLoRA fine-tuning and TensorRT INT8/INT4 quantization, plus strong MLOps/orchestration background (Airflow ETL + scoring, MLflow monitoring) and stakeholder-facing delivery using demos and Tableau dashboards.”

A/B Testing Agile AWS Azure Machine Learning BigQuery Claude+129

View profile

Sakshi More

Screened

Mid-level Full-Stack Software Engineer specializing in cloud, data science, and ML systems

Texas, USA4y exp

Granite ConstructionUniversity of Texas at San Antonio

“Backend/data engineer focused on AWS-based, low-latency event processing for market data and social-signal sentiment systems. Has led a monolith-to-event-driven migration with feature-flagged incremental rollout, and emphasizes production-grade security (OAuth2/JWT, secrets management, Supabase RLS) and data integrity (deduplication/idempotency) under high-volume spike conditions.”

Agile Amazon CloudWatch Amazon DynamoDB Amazon ECS Amazon RDS Ansible+128

View profile

Sunayana Rongali

Screened

Junior Marketing Specialist specializing in AI-powered performance and lifecycle marketing

Dallas, Texas1y exp

ShoebaccaUniversity of Texas at Dallas

“Growth-creative/performance marketer who owned end-to-end paid social creative testing for Shoebacca’s seasonal footwear launch, driving ~22% CPA reduction and ~30% ROAS improvement by shifting from product-focused ads to UGC with stronger early hooks and clearer value messaging. Experienced translating platform-specific performance signals into modular creative iterations across Meta, TikTok, and YouTube, and aligning performance + creative teams via structured briefs and weekly review cadences.”

Predictive analytics Email marketing Performance optimization A/B testing Customer segmentation Market research+78

View profile

Phani Tarun Munukuntla

Screened

Junior Machine Learning Engineer specializing in LLMs, NLP, and MLOps

New York, USA2y exp

University at BuffaloUniversity at Buffalo

“Developed and productionized VL-Mate, a vision-language, LLM-powered assistant aimed at helping visually impaired users understand their surroundings and query internal knowledge. Emphasizes reliability and safety via confidence thresholds, uncertainty-aware fallbacks, hallucination grounding checks, and rigorous offline + user-in-the-loop evaluation, with experience orchestrating multi-step LLM pipelines (LangChain-style and custom Python async) and deploying on containerized infrastructure.”

Python PySpark Apache Airflow Java JavaScript SQL+121

View profile

Gopichand Amaraneni

Screened

Mid-level AI/ML Engineer specializing in healthcare ML, MLOps, and LLM/RAG systems

USA4y exp

CitiusTechNorthwest Missouri State University

“Healthcare-focused ML/LLM engineer who built a production hybrid RAG workflow to automate prior authorization by retrieving from medical guidelines/historical cases (FAISS) and generating grounded rationales for clinicians. Strong in operationalizing ML with Airflow/Kubeflow/MLflow on SageMaker, optimizing latency (ONNX/quantization/async), and reducing hallucinations via evidence-only prompting; also partnered closely with clinical ops to deploy a readmission prediction tool used in daily rounds.”

Python NumPy Pandas JSON SQL PostgreSQL+151

View profile

Rahul Vemuri

Screened

Mid-level Data Engineer specializing in AI/ML, RAG systems, and cloud data pipelines

Malvern, PA4y exp

PQ CorporationPenn State Great Valley School of Graduate Professional Studies

“Built a production lead-generation system using AI agents that researches the internet for relevant leads and integrates RAG-based contact enrichment/shortlisting aligned to existing CRM data, enabling sales reps to focus more on selling. Also has hands-on AWS data orchestration experience (Glue, Step Functions) moving raw data into Redshift and evaluates agent performance with human-in-the-loop plus BLEU/perplexity metrics.”

Amazon Bedrock Amazon Redshift Amazon S3 Apache Airflow Anomaly Detection AWS+137

View profile

Jaideep Janapati

Screened

Mid-level Data Engineer specializing in cloud data platforms and real-time pipelines

Denton, TX5y exp

Real DynamicsUniversity of North Texas

“Data engineer who has owned production pipelines end-to-end—from Kafka/Airflow ingestion through SQL/Python validation and dbt transformations into Redshift/BI. Also built and operated a large-scale distributed web scraping platform (50–100 sites daily, ~5–10M records/day) with Kubernetes, Kafka queues, robust retries/DLQ, anti-bot measures, and backfill-safe raw HTML storage.”

SDLC Agile Waterfall R Python SQL+134

View profile

Yash Amre

Screened

Intern Data Scientist specializing in machine learning and NLP

California, USA1y exp

LexTrack AIUniversity of Colorado Boulder

“Analytics-focused early-career candidate with internship experience owning reporting and system performance analysis projects end to end. They combine SQL data preparation, Python automation, and dashboard delivery with measurable impact, including roughly 50% less manual reporting and about 20% better forecast accuracy.”

Python R SQL C HTML CSS+164

View profile

Aneri Patel

Screened

Junior Machine Learning Engineer specializing in LLM fine-tuning and semantic retrieval

Washington, D.C.2y exp

Enquire AI, Inc.George Washington University

“Backend engineer with legal-tech and AI workflow experience: built JurisAI, an end-to-end legal research system using OCR + embeddings + Pinecone vector search to deliver citation-grounded LLM answers with safe failure modes (~90% recall@K). Also led a GW Law metadata migration into Caspio with batch validation and parallel rollout, and has strong FastAPI/GCP production reliability and observability practices.”

Python TypeScript SQL R Java Machine Learning+133

View profile

Anita Bhagashetti

Screened

Mid-Level Software Engineer specializing in distributed systems and cloud microservices

3y exp

ZeOmegaBinghamton University

“Built and productionized a RAG-based semantic search system for video-derived data, focusing on measurable success metrics (p95 latency, reliability, cost/request) and strong observability (prompt versions, retrieved docs, tool calls, token usage). Experienced in diagnosing real-time issues in LLM/agentic workflows and in supporting go-to-market efforts through tailored technical demos, rapid POCs, and post-close onboarding.”

Go Redis Idempotency Node.js Apache Kafka MongoDB+150

View profile

hetvi patel

Screened

Mid-level Software/Data Engineer specializing in cloud ETL pipelines and data infrastructure

New Jersey5y exp

Plore AIAvila University

“Backend/data engineer who built a production analytics data service (Python/FastAPI on AWS/Postgres with PySpark ETL) handling millions of records per day and drove major latency improvements (10–15s to <2s) via indexing, Redis caching, and shifting aggregations into ETL. Also shipped an LLM-based natural-language-to-SQL assistant end-to-end with strong guardrails (schema restrictions, read-only validation, RBAC, masking) and designed a multi-step agent workflow with verification and fallback logic.”

Agile Angular API Design Apache Hadoop Apache Spark Automation+128

View profile

Srisailam Gitte

Screened

Mid-level Data Analyst specializing in ETL pipelines and business intelligence

Albany, NY4y exp

Office of the New York State ComptrollerUniversity at Albany

“Analytics-focused candidate with hands-on experience building compliance and contract utilization reporting from messy contract, vendor, subcontractor, and payment data. They combine SQL and Python automation to improve reporting speed and accuracy, and show strong stakeholder discipline through validation sessions, documentation, and dashboard adoption.”

Python Pandas NumPy SQL PostgreSQL MySQL+83

View profile

RAUNAQ BEDI

Screened

Entry-level Software Engineer specializing in AI, data engineering, and cloud DevOps

San Francisco, CA1y exp

mParticleRochester Institute of Technology

“Product-minded full-stack engineer with strong React/TypeScript, serverless AWS, and Postgres depth, highlighted by owning real-time personalization and onboarding experiences at mParticle. Stands out for combining deep performance debugging with measurable product impact—improving activation by 28%, reducing time-to-insights by 35%, and building reusable internal platform primitives adopted by 12 teams.”

AWS Docker Jenkins Ansible Shell Scripting CI/CD+135

View profile