Vetted Data Pipelines Professionals

Pre-screened and vetted.

PK

Junior Software Engineer specializing in full-stack systems and distributed log analytics

Miami, FL1y exp
NeocisCarnegie Mellon University

CMU candidate with hands-on experience taking LLM concepts from research prototypes toward production-ready designs (structured outputs, guardrails, failure-scenario evaluation). Also partnered with sales/customer teams at Mazecare to drive adoption with Dontia Alliance (largest dental clinic chain in Singapore) and engaged Singapore government stakeholders, bridging clinical workflow needs with IT security/integration concerns.

View profile
RK

Rutuja Kawade

Screened

Mid-level Software Engineer specializing in cloud infrastructure and distributed systems

Atlanta, GA3y exp
RakutenGeorgia Tech

Cloud infrastructure/product engineer with end-to-end ownership of cloud-native storage/observability products, including taking an internal CMS to Google Cloud Marketplace and scaling to ~40,000 deployments. Strong in Kubernetes-based platforms (Operators, microservices, RabbitMQ) and performance/scalability work (e.g., 200% cluster capacity increase) plus internal tooling that materially improved SRE/QA debugging and release velocity.

View profile
KL

Ke Liu

Screened

Mid-Level Software Engineer specializing in search platforms and distributed systems

New York, NY4y exp
Fitch RatingsColumbia University

JavaScript/React-focused engineer with meaningful open-source impact: redesigned cache key normalization for a client-side data fetching/caching library using deterministic hashing, added robust test coverage, and collaborated closely with maintainers through GitHub PRs/issues. Also drives measurable runtime improvements by profiling hot paths, refactoring core abstractions, and validating with benchmarks/load tests; has taken ownership of unowned initiatives like improving relevance/ranking in an internal search platform.

View profile
SR

Mid-level Data & Business Analyst specializing in analytics engineering and BI

6y exp
AdobeUniversity of Wisconsin–Madison

Data/analytics professional with experience across manufacturing and enterprise environments (Wisconsin School of Business project with CNH Industrial; roles/projects at Ascensia Technologies, S&C, and Adobe). Has hands-on work combining warranty/lifecycle tables with technician free-text notes using TF-IDF + tree models (XGBoost/Random Forest), and deep experience in entity resolution/reconciliation across mismatched financial systems using Python/SQL and fuzzy matching, with production-grade pipeline practices in Azure Data Factory/Databricks.

View profile
CS

Intern Data Scientist specializing in generative AI and forecasting

San Francisco, CA5y exp
Aurora AIUniversity of Chicago

ML/NLP practitioner working across healthcare and business/finance use cases: currently fine-tuning a domain-specific Llama 3.1 model for safe reasoning over EHRs/clinical notes using RAG + RL/DPO and RAGAS-based evaluation. Has built UMLS-driven entity normalization pipelines with quantified quality gains and developed embedding/vector-DB systems (FAISS) for semantic matching and forecasting/recommendation applications at Aurora AI and Banxico.

View profile
HC

Intern Software Engineer specializing in ML/NLP and LLM applications

Boulder, CO0y exp
SplunkUniversity of Colorado Boulder

Full-stack AI/LLM engineer who has deployed a production LLM backend (Mistral 14B) on GKE to auto-transform datasets and generate runnable ML training pipelines, addressing hallucinations, schema mismatch, latency, and burst scaling with caching/prompt compression and HPA. Also has internship experience (Splunk, BlackOffer) delivering data automation and 10+ Power BI dashboards for non-technical stakeholders with measurable efficiency gains.

View profile
Felix Li - Intern Software Engineer specializing in data pipelines and full-stack web development in New York, NY

Felix Li

Screened

Intern Software Engineer specializing in data pipelines and full-stack web development

New York, NY1y exp
RadarUniversity of Waterloo

Internship at Radar (geolocation infrastructure) where they owned automation of multiple geospatial data ingestion pipelines (including US/Canadian address ingestion), orchestrating Spark (Scala) jobs via Python-based Airflow and using GitOps-style CI/CD workflows.

View profile
Lamar Petty - Mid-level Full-Stack Product Engineer specializing in data-driven web apps and healthcare systems in San Francisco, CA

Lamar Petty

Screened

Mid-level Full-Stack Product Engineer specializing in data-driven web apps and healthcare systems

San Francisco, CA13y exp
Wikimedia FoundationGeorgia Tech

Full-stack engineer with production experience shipping a healthcare-focused web app (Pregnancy-Pal) using Next.js/TypeScript on GCP, integrating a Python/Flask middleware and FHIR server for patient/practitioner dashboards and messaging. Former Wikimedia Foundation Android engineer who led the end-to-end 'Year in Review' feature and built robust automated testing/CI practices (Espresso, GitHub Actions matrix). Strong emphasis on reliability via rigorous validation, comprehensive Postman testing, and detailed API documentation.

View profile
PS

Senior Software Engineer specializing in backend infrastructure, cloud automation, and reliability

Mountain View, CA8y exp
OracleStony Brook University

End-to-end deployment owner for Oracle document delivery/print services in a hospital-like production environment, focused on reliability/performance at scale (thousands of systems). Also describes implementing event-driven RAG/agentic LLM workflows with attention to embeddings/index consistency, latency, and measurable improvements in response relevance and operational efficiency.

View profile
YJ

YASH JADHAV

Screened

Junior Data Scientist specializing in customer and growth analytics

New York, NY2y exp
Stanford AIMINew York University

Candidate combines fraud analytics experience at Citi with a clinical AI capstone involving reproducible ML pipelines for imaging and notes data. They stand out for turning messy, high-volume data into decision-ready reporting, automating evaluation workflows, and translating analytics into operational impact—from fraud rule changes to retention metric adoption.

View profile
HR

Mid-level Software Engineer specializing in cloud, backend, and healthcare systems

Virginia, USA5y exp
Amazon Web ServicesUniversity of Maryland, Baltimore County

Full-stack engineer with hands-on ownership of a customer-facing advanced performance metrics experience in the Amazon S3 console, spanning React UI, Python/Node services, Redshift/RDS data access, and AWS IaC/CI-CD with CloudWatch/Route53 operational readiness. Demonstrates strong production instincts around resilience (partial failures, multi-region inconsistencies), progressive rollouts/feature flags, and reliable ETL/integration patterns (idempotency, backfills, reconciliation).

View profile
SB

Sowmya BALUVU

Screened

Mid-Level Software Engineer specializing in full-stack development and AWS

Santa Clara, California3y exp
Frugal Innovation HubSanta Clara University

Backend-focused Python engineer who built an end-to-end personalized chatbot service integrating Amazon Redshift context retrieval with Amazon Bedrock, including prompt construction and production-grade reliability controls. Strong platform experience deploying containerized services to Kubernetes with GitOps/ArgoCD, plus hands-on Kafka streaming and phased infrastructure migration execution.

View profile
SP

Mid-level Backend Software Engineer specializing in Python APIs and payment systems

USA6y exp
StripeSouthern Illinois University Carbondale

Backend/ML systems engineer with Stripe payments experience who built an asynchronous processing upgrade handling millions of API requests, cutting peak latency ~20–25% while preserving strict financial consistency via idempotency-safe retries and robust validation/fallbacks. Also built scalable ETL pipelines for messy CSV/Excel/API data with strong observability (structured logging/monitoring) and reliability mechanisms.

View profile
RR

Rahul Reddy

Screened

Senior Data Engineer specializing in cloud data platforms and big data pipelines

New York, NY6y exp
CVS HealthSouthern Arkansas University

Data engineer with healthcare (CVS Health) experience who migrated production PySpark workloads to native BigQuery SQL and built a Great Expectations-based validation microservice on GKE (Flask + REST) integrated into Cloud Composer. Has operated high-volume pipelines (~300–400GB/day) and designed external vendor ingestion on AWS (Lambda/Step Functions/Glue) with schema-drift detection, alerting, and backfill-safe controls to protect downstream Snowflake/BigQuery tables.

View profile
ABHIJOY SARKAR - Senior AI Engineer specializing in LLMs, agentic systems, and MLOps in San Francisco Bay Area, CA

Senior AI Engineer specializing in LLMs, agentic systems, and MLOps

San Francisco Bay Area, CA8y exp
FlipkartIIT Ropar

Built and shipped PromptGuard, a production middleware proxy that secures GenAI RAG/agent systems against prompt injection and unsafe tool use using risk scoring, graded policy actions, and least-privilege tool gating. Also replaced LangChain abstractions with a custom state-machine runner for a production voice agent to reduce latency and improve traceability, and delivered a clinic call assistant by converting front-desk/doctor requirements into scenario-based guardrails and measurable evals.

View profile
Madhu Swetha Kommi - Intern Full-Stack Software Engineer specializing in cloud data pipelines and internal tools

Intern Full-Stack Software Engineer specializing in cloud data pipelines and internal tools

1y exp
MetaUniversity of Cincinnati

Built an internal Meta tool (HiVA Bot) for notification customization and end-to-end task tracking around advertiser-reported issues, including chat-thread creation, org-hierarchy opt-ins, SLA reminders, and search/typeahead features. Implemented the system with a Java/Spring Boot microservices approach and asynchronous patterns, and supported adoption via internal wiki documentation.

View profile
Shriya Bannikop - Mid-level Software Engineer specializing in cloud platforms, data engineering, and distributed systems in Seattle, WA

Mid-level Software Engineer specializing in cloud platforms, data engineering, and distributed systems

Seattle, WA5y exp
Amazon Web ServicesKLE Technological University

Full-stack engineer who built and owned an AI-assisted job-matching dashboard in Next.js App Router/TypeScript, keeping LLM logic server-side and improving performance via deduplication, caching/revalidation, and streaming (35% fewer duplicate LLM calls; 40% faster first render). Also has strong data/backend chops: designed Postgres models and optimized queries at million-record scale (1.8s to 120ms) and built durable AWS multi-region telemetry workflows with idempotency, retries, and monitoring.

View profile
Antara Bhavsar - Mid-level Software Engineer specializing in cloud-native systems and Android development in Bloomington, IN

Mid-level Software Engineer specializing in cloud-native systems and Android development

Bloomington, IN3y exp
Indiana UniversityIndiana University Bloomington

Application-focused software engineer with experience at Amazon and Motorola shipping production systems ranging from developer monitoring/on-call tooling (Alcazar, ~40% MTTR improvement) to consumer AI features used by 100K+ users. Currently building an AI/ML-driven platform with a Python/FastAPI backend on AWS (ECS/RDS/S3) and has handled real production latency/scaling incidents end-to-end.

View profile
SJ

Shalini Jeela

Screened

Senior Data Engineer specializing in data pipelines, APIs, and machine learning

Austin, TX6y exp
ExpediaTrine University

Data engineer with experience at Expedia building SQL Server and Azure Data Factory pipelines for business reporting and analytics. Stands out for pragmatic end-to-end pipeline ownership in ambiguous environments, with a strong emphasis on data quality, rerunnability, query performance, and making downstream datasets reliable for other teams.

View profile
Abraham Musa - Senior Solutions Architect specializing in cloud AI infrastructure and security in Union City, NJ

Abraham Musa

Screened

Senior Solutions Architect specializing in cloud AI infrastructure and security

Union City, NJ9y exp
FreelanceRutgers University

Cloud-native architect focused primarily on AWS, with experience designing Kubernetes and AI/ML infrastructure for customers rather than owning day-to-day operations. Particularly interesting for AI platform roles: they described using Amazon Bedrock to analyze Terraform and automatically generate compliant IaC templates and runbooks for new multi-cloud AI environments.

View profile
BC

Mid-level GenAI Engineer specializing in RAG, LLMs, and enterprise AI

4y exp
Cardinal HealthRivier University

Built and shipped production LLM agents that automate document processing and decision workflows, with a strong focus on reliability, guardrails, and measurable business impact. Stands out for combining RAG, tool calling, evals/monitoring, and ERP integration to deliver 30-35% manual effort reduction and higher throughput without additional headcount.

View profile
Anson T P - Staff Software Engineer specializing in Salesforce, mobile, and IoT platforms in Bengaluru, India

Anson T P

Screened

Staff Software Engineer specializing in Salesforce, mobile, and IoT platforms

Bengaluru, India12y exp
SalesforceAnna University

Frontend-leaning product engineer with strong end-to-end ownership across mobile observability, real-time systems, and scalable React/Next.js architecture. Built a reusable iOS logging framework integrated with Splunk, designed IoT operator dashboards, and shaped a JSON-driven website rendering platform by aligning product, backend, and frontend models.

View profile
AN

Senior Data Scientist / Generative AI Engineer specializing in fraud, risk, and MLOps

5y exp
PayPalUniversity of New Haven

Built and deployed a production LLM/RAG fraud investigation system to replace manual investigator workflows, combining transaction data, historical cases, and policy documents with agent-style steps and LoRA fine-tuning. Demonstrates strong reliability engineering (grounding, citations, abstention paths), performance optimization (retrieval/indexing/caching), and end-to-end MLOps orchestration using Azure ML Pipelines/MLflow plus Kubernetes/Argo with canary and rollback deployments.

View profile

Need someone specific?

AI Search