Vetted ETL Professionals

Pre-screened and vetted.

KS

Karan Shah

Screened

Mid-level Software & Robotics Engineer specializing in autonomous systems and ROS 2

USA · 3y exp
Boston Dynamics · University of Texas at Arlington

Robotics software engineer focused on production-grade autonomy in GPS-denied environments, building full navigation stacks (perception, EKF/UKF sensor fusion, planning, control) in ROS 2. Integrated YOLOv8/semantic segmentation/RL policies into real-time Nav2 pipelines via a custom perception-aware costmap layer, with emphasis on deterministic control loops, embedded GPU performance, and robust system observability/fault tolerance.

MP

Senior Director of Software Engineering specializing in cloud-native microservices for streaming platforms

San Jose, CA · 20y exp
Xperi · Anna University

Engineering leader who drove TiVo IPTV’s client-facing API modernization from a monolith to AWS-based microservices (API Gateway, Lambda, EKS, Kafka, DynamoDB/RDS), including phased/blue-green production routing of millions of calls. Emphasizes org scaling through skill-based hiring, mentorship, and a you-build-you-run ownership culture, while balancing technical leadership with executive stakeholder communication and budgeting.

NS

Mid-Level Software Engineer specializing in cloud-native systems, automation, and LLM-enabled robotics

Sunnyvale, CA · 6y exp
Amazon · Indiana University Bloomington

React-focused engineer who built a full-stack analytics/test-metrics dashboard (React frontend + Python backend) and turned common UI pieces (data tables, filter panels, chart wrappers) into a reusable internal component library with docs, examples, and basic tests. Strong on profiling-driven performance optimization (React Profiler, memoization) and on owning ambiguous internal-tool projects end-to-end; now planning to package internal patterns into public open-source components.

SS

Sahithi S

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and Generative AI

Texas, USA · 6y exp
NVIDIA · Kennesaw State University

Built and deployed a production generative AI chatbot at NVIDIA using LangChain + GPT-3 integrated with internal data sources, cutting response time nearly in half and improving CSAT by ~12 points. Also delivered LLM-driven QA tools by fine-tuning Hugging Face transformer models and deploying via an AWS-based pipeline (Lambda/Glue/S3) with orchestration (Airflow/Step Functions), CI/CD, Kubernetes, and monitoring (MLflow/Splunk/Power BI).

LT

Mid-level Software Engineer specializing in ML platforms and cloud-native backend systems

San Francisco, CA · 5y exp
City and County of San Francisco · San Francisco State University

Software engineer with experience at Google and the City and County of San Francisco building production AI systems, including a RAG-based internal support chatbot and ML-driven ticket priority tagging. Has scaled data/ML platforms with Airflow on GCP (1M+ records/day, 99.9% SLA) and deployed multi-component systems with Docker and Kubernetes (GKE), using modern LLM tooling (LangChain/CrewAI, Claude/OpenAI, Pinecone/ChromaDB, Bedrock/Ollama).

YL

Yuqi Lei

Screened

Mid-level Software Engineer specializing in financial data platforms and quantitative research tooling

New York City, NY · 3y exp
Bloomberg · Washington University in St. Louis

Owned and built Bloomberg’s end-to-end bitemporal dividend & dividend-forecast data platform powering BQL for 400k+ terminal users. Architected real-time Kafka ingestion (5k–10k msgs/sec) across 100k+ tickers with strong correctness guarantees (PIT/bitemporal time-travel, immutable history to avoid look-ahead bias) and achieved sub-100ms p95 query latency through indexing and caching, deployed with Kubernetes + DLQ and robust monitoring.

SK

Sahithi K

Screened

Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines

Boston, MA · 4y exp
Moderna · University of Massachusetts Dartmouth

Data engineer with experience at Moderna and Block owning high-volume (≈10TB/day) production pipelines on AWS, using Kafka/S3/Glue/dbt/Snowflake with strong data quality and observability practices (schema validation, anomaly detection, CloudWatch monitoring). Also built external financial API ingestion with Airflow retries, throttling/token rotation, and schema versioning, and helped stand up an early-stage biomedical data platform with CI/CD and incident debugging.

LM

Senior Data Engineer specializing in cloud ETL and real-time streaming pipelines

Austin, TX · 5y exp
eBay · Texas Tech University

Data engineer with eBay experience owning end-to-end pipelines for real-time order and user behavior analytics at 10M+ records/day. Strong in PySpark/SQL transformations, Airflow reliability patterns, and production observability (CloudWatch), with measurable outcomes including improved data quality and 30–40% query performance gains. Also built Python data APIs for analytics/ML consumers with versioning and backward compatibility.

Travoy Spelling

Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP

Texarkana, TX · 10y exp
Tredence · University of Texas at Austin

ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).

Akanksha Agrawal

Mid-Level Full-Stack Software Engineer specializing in event-driven data platforms

Bangalore, India · 5y exp
SAP · University of Illinois Urbana-Champaign

Backend engineer with SAP experience modernizing a legacy Flask/PostgreSQL product master data platform into a modular, stateless, containerized service with Kafka-based background processing and improved observability. Also has hands-on academic/side-project experience operationalizing ML (NLP retrieval with TF-IDF/BERT via FastAPI and CV lane-edge detection inference APIs using PyTorch).

Saiteja Gaddam

Mid-Level Data Engineer specializing in cloud data platforms and streaming analytics

3y exp
Intuit · University at Buffalo

Data engineer (Intuit) who owned an end-to-end telemetry and subscription analytics platform processing ~22M events/day, built on Kinesis/S3/Glue/Spark/Airflow/Redshift. Strong focus on reliability and data quality (schema drift controls, quarantine layers, idempotent reruns) and performance tuning, achieving a reporting latency reduction from ~15 minutes to under 4 minutes while enabling revenue and churn analytics for business teams.

Ranganayak Meravath

Mid-level Generative AI Engineer specializing in RAG, agentic copilots, and regulated AI

5y exp
LPL Financial · University of North Texas

Senior engineer who built and productionized an Azure-based Enterprise AI Copilot for financial/compliance teams, focused on grounded, auditable answers with citations to reduce hallucinations in regulated workflows. Experienced designing multi-step agent orchestration and improving reliability through targeted iterations (e.g., fixing chunking/parsing to materially improve citation accuracy), plus building defensive pipelines for messy ERP/operational finance data.

Vismay Patel

Screened

Senior AI & Machine Learning Engineer specializing in NLP, GenAI, and MLOps

Berkeley, CA · 7y exp
Kaiser Permanente · San Francisco State University

ML/GenAI practitioner with healthcare domain depth who built and deployed a production cervical-cancer EMR classification system using a hybrid rules + medical BERT approach, optimized for high recall under severe class imbalance and PHI constraints. Experienced running end-to-end production ML/LLM pipelines with Apache Airflow (validation, promotion/rollback, monitoring, retraining) and partnering closely with clinicians to calibrate thresholds and implement human-in-the-loop review.

SA

Director-level GenAI product leader specializing in conversational AI and platform strategy

San Diego, CA · 18y exp
Amazon · University of North Carolina at Charlotte

Product leader with experience at Amazon and Temenos, focused on AI-powered consumer experiences at scale. They led critical Alexa+ launches, including the first Suno music-generation integration on Alexa, combining strong UX instincts, LLM quality frameworks, and partner negotiation to unblock launches and drive major engagement gains. They also bring people leadership and a thoughtful human-in-the-loop perspective on AI, especially across multilingual and culturally nuanced markets.

SA

Mid-level Full-Stack Engineer specializing in AI-driven data platforms

Santa Barbara, CA · 5y exp
Uber · University of Alabama at Birmingham

Full-stack engineer with 5+ years of experience who built real-time data visualization and analytics systems at Uber, spanning React/TypeScript frontends, Node/GraphQL services, Kafka pipelines, and PostgreSQL. Particularly compelling for teams needing a hands-on builder who can turn ambiguous customer needs into scalable products, and who has also applied RAG with LangChain/OpenAI over 1.8M support files to surface actionable insights.

CS

Mid-level Applied AI Engineer specializing in LLM infrastructure and model optimization

San Jose, CA · 3y exp
AMD · USC

LLM engineer who has deployed privacy-preserving, real-time workplace risk monitoring over massive enterprise chat/email streams, tackling latency, hallucinations, and extreme class imbalance with model benchmarking, RAG + fine-tuning, and a pre-filter alerting layer. Also built an agentic legal contract drafting system (Jurisagent) using LangGraph/LangChain with deterministic multi-agent control flow, structured outputs, and reliability-focused evaluation/telemetry.

PP

Intern Software Engineer specializing in distributed systems and security

San Jose, CA · 6y exp
AnyLog · University of Pennsylvania

Built a production LLM-powered analyst assistant at Discern Security to speed up SOC investigations using a RAG pipeline over security vendor documentation (Python PDF ingestion, vector search). Demonstrates deep, security-critical LLM engineering: structure-aware chunking with custom table parsing, grounded/cited responses, prompt-injection defenses, and post-generation validation, validated via golden datasets and adversarial testing; tool is used daily by analysts.

SS

Mid-level Business Data Analyst specializing in Financial Services and Healthcare analytics

USA · 4y exp
Visa · George Mason University

Full-stack engineer (~4 years) who has owned and shipped customer-facing SaaS onboarding and a role-based real-time analytics dashboard using TypeScript/React with a modular backend. Experienced in microservices with RabbitMQ and strong observability practices (correlation IDs, structured logging, queue metrics), and built an internal deployment tracker integrated with CI/CD that replaced manual spreadsheet/Slack processes.

VS

Mid-level Data Scientist/ML Engineer specializing in GenAI agents and MLOps

5y exp
Capital One · University of the Cumberlands

AI/LLM engineer at Capital One who deployed a production RAG-powered fraud analysis and document intelligence platform using LangChain, OpenAI, Pinecone, Kafka, and AWS. Focused on reliability in real-time investigations via hybrid retrieval, schema-validated outputs, and LLM verification loops, reporting review-time reduction from hours to minutes and ~99% fraud detection precision.

PJ

Prachi Jain

Screened

Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps

Remote, US · 6y exp
JPMorgan Chase · University of Massachusetts Amherst

Built and productionized a RAG-based analytics Q&A assistant for a financial analytics team, enabling natural-language querying across 200+ datasets (SQL tables, PDFs, compliance docs, wikis) and cutting turnaround time by 60%. Deep experience delivering regulated, audit-ready LLM systems on Azure (Azure OpenAI + LangChain) with strict grounding/citations, hybrid retrieval, and AKS-based low-latency deployment, plus strong collaboration with compliance analysts and auditors via iterative Gradio demos.

LK

Junior Full-Stack & Data Engineer specializing in cloud platforms and cybersecurity ML

New York, NY · 2y exp
Accenture · NYU

Built a hackathon "Patient Summary Assistant" backend focused on healthcare workflows, combining RAG-based summarization with HIPAA-minded privacy controls (NER redaction + encryption). Demonstrated strong infra skills by deploying on Kubernetes with Helm/HPA and GitOps (ArgoCD), plus migrating from OpenAI to an on-prem Llama 3 stack (vLLM, quantization, shadow-mode testing) and adding real-time Kafka ingestion for patient vitals/anomaly alerts.

NV

Junior Data & Machine Learning Engineer specializing in MLOps and NLP

Los Angeles, United States · 1y exp
WorkUp · USC

ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).

SV

Sai Venkata

Screened

Senior Data Engineer specializing in cloud lakehouse and real-time streaming pipelines

Texas, USA · 6y exp
CVS Health · University of Central Missouri

Senior data engineer with experience in both healthcare (CVS Health) and financial services (Bank of America), building large-scale Azure lakehouse pipelines (30+ EHR sources, ~5TB) and real-time streaming services (Event Hubs/Kafka) for patient vitals. Strong focus on reliability and data quality (Great Expectations, monitoring/alerting, schema drift automation), with measurable outcomes like 50% runtime reduction and 99%+ uptime for regulatory reporting pipelines.
