Vetted Data Engineering Professionals

Pre-screened and vetted.

Data Engineering, Python, SQL, Docker, AWS, CI/CD

Akhil Reddy Edla

Screened

Senior Data Engineer specializing in cloud data platforms and automated data quality

Houston, TX

4y exp

CenterPoint Energy

University of Central Missouri

“Data engineer at CenterPoint Energy who built and operated multiple production-grade GCP data systems: a daily Snowflake→BigQuery replication framework (150+ tables) with Monte Carlo/Atlan-driven observability and schema-drift protection, plus a FastAPI metrics service for pipeline health. Demonstrated measurable impact (40% faster dashboard queries, 70% less manual refresh work, zero data loss) and strong operational rigor (scaling Cloud Run jobs, SAP SLT reconciliation, quarantine patterns, CI/CD via GitHub Actions + Terraform).”

Apache Airflow, Apache Kafka, Apache Spark, API Development, AWS, AWS Glue, +116 more

Rushir Bhavsar

Screened

Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training

1y exp

Cadence Design Systems

Arizona State University

“Founding AI engineer (June 2024) at Talon Labs who built and productionized an LLM-powered chatbot for interacting with proprietary supply-chain documents, deployed at large scale (25–100,000 users). Experienced with RAG/LLM orchestration (LangChain, LlamaIndex, Groq AI) and production ops tooling (Kubernetes, Docker, Kubeflow, Airflow), with a metrics-driven approach to evaluation, observability, and stakeholder alignment.”

AI Agents, Angular, Apache Spark, AWS, AWS CloudFormation, AWS Lambda, +121 more

Nidhish Rao Bairineni

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and MLOps

5y exp

Wells Fargo

Southern Methodist University

“Built and deployed a production RAG-based internal knowledge assistant that let analysts query company documents in natural language, using LangChain/LangGraph with Pinecone and a FastAPI service for integration. Emphasizes reliability in production through hallucination mitigation (retrieval tuning + prompt guardrails) and measurable evaluation/monitoring (accuracy, latency, task completion, hallucination rate), iterating based on user feedback.”

Artificial Intelligence, Machine Learning, Generative AI, Large Language Models, OpenAI, Claude, +173 more

Saumay Killa

Screened

Mid-level Full-Stack Engineer specializing in AI SaaS and web applications

New York, NY

3y exp

HumAInority

NYU

“Built a career platform feature end-to-end that generates tailored resumes and cover letters using a React/TypeScript frontend, Postgres, and AWS Lambda/SQS backend. Strong in event-driven, serverless architecture and pragmatic product iteration, with a quantified 60% improvement in onboarding completion after redesigning the UX with resume parsing and a multi-step flow.”

JavaScript, Node.js, TypeScript, Python, SQL, React, +145 more

Jayakumar Velayutham

Screened

Director-level automotive strategy leader specializing in GTM, data, AI, and value creation

Plano, TX

14y exp

Kaizen Analytix

University of Illinois Springfield

“Automotive-focused GTM and strategy leader who built Kaizen Analytix's Automotive and Mobility practice from roughly $250K to $5M in recurring revenue by turning complex enterprise problems into repeatable offerings. Brings a rare mix of consulting, sales, operations, and delivery execution, with deep expertise in trade/tariff workflows and emerging AI use cases for automotive and mobility.”

Go-to-Market Strategy, Salesforce, Team Building, Data Engineering, Analytics, Business Development, +152 more

Manali Shetye

Screened

Mid-level Applied AI & Data Engineer specializing in automation and enterprise analytics

Irving, Texas

4y exp

Trend Micro

University of Texas at Arlington

“Backend engineer with experience evolving a high-volume agricultural loan processing platform (APMS) at HDFC Bank, emphasizing transactional integrity, auditability, and modularity while integrating with credit bureaus, document management, and risk engines. Also improved automation/reporting robustness at Trend Micro by catching duplicate-event retry edge cases and adding idempotency safeguards.”

Python, R, C#, SQL, JavaScript, C, +95 more

Sharath Bandi

Screened

Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal generation

Saint Louis, Missouri

4y exp

LSEG

Avila University

“Open-source JavaScript contributor focused on performance and maintainability in data visualization libraries—refactored legacy ES5 into modular ES6, added tests/docs, and delivered ~30% faster load times with positive community adoption. Also optimized a React dashboard (~40% load-time reduction) and took ownership in an ambiguous AI product initiative by setting milestones, standing up an initial ML pipeline, and shipping a prototype in ~6 weeks that became the basis for production.”

A/B Testing, Apache Airflow, Apache Hadoop, Apache Hive, Apache Kafka, Apache Spark, +225 more

Manav Bhasin

Screened

Junior Full-Stack Machine Learning Engineer specializing in production ML systems

San Jose, CA

2y exp

AgroFocal Technologies Inc

San José State University

“Software engineer who owned end-to-end delivery of customer-facing agricultural forecast reporting (crop yield/health) and iterated quickly via rigorous edge-case testing and customer feedback. Also built an internal ML training platform (TypeScript/React + Flask/Python + MongoDB) used by every developer, with architecture designed to stay responsive under heavy compute load.”

Python, SQL, JavaScript, TypeScript, C, C++, +65 more

Hanish Kukkala

Screened

Mid-level Data Scientist specializing in Generative AI and NLP

USA

6y exp

CVS Health

University of Central Missouri

“ML/GenAI engineer with recent CVS Health experience building a production RAG system over unstructured financial/research documents using LangChain, FAISS, and Pinecone, plus LoRA/PEFT fine-tuning of GPT/LLaMA for domain-aware summarization. Demonstrates strong applied MLOps and data engineering skills (Airflow/Prefect, Docker/Kubernetes, CI/CD, MLflow) and measurable impact (sub-second retrieval, ~40% better context retrieval, ~25% entity matching improvement).”

A/B Testing, Apache Hadoop, Apache Hive, Apache Kafka, Apache Spark, AWS, +170 more

Hongye Xiong

Screened

Intern Software Engineer specializing in backend, cloud data platforms, and microservices

Renton, WA

0y exp

PACCAR

Seattle University

“Full-stack engineer who shipped a group scheduling SaaS feature with live availability updates using Next.js App Router + TypeScript, owning production reliability after launch (auth debugging, monitoring, polling/backoff tuning). Has hands-on experience with Postgres schema/index design and query optimization (EXPLAIN ANALYZE) and building durable orchestrated backend workflows with retries and idempotency.”

API Gateway, Angular, AWS, AWS Lambda, Automated Testing, CI/CD, +82 more

Vamshi Arempula

Screened

Senior AI/ML Engineer specializing in Generative AI, RAG, and agentic systems

6y exp

Wellmark Blue Cross and Blue Shield

Indiana Wesleyan University

“GenAI/LLM ML engineer (currently at Webprobo) building an enterprise GenAI platform with document intelligence and automation on AWS and blockchain. Has hands-on experience with RAG, LLM evaluation tooling, and orchestrating production LLM workflows with Apache Airflow, plus deep exposure to reliability challenges in globally distributed/edge deployments. Also partnered with business/marketing stakeholders at a banking client to deliver an AI-driven customer retention insights solution.”

A/B Testing, Agile, Amazon API Gateway, Amazon Bedrock, Amazon CloudWatch, Amazon Redshift, +212 more

Shashank Garg

Screened

Engineering leader specializing in FinTech ML/AI platforms

San Francisco, CA

12y exp

TravelBank

San José State University

“Engineering Manager/player-coach leading Data Infrastructure, ML/DS, and AI Engineering pods who recently shipped multiple production agentic GenAI features. Built privacy-preserving LLM workflows (PII redaction via Microsoft Presidio) and drove an AI expense-approval agent from ambiguous ask to GA, cutting approval time from ~2.5 days to <4 hours with >85% accuracy. Also owned a major LLM cost overrun incident and implemented cost observability plus circuit breakers to prevent runaway agent loops.”

Leadership, Team Building, Agile, Generative AI, MLOps, LangGraph, +102 more

Chris Colinsky

Screened

Executive Technology Leader/CTO specializing in data platforms, AI agents, and e-commerce/payments

Los Angeles, CA

23y exp

Howl Technologies

Academy of Art University

“Engineering leader with hands-on coding time who has driven major commerce and data-platform transformations: defined goop’s omnichannel strategy, unified payments to Square, and rebuilt real-time NetSuite inventory flows plus forecasting tools. In the current role, reorganized engineering into Product/Data/Support teams to hit aggressive seasonal roadmaps and led a data-lake/medallion ELT refactor feeding embedded analytics (Tinybird) with improved reliability and cost efficiency; also accelerates onboarding via AI coding tools in a serverless, event-driven architecture.”

AI agents, Analytics, AWS, Business intelligence, CRM, Data engineering, +115 more

Kunal Kulkarni

Screened

Intern AI/ML Researcher specializing in computer vision and data engineering

Palo Alto, CA

1y exp

TieSet

UCLA

“Built a production-oriented multimodal RAG "Fix Assistant" with FastAPI, Tavily search, BM25 + cross-encoder reranking, and a local Phi-3.5 model, emphasizing strict grounding and fallback/verification modes to prevent hallucinations. Also has hands-on federated learning experience using STADLE to orchestrate edge-node training and aggregation for EV telemetry data, plus experience communicating AI results to non-technical stakeholders (traffic RL/congestion outcomes).”

AWS, Bash, C, C++, CI/CD, Computer Vision, +128 more

Fangjian Xiong

Screened

Junior Machine Learning Engineer specializing in NLP and biomedical entity extraction

Boston, MA

2y exp

Northeastern University

Northeastern University

“Built and deployed a production LLM-powered biomedical knowledge extraction pipeline that processed millions of papers to identify tools/techniques and produce a unified knowledge graph via active learning NER (Prodigy + spaCy transformers) and entity linking (Bio-tools/Wikidata). Addressed hard NLP engineering challenges like WordPiece span-offset alignment and scaled inference over ~1.5M documents using batching/caching, containerized services, async workers, and orchestration with Prefect/Airflow.”

AI Agents, AWS, BigQuery, C#, C++, Data Preprocessing, +94 more

Hadi Jaffery

Screened

Junior Data Engineer specializing in Snowflake and investment data platforms

Boston, MA

3y exp

Liberty Mutual

University of Maryland, College Park

“Private markets/private credit data engineer owning core Snowflake/AWS data infrastructure (S3 → ActiveBatch → Snowflake) with automated iceDQ quality checks and curated datasets for internal Power BI/React reporting. Drove major reliability and delivery improvements, including cutting DB CI/CD deploy time 50% and reducing downstream table errors by 90%+, and also built an internal React/FastAPI app to visualize the team’s data infrastructure in an ambiguous early-stage environment.”

AWS, AWS Lambda, CI/CD, C, C++, Data Engineering, +84 more

Sai Vardhan Reddy

Screened

Mid-Level Data Engineer specializing in cloud data platforms and governed analytics

5y exp

Optum

University of Central Missouri

“Data engineer with Optum experience building end-to-end healthcare data pipelines for HL7/FHIR, processing millions of records daily across Kafka streaming and Databricks/Spark batch. Strong focus on data quality (schema enforcement/validations), reliability (Airflow monitoring/alerts), and analytics-ready serving in Snowflake powering Power BI/Tableau, with CI/CD via Git and Jenkins.”

AWS, Amazon EC2, AWS Lambda, AWS Glue, Amazon S3, Amazon Kinesis, +94 more

Sanketh Koritikanti

Screened

Mid-level Full-Stack Python Developer specializing in cloud, data engineering, and AI/ML

Washington, USA

4y exp

Fannie Mae

St. Francis College

“Full stack Python developer who actively integrates AI coding assistants into day-to-day engineering work, including code generation, debugging, testing, and documentation. Has also coordinated multi-agent workflows across backend, frontend, testing, and code review, showing an applied, productivity-focused approach to AI-enabled software delivery.”

Python, SQL, JavaScript, Scala, HTML, CSS, +103 more

Vinodini Bassetti

Screened

Entry Data Scientist specializing in data engineering and automotive analytics

Bangalore, India

1y exp

Tata Elxsi

University of Cincinnati

“Frontend-focused candidate with hands-on experience building React and TypeScript dashboards for searching, filtering, and analyzing large datasets in real time. Demonstrates practical performance tuning skills using React DevTools, memoization, debouncing, and pagination, and has also built a Mapbox-based location data dashboard with interactive markers and popups.”

Python, SQL, PySpark, Shell Scripting, Git, GitHub, +73 more

Shanmukha Jayavarapu

Screened

Mid-level AI/ML Engineer specializing in fraud detection and healthcare predictive analytics

Missouri, USA

4y exp

KPMG

University of Central Missouri

“Built and deployed a production LLM-powered calorie-counting chatbot that turns plain-English meal descriptions into normalized food entities, quantities, and calorie estimates using a hybrid transformer + rule-engine pipeline. Emphasizes reliability with schema/constraint guardrails, confidence-based routing (including embedding similarity search fallbacks), and strong observability/metrics (hallucination rate, calibration, latency, cost). Partnered closely with nutritionists to encode domain standards into mappings and validation logic.”

Python, PyTorch, TensorFlow, Scikit-learn, XGBoost, LightGBM, +97 more

Manikanta Kadiyam

Screened

Mid-level Applied AI Engineer specializing in agentic LLM workflows

Irving, TX

5y exp

Verizon

University of Houston

“Master’s-in-Data-Science candidate (UHV) with 4+ years in AI engineering building production LLM and multimodal systems. Designed an LLM-powered workflow automation platform using RAG over vector stores with guardrails (schema/output validation, fallbacks) and a rigorous evaluation/monitoring framework including drift tracking and shadow deployments. Experienced orchestrating large-scale vision-language pipelines with Airflow and Kubernetes (OCR, distributed training) and partnering with non-technical ops stakeholders to cut cycle time and reduce errors.”

AI agents, LangChain, LlamaIndex, Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), Embeddings, +103 more

Saloni Patadia

Screened

Mid-level Machine Learning Engineer specializing in LLM systems and healthcare data automation

California, USA

2y exp

Prime Healthcare

USC

“React performance-focused engineer who contributed performance patches back to an open-source context+reducer state helper after profiling and fixing excessive re-renders in an enterprise project management platform at Easley Dunn Productions. Also built an end-to-end LLM-driven pipeline at Prime Healthcare to normalize millions of supply-chain records, reducing defects by 80% and saving 160+ hours/month.”

LangChain, LlamaIndex, FAISS, Vector Search, Semantic Search, Prompt Engineering, +100 more

Pooja Murigappa

Screened

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps in Financial Services

Austin, TX

5y exp

Charles Schwab

University of Central Missouri

“ML/LLM engineer at Charles Schwab who built a production loan-advisor chatbot integrated with internal knowledge and loan-calculator APIs, adding strict numeric validation to prevent rate hallucinations and optimizing context to control costs. Also runs ~40 Airflow DAGs orchestrating retraining/ETL/drift monitoring with an automated Snowflake→SageMaker→auto-deploy pipeline, and uses rigorous testing plus canary rollouts tied to business metrics and compliance constraints.”

Amazon DynamoDB, Apache Airflow, Apache Kafka, Apache Spark, AWS, AWS Glue, +183 more

Ruijing Wang

Screened

Intern Data Scientist specializing in healthcare AI and experimentation

Boulder, CO

1y exp

EchoPlus AI

Stevens Institute of Technology

“Human-AI Design Lab practitioner who productionized a wearable-health anomaly detection system by evolving a standalone autoencoder into a hybrid autoencoder + GPT-based approach, backed by PySpark ETL and MLOps on AWS SageMaker/MLflow. Also has applied LLM troubleshooting experience (fine-tuned FLAN-T5 summarization) and partnered with BI teams to run A/B tests and improve retention via feature stores and experimentation.”

Python, Pandas, Scikit-Learn, PyTorch, TensorFlow, SQL, +97 more
