Vetted ETL Professionals

Pre-screened and vetted.

NA

Niveditha A

Screened

Mid-level AI/ML Engineer specializing in healthcare ML and LLM/RAG systems

USA4y exp
UnitedHealth GroupBowling Green State University

AI/LLM engineer with recent production experience at UnitedHealth Group building an end-to-end RAG system over structured EMR data and unstructured clinical notes, including evidence retrieval, GPT/LLaMA-based reasoning, and a validation layer for reliability. Strong in orchestration (Kubeflow/Airflow/MLflow), prompt engineering for noisy healthcare text, and rigorous evaluation/monitoring with gold-standard benchmarking, plus close collaboration with clinical operations stakeholders.

View profile
PT

Phyo Thant

Screened

Intern Robotics/ML Engineer specializing in autonomy, networking, and systems software

San Diego, CA2y exp
CaltransUC San Diego

Robotics software engineer who built a lightweight, ROS-free distributed control and telemetry stack for a Caltrans long-range culvert inspection robot. Strong in integrating heterogeneous hardware (UART motor controllers, Ethernet sensors, MJPEG cameras) and delivering real-time operator data via FastAPI/WebSockets, including reverse-engineering undocumented protocols and debugging network-induced latency with control-loop redesign.

View profile
HR

Mid-level Data Engineer specializing in scalable ETL, streaming analytics, and cloud data platforms

Remote, USA7y exp
Dreamline AICalifornia State University, Fullerton

At Dreamline AI, built and productionized an AWS-based incentive intelligence platform that uses Llama-2/GPT-4 to extract eligibility rules from unstructured state policy documents into structured JSON, then processes them with Glue/PySpark and serves results via Lambda/SageMaker/API Gateway. Designed state-specific ingestion connectors plus schema validation and automated checks/alerts to handle frequent policy/format changes without breaking the pipeline, and partnered with business/analytics stakeholders to deliver interpretable eligibility decisions via explanations and dashboards.

View profile
RK

Mid-level AI/ML Engineer specializing in MLOps and LLM-powered applications

Mountain View, CA5y exp
IntuitUniversity of Central Missouri

AI/ML engineer with production experience building a RAG-based internal analytics assistant (Databricks + ADF ingestion, Pinecone vector store, LangChain orchestration) deployed via Docker on AWS SageMaker with CI/CD and MLflow. Strong focus on real-world constraints—latency/cost optimization (LoRA ~60% compute reduction), hallucination control with citation grounding, and enterprise security/governance. Previously at Intuit, delivered an interpretable churn prediction system (PySpark/Databricks, Airflow/Azure ML) that improved retention targeting ~12%.

View profile
PM

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps in Financial Services

Austin, TX5y exp
Charles SchwabUniversity of Central Missouri

ML/LLM engineer at Charles Schwab who built a production loan-advisor chatbot integrated with internal knowledge and loan-calculator APIs, adding strict numeric validation to prevent rate hallucinations and optimizing context to control costs. Also runs ~40 Airflow DAGs orchestrating retraining/ETL/drift monitoring with an automated Snowflake→SageMaker→auto-deploy pipeline, and uses rigorous testing plus canary rollouts tied to business metrics and compliance constraints.

View profile
LB

Junior Software Developer specializing in full-stack, data platforms, and Azure cloud

California, USA2y exp
Our National ConversationCalifornia State University, Los Angeles

Backend engineer with hands-on experience designing and refactoring scalable Node.js/MongoDB systems and building Python/FastAPI services. Emphasizes production-grade security (JWT, refresh tokens, RBAC, Supabase Auth, RLS) and reliability practices like strong testing, monitoring, and rollback planning, including resolving concurrency and token/validation edge cases.

View profile
SM

Mid-level AI/ML Engineer specializing in GenAI agents, RAG pipelines, and MLOps

USA6y exp
UnitedHealthcareKent State University

AI/ML engineer who built a production RAG-based internal document intelligence assistant (LangChain + Pinecone) to let employees query enterprise reports in natural language. Demonstrated hands-on pipeline orchestration with Apache Airflow and tackled real production issues like retrieval grounding and latency using tuning, caching, and token optimization, while partnering closely with non-technical business stakeholders through iterative demos.

View profile
RW

Ruijing Wang

Screened

Intern Data Scientist specializing in healthcare AI and experimentation

Boulder, CO1y exp
EchoPlus AIStevens Institute of Technology

Human-AI Design Lab practitioner who productionized a wearable-health anomaly detection system by evolving a standalone autoencoder into a hybrid autoencoder + GPT-based approach, backed by PySpark ETL and MLOps on AWS SageMaker/MLflow. Also has applied LLM troubleshooting experience (fine-tuned FLAN-T5 summarization) and partnered with BI teams to run A/B tests and improve retention via feature stores and experimentation.

View profile
HG

Senior Data Engineer specializing in cloud-native data platforms for finance and healthcare

Charlotte, NC4y exp
Bank of AmericaUniversity of Cincinnati

Data engineer/backend data services practitioner with Bank of America experience building real-time and batch transaction-monitoring pipelines and APIs (Kafka + databases, REST/GraphQL). Highlights include a reported 45% response-time improvement through performance optimizations and use of Delta Lake schema evolution plus CI/CD (GitHub Actions/Jenkins) and operational reliability patterns like CloudWatch monitoring and dead-letter queues.

View profile
MV

Senior Data Engineer specializing in cloud data platforms and big data pipelines

Seattle, WA8y exp
SafecoFitchburg State University

Data engineer focused on building reliable, production-grade pipelines and external data collection systems on AWS (S3/Lambda/SQS/Glue/EMR) using PySpark/SQL, serving curated datasets to Snowflake/Redshift for finance and fraud teams. Has operated a large-scale crawler ingesting millions of records/day with anti-bot tactics, schema versioning/quarantine, and CloudWatch/Datadog monitoring, and also shipped a versioned REST API with caching and query optimization.

View profile
AA

Aayush Anand

Screened

Intern Full-Stack/Software Engineer specializing in web apps, cloud, and data/ML systems

New York, NY1y exp
The NorthStar GroupNYU

Built and productionized LLM-driven content intelligence/SEO agents for a high-traffic media platform, automating tagging/summarization/metadata with FastAPI + async orchestration and strict JSON-schema outputs. Demonstrated measurable impact (40% faster publishing, +20% organic traffic in 3 months) and strong reliability practices (offline evals, shadow mode, canaries, fallbacks, idempotency, and monitoring).

View profile
SR

Mid-level Data Engineer specializing in cloud ETL/ELT and big data pipelines

Columbus, OH4y exp
Western Alliance BankUniversity of Missouri-Kansas City

Data engineer focused on production-grade pipelines and data services: ingests millions of records/day into S3, performs SQL/Python quality validation and PySpark/SQL transformations, and serves curated datasets via Athena/Redshift. Has experience hardening external data collection with retries/rate-limit handling and shipping versioned internal data APIs with backward compatibility, monitoring, and CI/CD in early-stage environments.

View profile
NS

Mid-level ML Data Engineer specializing in MLOps and scalable healthcare data pipelines

Boston, MA5y exp
CignaNortheastern University

Data/ML platform engineer with healthcare (Cigna) experience owning an end-to-end pipeline spanning Airflow + Debezium CDC ingestion, PySpark/SQL transformations, rigorous data quality gates, and feature-store/API serving for ML training and inference. Worked at 10+ TB scale and cites a ~30% latency reduction plus stronger reliability via idempotent design, monitoring, and backfill-safe reprocessing; also built pragmatic early-stage data pipelines at Frankenbuild Ventures.

View profile
Ganesh Medepalli - Mid-level Java Developer specializing in Spring Boot microservices and AWS in USA

Mid-level Java Developer specializing in Spring Boot microservices and AWS

USA3y exp
Berkshire HathawayMissouri University of Science and Technology

Backend engineer with primary experience in Java/Spring Boot microservices, AWS (EC2/ECS/Lambda), and CI/CD automation with Jenkins. Supported modernization/migration efforts at Berkshire Hathaway and Citius Infotech by containerizing legacy components with Docker, refactoring services to be stateless, and managing infra changes via Terraform and Git-based workflows; has limited but practical Python API prototyping experience (Flask/FastAPI) and solid conceptual grounding in Kubernetes and Kafka.

View profile
SUMIT MAMTANI - Mid-level Data Scientist specializing in ML, MLOps, and customer analytics in Tempe, AZ

SUMIT MAMTANI

Screened

Mid-level Data Scientist specializing in ML, MLOps, and customer analytics

Tempe, AZ4y exp
QlikArizona State University

ML/NLP practitioner focused on insurance/claims analytics for a large financial firm, working with millions of fragmented structured and unstructured records. Built production-grade pipelines for entity extraction, entity resolution, and semantic search using Sentence-BERT + vector DB, including fine-tuning with contrastive learning (reported ~15% recall lift) and scalable ETL/containerized deployment on Kubernetes.

View profile
Prateek Pravanjan - Junior Machine Learning Engineer specializing in LLM evaluation and GenAI pipelines in Remote

Junior Machine Learning Engineer specializing in LLM evaluation and GenAI pipelines

Remote1y exp
MercorStevens Institute of Technology

LLM/agent engineer who built a production LangGraph multi-agent orchestrator connecting GitHub and APM/observability signals with a chain-of-verification loop for root-cause analysis. Emphasizes pragmatic architecture (start simple with state summaries), performance tuning (async LLM calls, Docker), and rigorous evaluation (LLM-as-judge, adversarial testing, hallucination/instruction adherence metrics, tool-call tracing) while iterating with non-technical stakeholders via A/B testing.

View profile
Pravalika Kasojjala - Mid-level AI/ML Engineer specializing in LLM, RAG/GraphRAG, and fraud analytics in Charlotte, NC

Mid-level AI/ML Engineer specializing in LLM, RAG/GraphRAG, and fraud analytics

Charlotte, NC5y exp
Bank of AmericaUniversity of Wisconsin–Milwaukee

LLM/agent engineer who has deployed a production internal assistant to reduce employee inquiry resolution time while maintaining regulatory compliance. Experienced with RAG, hallucination risk triage, and graph-based orchestration (LangGraph) for enterprise/banking-style workflows, emphasizing schema-validated, citation-backed, tool-constrained agent designs and tight collaboration with non-technical business/compliance stakeholders.

View profile
Saniya Shinde - Mid-level Data Scientist specializing in NLP, LLMs, and RAG systems in Washington, DC

Saniya Shinde

Screened

Mid-level Data Scientist specializing in NLP, LLMs, and RAG systems

Washington, DC4y exp
World BankGeorge Washington University

Built and deployed a production-style vision-language pipeline that generates structured medical reports from chest X-rays using BioViLT embeddings, an image-text alignment module, and BiGPT fine-tuned with LoRA, delivered via Streamlit and hosted on AWS EC2. Also collaborating experience presenting EDA findings, feature importance, and model performance to Ford managers while working with vehicle parts data at Bimcon.

View profile
Ashwini Ramesh Kumar - Junior AI Software Engineer specializing in LLMs, RAG, and agent workflows in Remote

Junior AI Software Engineer specializing in LLMs, RAG, and agent workflows

Remote1y exp
UMass Chan Medical SchoolUniversity of Massachusetts Amherst

Backend/ML-leaning engineer who built a content-based event recommender for FlowMingle using embeddings + HNSW vector search on Google Cloud, with Firebase as the backend and a managed recommendation lifecycle (15 recs/user, daily async generation, weekly deletion) now serving 1500+ users. Also led a cost-driven migration of ConvAI services to Azure AI using parallel request testing from a Unity client, with post-migration monitoring via logs and model evals; contributed to a Massachusetts law-enforcement conversation analysis system by expanding ingestion to PDF/TXT/Excel and multi-file inputs.

View profile
Nikitha Margadi - Mid-level Data Engineer specializing in cloud lakehouse, streaming, and MLOps in Texas, USA

Mid-level Data Engineer specializing in cloud lakehouse, streaming, and MLOps

Texas, USA5y exp
AT&TCal State Fullerton

Data engineer at AT&T focused on large-scale telecom (5G/IoT) data platforms, owning end-to-end pipelines from Kafka/Azure ingestion through Databricks/Delta Lake transformations to serving analytics and ML. Has operated at very high volumes (~50+ TB/day) and delivered measurable performance gains (25–30% faster processing) plus improved reliability via Airflow monitoring, robust data quality checks, and resilient external data collection patterns (rate limiting, retries, dynamic schemas).

View profile
Krishnamraju Penumatsa - Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines in Fort Worth, TX

Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines

Fort Worth, TX6y exp
American AirlinesUniversity of North Texas

Data engineer currently at American Airlines who built and owned end-to-end flight operations and booking data pipelines (batch + real-time) using Azure Data Factory, Kafka, Spark/Databricks, Synapse, and Snowflake—processing hundreds of GBs/day. Strong focus on reliability and data quality (idempotency, checkpointing, retries, validation/alerts) and delivered near-real-time analytics powering Power BI dashboards; previously helped stand up an early-stage data platform at Sysco on AWS (Glue/S3/Redshift) with Airflow and Jenkins CI/CD.

View profile
FS

Firoz Shaik

Screened

Mid-level Data Analyst specializing in business intelligence and customer analytics

4y exp
Molina HealthcareUniversity of Missouri-Kansas City

Healthcare-focused data analyst with hands-on experience at Molina Healthcare building SQL and Python workflows for retention and churn analytics. They combined enrollment, CRM, and claims data into Power BI reporting, automated predictive churn analysis, and tied their work to measurable outcomes including faster processing, better reporting accuracy, and reduced churn.

View profile
Yuchen Wang - Intern Software Engineer specializing in full-stack development and AI/ML in New York, NY

Yuchen Wang

Screened

Intern Software Engineer specializing in full-stack development and AI/ML

New York, NY1y exp
AdasEcoNYU

Built and maintains an AI Finance Tracker end-to-end as a solo full-stack product owner, from Figma designs and React frontend to Flask APIs, Firestore, auth, deployment, and AI insights. Stands out for combining product instinct with pragmatic engineering decisions like pre-aggregating financial data to control LLM costs and adding OCR receipt scanning based on real user feedback.

View profile
Rami Jaloudi - Senior Applications Engineer specializing in legal technology and eDiscovery in New York, NY

Rami Jaloudi

Screened

Senior Applications Engineer specializing in legal technology and eDiscovery

New York, NY16y exp
ConduentNJIT

Early-stage founder candidate exploring an AI-enabled legal tech startup focused on document intelligence, secure workflows, and enterprise automation. Brings a rare blend of technical architecture fluency and product/business thinking, with clear firsthand insight into legal and document-heavy operational pain points.

View profile

Need someone specific?

AI Search