Rakesh Thota
Mid-level Data Engineer specializing in multi-cloud real-time data pipelines
Molina Healthcare · University at Buffalo · California, USA · 5 Years Experience · Mid Level · Works Remote
About
Data engineer with healthcare/clinical trial domain experience who owned a 100TB+/month AWS pipeline end-to-end (Glue/S3/Redshift/Airflow) and drove measurable outcomes (20% lower latency, 99.9% reliability, 40% less manual reporting). Also built production data services and API-based ingestion on GCP (Cloud Run/Functions/BigQuery) with strong validation, versioning, and safe migration practices, and launched an early-stage RAG solution (LangChain + GPT-4) for researchers.
Owned end-to-end clinical trial data pipeline processing 100TB+ monthly
Implemented schema validation/data quality checks and schema drift detection to prevent silent downstream failures
Improved reporting latency by 20% and reduced manual reporting effort by 40%
Achieved 99.9% pipeline processing reliability via Airflow/MWAA tuning, alerting, retries, and idempotent design
Designed resilient API ingestion with response validation, quarantine patterns, and alerting to prevent downstream table corruption
Shipped production Flask REST APIs on Cloud Run with OAuth2, caching, BigQuery optimization, CI/CD, and URL-based versioning
Managed breaking API changes by running parallel v1/v2 endpoints and deprecating safely with sunset headers
Stood up an early-stage RAG system with ambiguous requirements for non-technical clinical researchers (LangChain + GPT-4), with monitoring and retrieval-quality evaluation
Experience
Data Engineer, Molina Healthcare · Jul 2025 – Present
ML Application Engineer, Egen · Jul 2022 – Dec 2023
Associate Data Engineer, Persistent Systems · Jul 2020 – Jun 2022
Education
University at Buffalo · Master's, Computer Science (AI/ML) (2025)