Vetted Data Cleaning Professionals

Pre-screened and vetted.

AG

Mid-Level Game Designer specializing in live ops and content design

Orlando, Florida3y exp
Electronic ArtsFull Sail University

Game/content designer with experience at EA and on a mobile title (Ultimate Fishing), owning PvP mode UX/gameplay improvements and building UE5 Blueprint systems (state machine + modular weapon/ability architecture) to support multi-developer teams. Also designed College Football 2025/2026 challenge content grounded in real football culture and research, improved broken data-driven toolsets via spreadsheet formula work, and partnered with economy to rebalance rewards by rapidly shipping additional challenges.

View profile
HC

Mid-level Data Engineer specializing in cloud data platforms and scalable ETL pipelines

USA, USA3y exp
HCLTechUniversity of New Haven

Data engineer (~4 years) with full-stack delivery experience (Next.js App Router/TypeScript + React) building a real-time operations monitoring dashboard backed by Kafka and orchestrated data pipelines. Strong production focus: Airflow + CloudWatch monitoring, automated Python/SQL validation (99.5% accuracy), and CI/CD with Jenkins/Docker; has delivered measurable improvements in latency, pipeline reliability, and query performance (Postgres/Redshift).

View profile
SP

Mid-level Data Engineer specializing in real-time streaming and cloud data platforms

New York, NY4y exp
Wells FargoUniversity of Birmingham

Data engineer with Wells Fargo experience owning an end-to-end lakehouse ETL pipeline on Databricks/Azure Data Factory, processing ~480GB daily and implementing robust data quality/reconciliation across 40+ tables to reach ~99.3% reliability. Strong in performance optimization (cut runtime 5.5h→3.8h), CI/CD and monitoring, and resilient external/API ingestion with retries, schema validation, and backfills.

View profile
Srikanth Reddy - Mid-level AI/ML Engineer specializing in GenAI and financial risk & compliance analytics in Plainsboro, NJ

Mid-level AI/ML Engineer specializing in GenAI and financial risk & compliance analytics

Plainsboro, NJ7y exp
State StreetWilmington University

Built and deployed a production LLM-powered financial risk and compliance platform to reduce manual trade exception handling and speed up insights from regulatory documents. Implemented a LangChain multi-agent workflow with structured/unstructured data integration (Redshift + vector DB) and emphasized hallucination reduction for regulatory safety using Amazon Bedrock. Strong MLOps/orchestration background across Kubernetes, Airflow, Jenkins, and monitoring/testing with MLflow, Evidently AI, and PyTest.

View profile
Laasya Muktevi - Intern Machine Learning Engineer specializing in forecasting, NLP, and RAG systems in San Jose, CA

Intern Machine Learning Engineer specializing in forecasting, NLP, and RAG systems

San Jose, CA5y exp
Featurebox AICalifornia State University, Long Beach

Intern who built and deployed a production LLM-powered contract analysis system for finance teams: Azure Document Intelligence for text/table extraction plus Gemini prompting to surface key terms and risks via an async API and simple UI. Emphasizes reliability in production with fallbacks, guardrails against hallucinations, and operational concerns like latency/cost/versioning, delivering summaries in under 30 seconds instead of hours.

View profile
SM

Surya Mahesh

Screened

Mid-level Software Engineer specializing in backend and real-time automotive systems

India3y exp
Bosch Global SoftwareCal State Long Beach

Hands-on ML practitioner who built and deployed an end-to-end phishing email classifier (CLI + simple web app), achieving 98% accuracy and reducing manual security triage. Emphasizes production reliability through input validation, graceful failure modes, monitoring/logging, and iterative error analysis, with experience hardening pipelines against messy backend/database data using fallbacks and idempotent processing.

View profile
HJ

Mid-level Full-Stack Software Engineer specializing in AI and data applications

Remote4y exp
Yiddish Arts and Academics Association of North AmericaUniversity of Texas at Dallas

Analytics-focused candidate with experience building SQL/Python pipelines and dashboards for donor, campaign, and website performance reporting. They have worked with messy multi-source data, standardized metric definitions, and delivered automated reporting that reportedly reduced manual effort by about 80%.

View profile
Atharv Sankpal - Mid-level Data Analyst specializing in financial and healthcare analytics in Baltimore, MD

Mid-level Data Analyst specializing in financial and healthcare analytics

Baltimore, MD4y exp
AIGUMBC

Analytics professional with experience at JPMorgan and Deloitte, focused on financial and risk data. They stand out for building scalable SQL/Python data pipelines, KPI and forecasting dashboards, and retention/cohort metrics that improved reporting reliability, forecast accuracy, and planning speed.

View profile
SG

Mid-level Data Analyst specializing in business intelligence and cloud data platforms

Stamford, CT4y exp
Franklin TempletonUniversity of Bridgeport

Healthcare analytics professional with TCS/Humana experience turning messy claims and eligibility data into reliable reporting assets using SQL and Python. They combine strong data engineering and analytics execution with stakeholder management, including automating monthly claims reporting from half a day to under 5 minutes and driving a provider outreach effort that reduced claim rejection rates by about 20%.

View profile
JS

Jeevan Satish

Screened

Mid-level Business Analyst specializing in healthcare and enterprise technology

San Jose, CA4y exp
UnitedHealth GroupDrexel University

Analytics professional with healthcare experience at United Health Group, focused on turning messy claims and transaction data into reliable reporting assets. They combine SQL, Python, and Power BI to automate analysis, define operational KPIs, and build dashboards that improved stakeholder visibility and helped reduce processing time by about 22%.

View profile
NS

Nisarg Shah

Screened

Junior Software Engineer specializing in data, systems, and AI engineering

Arizona, USA2y exp
Arizona State UniversityArizona State University

Early-career/new-grad candidate who built TrendScout AI, an evidence-first market intelligence agent that ingests messy news, extracts entities/events, builds a Neo4j knowledge graph, and answers questions via RAG with citations. Achieved ~95% retrieval relevance by combining ChromaDB semantic search with graph-based retrieval and validating outputs through human evaluation and guardrails to prevent hallucinations.

View profile
SR

Senior Full-Stack Software Engineer specializing in AI agents and data platforms

Remote7y exp
AT&TCalifornia State University, Los Angeles

Full-stack and AI-focused builder who has shipped both customer-facing personalization at AT&T and internal LLM-powered automation/agent systems in startup environments. Stands out for combining TypeScript-heavy engineering rigor with practical AI orchestration, evaluation, and measurable business impact—from reducing support escalation through personalization to saving 10-11 hours per week by automating fragmented operational workflows.

View profile
MN

Madhuri Naik

Screened

Mid-level Data Scientist specializing in predictive analytics and LLM-powered data pipelines

Buffalo, NY3y exp
University at BuffaloUniversity at Buffalo

Early-career engineer from BNP Paribas who drove a large-scale observability modernization—selecting and implementing Prometheus/Grafana for a 2000+ server estate, then productionizing it on Kubernetes via Docker/Jenkins. Known for hands-on demos, strong documentation/templates, and pragmatic troubleshooting (including custom Python metrics) that improved visibility and cut debugging time by ~60%.

View profile
AK

Mid-level AI/ML Engineer specializing in healthcare NLP and MLOps

USA4y exp
CignaTexas Tech University

ML/AI engineer with healthcare payer experience (Signal Healthcare, Cigna) who has shipped production fraud/claims prediction systems using Python/TensorFlow and exposed them via FastAPI/Flask microservices integrated with EHR and Salesforce. Emphasizes operational reliability and trust—Airflow-orchestrated pipelines with data quality gates plus SHAP-based interpretability, A/B testing, and drift/debug workflows—backed by reported outcomes of 22% lower false payouts and 17% higher model accuracy.

View profile
AP

Mid-Level Software Engineer specializing in FinTech and cloud microservices

Remote, USA3y exp
AssurantMidwestern State University

Backend/platform engineer with hands-on ownership of high-stakes data migrations in regulated domains (core banking and insurance), including a Python ETL framework that migrated 100,000+ customer records within a cutover window. Strong DevOps/GitOps background deploying Python and Spring Boot microservices to Kubernetes with Helm and ArgoCD, plus real-time Kafka transaction streaming for fraud/analytics with reliability patterns (DLQs, retries, partition tuning).

View profile
HK

Mid-level Data Analyst specializing in cloud ETL, BI, and machine learning

Texas, 752235y exp
UnitedHealth GroupUniversity of Texas at Arlington

Data/ML practitioner with experience at UnitedHealth Group building a fraud claims detection solution combining structured claims data and unstructured notes, validated with compliance stakeholders to improve actionable accuracy. Also applied embeddings, vector databases, and fine-tuned language models in a Bank of America capstone to detect threats/anomalies in financial documents, with production-minded Python ETL workflows using Airflow.

View profile
KE

Senior Data Scientist specializing in NLP and explainable machine learning

8y exp
Miro HealthRensselaer Polytechnic Institute

NLP/ML practitioner who built an explainable, clinician-aligned system to detect cognitive decline (Alzheimer’s/stroke-related) from audio responses, achieving 97% accuracy on only a few hundred data points. Also has experience with healthcare claims entity resolution and prototyped a word2vec-based patent search vector database in Elasticsearch, with strong emphasis on testing, interpretability, and scalable Python data workflows.

View profile
UO

Principal Data Scientist specializing in Generative AI, NLP, and MLOps

San Francisco, CA12y exp
CognizantUniversity at Buffalo

ML/NLP practitioner with banking experience (M&T Bank) who has built a GPT-4 RAG system using LangChain and Pinecone to connect unstructured customer data with internal knowledge bases, improving accuracy and reducing manual lookup time by 50%+. Strong in entity resolution and productionizing scalable Python data workflows, including major performance wins by migrating bottleneck joins from Pandas to Dask.

View profile
KT

Kavita Tamire

Screened

Mid-level Data Engineer specializing in AWS cloud data platforms

California, USA3y exp
Charter CommunicationsUniversity of South Florida

Data engineer with Charter Communications experience modernizing large-scale AWS data lake pipelines: ingesting S3 data, validating against legacy systems, transforming with PySpark/Spark SQL, and serving via Iceberg/Delta tables. Worked at 50M–300M record scale, delivered >99.5% data match, and built monitoring/alerting (CloudWatch/SNS) plus retry orchestration (Step Functions) and data quality gates (Great Expectations).

View profile
DP

Mid-level AI/ML Engineer specializing in LLMs, RAG, and enterprise MLOps

Baltimore, MD4y exp
CVS HealthUniversity of Maryland, Baltimore County

Backend engineer who built an AI-driven "Smart Feedback Analyzer" API (Flask → FastAPI) that processes user feedback with NLP (Hugging Face + OpenAI) and returns structured insights. Demonstrates strong production-minded architecture: stateless services, Cloud Run + Docker deployment, Redis/Celery background processing, and Postgres/SQLAlchemy performance tuning (EXPLAIN ANALYZE, indexing, N+1 fixes), plus multi-tenant data isolation via JWT/API-key derived tenant IDs.

View profile
Fatemeh Taghvaei - Junior AI/ML Engineer and Instructor specializing in deep learning, computer vision, and NLP in Chicago, IL

Junior AI/ML Engineer and Instructor specializing in deep learning, computer vision, and NLP

Chicago, IL2y exp
National Louis UniversityUniversity of Illinois Chicago

Computer-vision practitioner and educator who built a real-time license plate recognition system (OpenCV/Python + KNN) optimized to run on a Raspberry Pi with camera integration. Also designs hands-on deep learning coursework, incorporating recent transformer-based vision research (Vision Transformers) into practical labs on real datasets.

View profile
Nisarg Shah - Junior Machine Learning Engineer specializing in geospatial analytics and computer vision in Tempe, Arizona

Nisarg Shah

Screened

Junior Machine Learning Engineer specializing in geospatial analytics and computer vision

Tempe, Arizona1y exp
Arizona State UniversityArizona State University

Built and evolved a geospatial ETL + API platform that processes pixel-wise satellite imagery in PostgreSQL/PostGIS into low-latency farm-level time-series metrics for an interactive dashboard, using precomputed hotspot analysis to reduce latency by 75–80%. Experienced in FastAPI-style API contract design (OpenAPI), caching, server-side filtering/compression, and production-minded security patterns (RBAC, session-derived authorization, password hashing) with disciplined rollback/versioning practices.

View profile
Ashrita Mishra - Mid-level Data Analyst specializing in analytics, ETL, and cloud data platforms in Jersey City, NJ

Mid-level Data Analyst specializing in analytics, ETL, and cloud data platforms

Jersey City, NJ4y exp
CitigroupPace University

Data analyst with 4 years of experience spanning banking and retail/marketing analytics. Has hands-on experience building churn analytics pipelines in SQL and Python, optimizing large-query performance, and turning stakeholder-aligned metrics into recurring dashboards and business actions.

View profile
YA

Mid-level Data Analyst specializing in BI, analytics automation, and cloud data platforms

Charlotte, NC4y exp
SkyWest AirlinesUniversity of North Carolina at Charlotte

Analytics professional with hands-on experience building SQL/Python pipelines, customer ID mapping logic, and self-serve BI dashboards across marketing/CRM and regulated aviation reporting environments. Particularly strong in turning messy multi-source data into trusted reporting assets, with repeated claims of major efficiency gains, faster decision-making, and high-confidence stakeholder adoption.

View profile

Need someone specific?

AI Search