Vetted Data Cleaning Professionals

Pre-screened and vetted.

JL

Joseph Lin

Screened ReferencesModerate rec.

Intern Software Engineer specializing in full-stack development and applied AI

New York, NY0y exp
Real Value CapitalNYU

Internship experience building an end-to-end medical AI pipeline that extracts and normalizes messy medical PDFs, fine-tunes BioBERT to classify tumor-related statements (including negation/ambiguity handling), and integrates image-model outputs (MedSAM/GroundingDINO) for tumor localization and classification. Also worked on an LLM/RAG system to draft IPO prospectuses using retrieved regulatory/financial sources (including SEC EDGAR) with structured prompts to reduce hallucinations.

View profile
SM

Syed Muhammad Aun Jafri

Screened ReferencesStrong rec.

Mid-level GTM Strategy & RevOps professional specializing in sales operations

New York City, NY5y exp
MotiveBilkent University

Startup operator with experience spanning Series D scale-up GTM strategy at Motive and earlier-stage marketplace operations at Fleek and BridgeLinx. Stands out for building operating infrastructure, redesigning sales compensation, and automating leadership reporting with measurable impact, including 12% sales efficiency gains, 40% less reporting overhead, and 9% better rep performance.

View profile
VK

Vamsi Koppala

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems

Barrington, IL4y exp
ComericaTexas Tech University

LLM/ML engineer who has shipped an enterprise RAG-based Q&A system (LangChain/LlamaIndex, FAISS + Azure Cognitive Search, GPT-3.5/4 via OpenAI/Azure OpenAI) to production on Docker + Kubernetes/OpenShift, tackling hallucinations, retrieval quality, latency/cost, and RBAC/IAM security. Also partnered with operations leaders to turn manual reporting into an LLM-powered summarization and forecasting dashboard driven by real KPIs and iterative stakeholder feedback.

View profile
AK

Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps

San Francisco Bay Area, CA5y exp
VerizonCalifornia State University

Built a production RAG-based “unified assistant” that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.

View profile
KP

Kavya Paluvai

Screened

Mid-level Data Scientist specializing in fraud detection and healthcare ML

North Carolina, USA4y exp
Wells FargoUniversity of North Carolina at Charlotte

Applied NLP/ML in healthcare and financial services, including fine-tuning BERT on unstructured EHR text and building embedding-based similarity search for clinical concepts. Also redesigned a Wells Fargo fraud detection data pipeline using modular Python + AWS Glue/Step Functions, cutting runtime ~40% with improved monitoring and reliability.

View profile
AG

Anagha Ghate

Screened

Mid-level Backend Software Engineer specializing in FinTech microservices

USA4y exp
JPMorgan ChaseBinghamton University

Engineer with production experience in both high-throughput banking risk systems and LLM agent platforms. Built a real-time transaction risk scoring middleware at JPMorgan Chase (1M+ requests/day) emphasizing HA, observability, and audit/PII compliance, and also architected multi-step LLM agents with strict schema-based tool calling, evaluation loops, and safety guardrails for messy enterprise data.

View profile
Cary Burdick - Senior Data Scientist specializing in data engineering and analytics in Chicago, IL

Cary Burdick

Screened

Senior Data Scientist specializing in data engineering and analytics

Chicago, IL6y exp
USDAAuburn University

Data/NLP practitioner with experience in both financial services (Truist) and government (USDA), including an NLP-driven analysis of EU regulations to anticipate US regulatory focus and a major redesign/cleaning of complex pathogen lab-test public datasets. Built production data-quality pipelines with Dagster, Pandera, and Azure Synapse, and is comfortable validating hypotheses with historical backtesting and SME-driven quality controls.

View profile
HIMANSHU SHARMA - Mid-level AI Solutions Engineer specializing in enterprise GenAI and automation in Orlando, FL

Mid-level AI Solutions Engineer specializing in enterprise GenAI and automation

Orlando, FL6y exp
Kore.aiUniversity of South Florida

Built and shipped multiple production LLM/agentic systems, including an agentic RAG NL-to-SQL analytics app that cut manual reporting from 9 hours/week to 15 minutes by grounding on schema-aware retrieval and robust fallback/monitoring. Also implemented a LangChain supervisor-orchestrated enterprise IT automation agent that routes requests for search, identity validation, and action execution, and created a RAG search tool spanning Jira/Confluence/SharePoint for operations stakeholders.

View profile
Sagar Patel - Mid-level Full-Stack Python Developer & Data Engineer specializing in ETL and web platforms in Arizona, United States

Sagar Patel

Screened

Mid-level Full-Stack Python Developer & Data Engineer specializing in ETL and web platforms

Arizona, United States6y exp
GoDaddyCampbellsville University

Backend engineer who led major modernization efforts at GoDaddy, migrating legacy Perl services to Python/FastAPI with an incremental rollout strategy, containerization (Docker/Kubernetes), and CI/CD (Jenkins/GitHub Actions). Strong focus on secure, reliable API design (JWT, RBAC, PostgreSQL row-level security), rigorous testing, and data integrity—plus experience hardening an automated web-scraping pipeline against changing site structures and downtime.

View profile
Kavyashree Sudhakar - Junior Business Analyst specializing in operations and banking workflows in Tempe, AZ

Junior Business Analyst specializing in operations and banking workflows

Tempe, AZ2y exp
AramarkArizona State University

Entry-level data/business analytics candidate with hands-on experience building SQL and Python workflows to clean fragmented subcontractor data, generate risk scores, and feed Power BI dashboards. Also demonstrated strong operational analytics impact at Amazon by defining and operationalizing process-quality metrics that reduced CPO rate from roughly 10% to 0.6%.

View profile
shivapriya pillalamarri - Mid-level AI/ML Engineer specializing in financial analytics and production ML systems in Boston, MA

Mid-level AI/ML Engineer specializing in financial analytics and production ML systems

Boston, MA4y exp
KenshoUniversity of New Haven

Analytics candidate with experience in financial transaction and fraud detection projects, combining SQL data preparation, Python-based automation, and dashboarding. They have owned projects from stakeholder alignment and metric definition through rollout, with emphasis on reducing false positives, improving operational efficiency, and making analytics outputs easy for business teams to adopt.

View profile
KD

Mid-level Business Analyst specializing in banking analytics and data engineering

Hollywood, FL4y exp
SantanderIndiana University Bloomington

Analytics professional at Santander Bank with hands-on experience building SQL and Python workflows for transaction reporting, reconciliation, and monitoring across messy multi-source financial data. They combine strong data validation and exception-handling practices with stakeholder-friendly dashboards, and also bring digital analytics experience from a Google Analytics UI optimization project focused on funnel drop-off and engagement.

View profile
Mahima Baddur - Mid-level Business Analyst specializing in data analytics and enterprise operations

Mahima Baddur

Screened

Mid-level Business Analyst specializing in data analytics and enterprise operations

5y exp
Johnson & JohnsonWebster University

Business/data analyst with Johnson & Johnson supply chain experience, focused on turning messy SAP, legacy, and Excel data into validated reporting datasets and Power BI dashboards. Stands out for combining SQL and Python automation with strong KPI design around inventory planning, inventory turnover, and demand analysis in a complex enterprise environment.

View profile
HA

Hassan Abrar

Screened

Mid-level Analytics Professional specializing in marketing and business intelligence

Frisco, TX5y exp
TIAAPurdue University

Analytics professional at TIAA with hands-on experience combining SQL, Python, and statistical modeling to unify complex marketing, product, finance, and customer datasets. Has worked on advisor-tool adoption analysis, 10-year wealth diagnostics, forecasting, cohort analysis, and escalation-risk modeling, with findings used by marketing and contact-center stakeholders.

View profile
SK

Mid-level Data Analyst and Data Engineer specializing in healthcare and financial analytics

3y exp
UnitedHealth GroupUniversity of North Texas

Analytics professional with healthcare and operations experience who turns messy enterprise data from platforms like Teradata, GCP, SQL Server, and Snowflake into trusted reporting layers and reproducible analysis workflows. They combine SQL, Python, PySpark, Power BI, and Tableau to improve reporting accuracy and performance, including a 30% dashboard refresh improvement and 20-25% accuracy gains in healthcare reporting.

View profile
LR

Mid-level Business Analyst specializing in BI and analytics

New York, NY3y exp
DellSacred Heart University

Analytics professional with Dell experience unifying global online sales, web analytics, SAP, and planning data across 20+ countries into scalable reporting pipelines and Power BI dashboards. Stands out for combining deep SQL/ETL work with Python automation, KPI design, and experimentation—delivering measurable outcomes like 80% less manual effort, a 2% conversion lift worth millions, and faster business decision-making.

View profile
SR

Mid-level Generative AI Engineer specializing in LLMs and enterprise AI

Texas, USA5y exp
PNCUniversity of Texas at Arlington

Built and owned an enterprise LLM/RAG document intelligence platform for PNC Financial Services in a compliance-heavy environment, focused on grounded answers over internal finance and policy documents. Stands out for combining GenAI product delivery with production engineering discipline, delivering 60% faster document review and materially better answer quality while creating reusable FastAPI-based AI services for multiple teams.

View profile
Anudeep Eloori - Mid-level Software Developer specializing in full-stack enterprise applications in USA

Mid-level Software Developer specializing in full-stack enterprise applications

USA3y exp
EpsilonUniversity of South Florida

Software engineer with experience building and iterating high-volume Spring Boot microservices on AWS (Docker/Kubernetes) and integrating with React front-ends. Also delivered an LLM-powered document summarization system using embeddings + retrieval (RAG) with grounding/guardrails and built evaluation loops that directly drove retrieval and chunking improvements. Has scaled Kafka-based pipelines processing millions of messy financial/infrastructure records with reliability and cost/latency tradeoff management.

View profile
Namratha Medaboina - Mid-level Software Engineer specializing in backend systems for healthcare and FinTech

Mid-level Software Engineer specializing in backend systems for healthcare and FinTech

3y exp
CVS HealthUniversity at Buffalo

Built Python-based clinical data processing workflows at CVS Health, automating ingestion, validation, transformation, and ML prediction across multiple healthcare systems. Stands out for combining AI-assisted development with rigorous human review, validation checkpoints, and production monitoring in regulated healthcare environments, including a reported ~26% efficiency improvement.

View profile
RC

Mid-level operations manager specializing in business development and project delivery

Córdoba, Argentina6y exp
FreelanceNational University of Córdoba

Project/operations leader who has grown from owning small implementation projects to running PMO governance on a multi-year SAP migration at Accenture, and now independently delivers end-to-end automation, CRM, and workflow projects for 4-5 freelance clients across varied industries. Stands out for blending hands-on systems building with strong executive reporting, risk management, and client enablement.

View profile
SR

Sahithi Reddy

Screened

Mid-level Machine Learning Engineer specializing in LLM-powered products

Dallas, TX4y exp
VerizonUniversity of Massachusetts Dartmouth

Verizon engineer who productionized an LLM-based personalization capability for a customer-facing digital platform, owning the path from success metrics through scalable APIs, A/B validation, and post-launch monitoring (latency/accuracy/drift). Experienced in diagnosing and fixing real-time LLM/RAG workflow issues under peak load, and in enabling adoption via tailored technical demos/workshops and sales support materials.

View profile
PK

Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI

USA4y exp
GE HealthCareFranklin University

LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.

View profile
AS

Aditya Sairam

Screened

Mid-Level Software Engineer specializing in cloud data platforms and AI search

Troy, MI6y exp
Robotics Technologies LLCCleveland State University

Open-source JavaScript contributor focused on data visualization, extending Chart.js/React with custom plugins for real-time streaming dashboards. Designed an end-to-end telemetry pipeline using Apache Kafka and Azure Cosmos DB, optimizing partitioning, batching, caching, and client throttling to keep latency low and support thousands of concurrent users. Demonstrates strong ownership in fast-changing environments, including building full-stack AI applications and ingestion/ETL pipelines at Robotics Technologies LLC.

View profile
PV

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

New York City, NY6y exp
AvanadeUniversity of North Texas

Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.

View profile

Need someone specific?

AI Search