Vetted Data Validation Professionals

Pre-screened and vetted.

RM

Junior Full-Stack Software Engineer specializing in React and AI-powered applications

Bloomington, IN | 4y exp
Indiana University | Indiana University Bloomington

Full-stack/AI-focused builder who shipped a production Career Advisor app using LLMs + RAG + vector DB (React/Node/MongoDB/Claude API) and grew it to 2,000+ users, handling real deployment issues and CI/CD on Vercel/Render. Also developing an AI-powered iOS “3D World Explorer” (text-to-3D), with cloud experience across Azure and AWS (S3/SageMaker/EC2).

SB

Mid-level Data Engineer specializing in scalable pipelines, Spark, and cloud data warehousing

Boston, USA | 3y exp
Fidelity Investments | Northeastern University

Backend/data platform engineer who recently owned an end-to-end large-scale financial data platform delivering real-time decision support for finance and operations. Has hands-on experience modernizing legacy batch pipelines into AWS cloud-native ELT with parallel-run cutovers, strong data quality controls (dbt-style tests, reconciliation), and measurable improvements in runtime, cost, and SLA compliance. Also builds scalable, secure FastAPI microservices using Docker, ALB-based horizontal scaling, Redis caching, and managed auth with Cognito/Supabase plus Postgres RLS.

Avijit Saha

Screened

Junior Software Engineer specializing in cloud-native microservices and AI/ML observability

Bedford, TX | 3y exp
JPMorgan Chase | University of the Cumberlands

Engineer with banking and industrial/IoT experience who has deployed a payment-processing microservice with zero downtime, handling Protobuf schema evolution and sensitive data migration via dual-write/checksum techniques. Demonstrates strong cross-stack troubleshooting (traced intermittent distributed timeouts to a failing ToR switch port) and customer-facing Python ETL customization using plugin-based parsers and Pydantic validation, plus hands-on monitoring/alerting improvements with operators.

Bhuvan Chandi

Screened

Mid-level Data Engineer specializing in AI/ML data platforms

NY, NY | 6y exp
BlackRock | Webster University

Built and productionized an LLM-powered PDF document Q&A system to eliminate manual searching through long documents, focusing on scalability and answer reliability. Implemented semantic chunking (using headings/paragraphs/tables), overlap, and preprocessing/quality checks to reduce hallucinations, and orchestrated the end-to-end pipeline with Airflow using retries, alerts, and parallel tasks.

SS

Mid-level Data Engineer specializing in real-time pipelines and cloud analytics

Chicago, IL | 5y exp
JPMorgan Chase | University of South Dakota

Researcher from the University of South Dakota who built a production medical RAG system to help interpret model predictions by retrieving relevant clinical notes and medical literature, overcoming retrieval accuracy and imaging-dataset challenges through semantic chunking and metadata-driven indexing. Also has hands-on orchestration experience with Airflow and Azure Data Factory, plus a pragmatic approach to LLM evaluation and stakeholder-driven iteration.

Mihir Trivedi

Screened

Junior Machine Learning & Quant Research Engineer specializing in low-latency data and trading systems

New York, NY | 3y exp
Astera Holdings | Columbia University

Applied ML to physical EV fleet systems at ST Labs, building a real-time CNN-LSTM fault prediction pipeline from streaming vehicle telemetry and addressing live data alignment issues via resampling/interpolation and buffered inference. Also developed a V2G/G2V energy transfer algorithm to automate charging/discharging for profit optimization, and made high-impact low-latency pipeline decisions at Astera Holdings using profiling, replay testing, and live A/B validation.

SR

Senior Data Scientist specializing in machine learning and customer analytics

Illinois, USA | 7y exp
Northern Trust | Bradley University

Data/ML practitioner with experience applying NLP and classical ML to large-scale customer data (2B+ records) for segmentation, prediction, and survey-text classification, delivering measurable business impact (~18% engagement efficiency). Has hands-on entity resolution across multi-source datasets and has built embedding-based semantic search using SentenceBERT + a vector database with domain fine-tuning (~20% relevance improvement), plus production workflow experience with Spark/Airflow and cloud tooling (AWS/Azure).

SU

Intern Software Engineer specializing in AWS cloud architecture and GenAI systems

Seattle, WA | 2y exp
Amazon Web Services | San José State University

AWS Solutions Architect intern who advised customers on securing a multi-tenant LLM-based SaaS, including isolation strategy tradeoffs and production guardrails against prompt injection. Has experience investigating a prompt-injection incident using logs/traces and TTP-style documentation, and designing scalable SDK/agent integrations via asynchronous worker architecture with prompt versioning.

AP

Mid-level Machine Learning Engineer specializing in fraud detection and LLM applications

Charlotte, NC | 5y exp
Bank of America | University of North Carolina at Charlotte

Unreal Engine UI engineer focused on scalable, production-ready UI architecture (C++/Slate/UMG/CommonUI) with strong designer enablement via decoupled, interface-driven patterns and MVVM. Demonstrated measurable performance wins: replaced 200+ per-frame Blueprint bindings to cut UI prepass/paint from 4.2ms to 0.5ms and reduced VRAM by ~120MB using texture streaming proxies.

SS

Intern AI/ML Engineer specializing in GenAI pipelines and cloud automation

Tempe, AZ | 1y exp
Catalyst Solutions | Arizona State University

Built and productionized a Python/LLM-based pipeline at Catalyst Solutions to automate healthcare RFP processing, turning unstructured documents into validated JSON/Excel with schema validation, confidence scoring, and human-review routing. Delivered major operational impact (hours-to-minutes processing, ~60% efficiency gain; 50+ RFPs processed) and modernized legacy scripts into a staged, more reliable architecture using incremental refactoring and fallback comparisons.

SK

Mid-level Full-Stack Developer specializing in FinTech and enterprise web platforms

USA | 4y exp
JPMorgan Chase | Christian Brothers University

Financial-services AI engineer who shipped a production investment research assistant using RAG over internal research reports, SEC filings, and meeting transcripts, with a strong emphasis on truthfulness and guardrails. Built a structured evaluation loop (200+ golden test cases, RAG Triad metrics) that directly improved retrieval quality (e.g., fixing year-mismatch retrieval, boosting sensitive-query performance by 18% and cutting hallucinations to near zero) and scaled ingestion to ~10k messy documents with RabbitMQ + OpenTelemetry.

Vaibhav Sharma

Mid-level Software Engineer specializing in AI/ML and data platforms

Remote, USA | 5y exp
Google | Indiana University Bloomington

AI/ML engineer who built a production agentic system to automate computational research experiments (simulation execution, parameter exploration, and numerical analysis) and mitigated context-window failures using constrained tool-calling/prompt-chaining patterns in LangChain with OpenAI tool-enabled models. Also has adtech/big-data pipeline experience at InMobi, orchestrating Spark jobs in Airflow to filter bot-like user IDs and publish clean IDs to an online NoSQL store for live serving, plus Apache open-source collaboration experience.

Prasannakumar B Vardi

Senior Software Engineer specializing in low-latency ad targeting and distributed backend systems

Santa Clara, CA | 9y exp
Cardlytics | Stony Brook University

Backend/platform engineer who built a high-scale audience segmentation and real-time targeting system using Spark/Glue + S3/Hudi and low-latency API services backed by Redis/relational stores. Demonstrates strong production rigor: Spark performance tuning to eliminate OOM failures, API idempotency/caching to cut p95 latency ~40%, and careful dual-run/feature-flag migrations with reconciliation and rollback runbooks. Experienced implementing layered security with JWT/OAuth, RBAC/ABAC, and database row-level security to prevent privilege escalation.

Sankalp Tiwari

Mid-Level Software Engineer specializing in backend microservices and FinTech data pipelines

New York, NY | 4y exp
Goldman Sachs | San José State University

Backend engineer at Goldman Sachs who built LLM-powered reconciliation/reporting services and high-throughput Kafka pipelines (8M+ events/day). Strong in production-grade Python/FastAPI microservices on Kubernetes with GitOps-style CI/CD, plus experience migrating legacy reporting/settlement services onto an internal Kubernetes platform using shadow deployments and gradual cutovers.

Belal Beydoun

Screened

Intern Full-Stack Software Engineer specializing in AI and data analytics

Detroit, MI | 2y exp
DTE Energy | University of Michigan

Software engineer focused on real-time, low-latency AI pipelines: built an end-to-end mobile-to-backend image classification system using React Native/Expo, Node.js, gRPC, MySQL, and Google Vision AI, optimizing throughput and latency. Also integrated an AI model into a real-time field workflow at DTE via Node.js + Azure Databricks, adding data cleaning/validation and safe fallback logic for reliability in operations.

Akshit Modi

Screened

Mid-level AI/ML Engineer specializing in healthcare NLP and MLOps

Remote, USA | 5y exp
Tempus | Arizona State University

Healthcare/clinical ML practitioner who built and productionized ClinicalBERT-based pipelines to extract and standardize oncology EHR data, improving downstream model F1 from 0.81 to 0.92 while controlling training cost via LoRA/QLoRA. Experienced orchestrating real-time AWS ETL/ML workflows (Glue, Lambda, SageMaker) and partnering with clinicians using SHAP-based interpretability, contributing to an 18% reduction in readmissions and full adoption.

Utkarsh Mittal

Intern Data Scientist specializing in computer vision and LLM agents

Sunnyvale, CA | 0y exp
Covalent Metrology | NYU

Software engineering candidate with hands-on experience building and shipping LLM agents: created a production AI enrichment/coding agent at Covalent Metrology using Apollo.io + OpenAI, and built a Mistral hackathon router that dynamically selects among models to reduce token cost while maintaining quality. Also developed a real-time financial margin analysis agent that emails actionable insights and iterated on reliability issues (e.g., fixing misrouted emails, improving news relevance filtering).

Bhavyasree Chinthala

Mid-level Data Engineer specializing in cloud data pipelines and real-time streaming

USA | 5y exp
PNC | Saint Peter's University

Data engineer with PNC Bank experience owning high-volume financial transaction pipelines end-to-end (Kafka/REST ingestion through Spark/Glue transformations to Redshift serving) for risk and fraud analytics. Built strong reliability and data quality practices (Great Expectations, reconciliation, Airflow alerting, idempotent retries, incremental/windowed processing), reporting 40% ingestion efficiency gains and ~99.9% data accuracy.

Suloni Praveen

Entry-Level Software Engineer specializing in data engineering and ML systems

Los Angeles, CA | 0y exp
Easley-Dunn Productions | USC

Built an end-to-end Next.js/TypeScript LLM-based scientific PDF analyzer using local Ollama/Llama inference to prioritize privacy and cost, producing structured research artifacts (e.g., authors/methods/findings) with ~92% extraction accuracy. At Qualtrics, helped replace a batch pipeline with a real-time, low-latency ML inference service (Python/Go on Kubernetes) using Redis caching, Grafana-based observability, and graceful fallbacks to protect UX during failures.

Lance Chou

Screened

Intern Machine Learning Engineer specializing in NLP and MLOps

Canada | 1y exp
Vosyn | Columbia University

PhD-led research engineer who has shipped LLM-powered agents for automated knowledge extraction from STEM textbooks/papers into a graph database, reporting a 90% accuracy improvement and major reductions in manual curation time. Also built an end-to-end multi-agent news aggregation/sentiment pipeline using the Agno framework with Pydantic-structured outputs, retries, and monitoring, and has experience processing messy SEC filings.

Binghan Zhang

Screened

Intern Data Analyst specializing in business intelligence and financial analytics

San Francisco, CA | 1y exp
Innova AI Tech | UCLA

Analytics candidate with hands-on experience in both fraud and churn use cases, including SQL-based preparation of 6.5M transaction records and reproducible Python modeling workflows. Stands out for combining technical rigor in data quality, feature engineering, and imbalance handling with strong stakeholder alignment, metric definition, and dashboard adoption.

AM

Mid-level analytics professional specializing in AI, strategy, and business intelligence

Seattle, WA | 5y exp
Dell Technologies | University of Washington

Analytics-focused candidate with hands-on experience using SQL and Python to clean messy business data, automate reporting, and build practical customer analytics solutions. Notable examples include a 70% reduction in reporting time through Python-based Excel automation at Shell and stakeholder-friendly retention/RFM segmentation work for small business clients in freight and winery contexts.

Atharva Bhide

Screened

Entry Software Engineer specializing in AI/ML and multimodal systems

Los Angeles, CA | 1y exp
Sigma Healthsense | USC

Built and shipped a production healthcare AI platform for a clinic in Brea, CA that combined LLM-based clinical report generation, voice agents for appointment workflows, and camera-based patient monitoring. Stands out for pairing multimodal AI architecture with production-grade reliability and compliance practices, while delivering concrete business results including 90% workflow automation, 200 hours saved per month, and a 60% improvement in customer retention.

Prutha Patel

Screened

Mid-level Business Analyst specializing in healthcare and data analytics

Texas, USA | 3y exp
Blue Cross Blue Shield | University of Texas at Arlington

Analytics candidate with hands-on experience at BCBS building HIPAA-compliant SQL/Snowflake/Tableau pipelines across fragmented legacy healthcare systems. Stands out for turning a 5-day claims reporting process into a near real-time 10-minute dashboard and for pairing strong data engineering discipline with reproducible Python-based churn modeling that drove measurable retention outcomes.
