Vetted Data Ingestion Professionals

Pre-screened and vetted.

HK

Mid-level Data/ML Engineer specializing in NLP, GenAI, and scalable data pipelines

5y exp
AbbottClarkson University

AI/ML engineer with production experience building LLM-powered document intelligence and customer support systems in healthcare/insurance, emphasizing high-accuracy RAG, long-document processing, and robust monitoring/fallback mechanisms. Also automates and scales ML lifecycle workflows using Apache Airflow and Kubeflow, and partners closely with non-technical operations stakeholders to drive adoption.

View profile
AR

Anagha Rumade

Screened

Senior Applied AI/ML Engineer specializing in GenAI, LLMs, RAG and agents

Palo Alto, California9y exp
JPMorgan ChaseStevens Institute of Technology

Applied AI/ML Engineer at JPMorgan Chase who led a banker-facing LLM chatbot from an OpenAI-API POC to a production RAG workflow, including hallucination mitigation, automated evaluation in SageMaker, and operational monitoring with Dynatrace. Also delivers external technical education—hosted a hands-on Grace Hopper Celebration 2025 workshop teaching LangChain/LangGraph agentic workflows.

View profile
SM

Mid-level Data Scientist specializing in NLP/LLMs, time series forecasting, and MLOps

New York, NY6y exp
CitigroupKent State University

Data/ML practitioner with hands-on experience building NLP systems from prototype to production: delivered a Twitter sentiment classifier with robust preprocessing, SVM modeling, and Power BI reporting, and built entity-resolution pipelines for messy multi-source customer data (reporting ~95% improvement in unique entity identification). Also implemented semantic linking/search using SBERT embeddings with FAISS vector retrieval and domain fine-tuning (reported ~15% precision lift), and applies production workflow best practices (Airflow/Prefect, Docker, Azure ML/Databricks, Great Expectations).

View profile
CT

Chad Thomas

Screened

Executive Technology Leader (CTO/Chief Architect) specializing in AI, FinTech, and scalable platforms

Remote, FL34y exp
Intech InvestmentsColorado State University

Serial entrepreneur who built Verb Technology from a garage startup to a Nasdaq IPO, raising multiple rounds of capital along the way. Invented interactive live streaming technology that was acquired by Amazon and demonstrated rapid product/market response during COVID by prototyping and launching a solution for users while tightly managing AWS costs.

View profile
BS

Senior Data Engineer specializing in cloud lakehouse platforms and streaming analytics

Pittsburgh, PA8y exp
First National BankTexas A&M University-Corpus Christi

Data engineer focused on fraud and banking analytics who has owned end-to-end batch + streaming pipelines at very large scale (hundreds of millions of records/day). Built robust data quality/observability layers (schema validation, anomaly detection, alerting) and delivered low-latency serving via AWS Lambda/API Gateway with DynamoDB + Redis, plus external data ingestion/scraping pipelines orchestrated in Airflow with anti-bot protections.

View profile
Ajay Madhusudhan Thumala - Junior Software Engineer specializing in data engineering and LLM applications in Irvine, CA

Junior Software Engineer specializing in data engineering and LLM applications

Irvine, CA1y exp
GeisingerUC Irvine

Computer science engineer and master’s graduate who independently built a mechatronics-heavy capstone prototype: a smartphone concept for deafblind users using micro-actuator arrays for braille reading. Also has platform engineering experience at Quantiphi, deploying webhooks to Kubernetes and implementing GitOps-based CI/CD using AWS CodeCommit/CodeBuild and ECR.

View profile
Bhanu Prakash Reddy Dakilli - Mid-level Data Engineer specializing in Azure ETL/ELT and data warehousing in Framingham, MA

Mid-level Data Engineer specializing in Azure ETL/ELT and data warehousing

Framingham, MA4y exp
Bank of AmericaNew England College

Data engineer who has owned end-to-end production pipelines for customer transaction data (~2–5 GB/day) using Python/PySpark/SQL and Airflow, delivering major reliability and speed gains (70% faster reporting; 60–70% fewer data issues). Also built a daily external web-scraping system with anti-bot handling and safe, idempotent Airflow-driven backfills, plus a Python data API optimized with indexing/caching and tested for correctness.

View profile
SB

Mid-level Data Analyst specializing in financial and telecom analytics

Remote, USA5y exp
AT&TLewis University

Analytics candidate with hands-on experience at AT&T building SQL/Python pipelines for churn, usage, billing, and network-performance data at multi-million-row scale. Stands out for combining strong data quality and reconciliation practices with measurable operational impact, including a 30% query runtime improvement and ~8 hours/week of reporting automation savings.

View profile
RS

Mid-level Software Engineer specializing in cloud-native backend and AI systems

Long Beach, CA4y exp
JPMorgan ChaseCalifornia State University, Long Beach

Candidate takes a disciplined, developer-in-the-loop approach to AI-assisted coding, using AI primarily for brainstorming, suggestions, and optimization while retaining full ownership of architecture and final code decisions. They also actively stay current on AI developments through research papers, communities, and emerging tools.

View profile
AG

Amit Gaur

Screened

Mid-level AI Engineer specializing in LLMs and production ML systems

Long Beach, CA4y exp
California State University, Long BeachCalifornia State University, Long Beach

Engineering leader with hands-on AI/ML systems experience spanning production inference infrastructure and consumer-facing LLM products. At Jio, they led a 17-person AI features team and delivered measurable execution gains, including 40% faster deployments and 35% lower prediction latency, while also building an end-to-end RAG-based meal recommendation product using OpenAI and Gemini.

View profile
Justin Edwards - Senior Product Manager specializing in GTM systems and data infrastructure in New York, NY

Senior Product Manager specializing in GTM systems and data infrastructure

New York, NY14y exp
Bank of AmericaIndiana University Kelley School of Business

Solutions-oriented technical consultant with enterprise experience spanning Bank of America cloud migration and large-scale marketing architecture redesigns at Disney Streaming and SiriusXM. Stands out for combining pre-sales solutioning, compliance-heavy enterprise integration, and hands-on building—including a self-built gen AI child nutrition app that evaluates 1.4 trillion meal combinations in about 4 seconds.

View profile
PE

Mid-Level Software Engineer specializing in distributed systems and cloud-native backends

Dallas, USA5y exp
T-MobilePurdue University

AI/LLM engineer with production experience at Charles Schwab building a RAG-based assistant to help 5,000+ reps answer complex financial policy questions. Implemented a multi-layer anti-hallucination approach (GNN-driven ontology/graph retrieval + citation-only answers) and compliance-focused guardrails (Azure AI Content Safety) in partnership with audit/compliance stakeholders.

View profile
AS

Avijit Saha

Screened

Junior Software Engineer specializing in cloud-native microservices and AI/ML observability

Bedford, TX3y exp
JPMorgan ChaseUniversity of the Cumberlands

Engineer with banking and industrial/IoT experience who has deployed a payment-processing microservice with zero downtime, handling Protobuf schema evolution and sensitive data migration via dual-write/checksum techniques. Demonstrates strong cross-stack troubleshooting (pinpointed intermittent distributed timeouts to a failing ToR switch port) and customer-facing Python ETL customization using plugin-based parsers and Pydantic validation, plus hands-on monitoring/alerting improvements with operators.

View profile
SK

Mid-level Machine Learning Engineer specializing in NLP and cloud MLOps

CT, USA4y exp
ServiceNowRivier University

Built and deployed a production LLM-powered internal documentation assistant using embeddings, a vector database, and a RAG pipeline to reduce time spent searching PDFs/manuals. Experienced in orchestrating end-to-end LLM workflows with Airflow/LangChain, improving reliability via monitoring/error handling, and driving measurable quality through retrieval and hallucination-focused evaluation metrics.

View profile
GJ

Mid-level Machine Learning Engineer specializing in MLOps, NLP, and Computer Vision

USA5y exp
WalmartUniversity of New Haven

ML/AI engineer with production experience across retail and healthcare: built a real-time computer-vision shelf monitoring system at Walmart and optimized edge inference latency by ~30% using TensorRT/ONNX and pruning. Also partnered with CVS Health clinical/pharmacy teams to deliver a medication-adherence predictive model, using Streamlit explainability dashboards and achieving an 18% adherence improvement.

View profile
DK

Senior Data Engineer specializing in Azure Lakehouse, Databricks/Spark, and Snowflake

Richardson, TX6y exp
PwCUniversity of Central Missouri

Data engineer/platform builder with experience across PwC and Liberty Mutual delivering high-volume, production-grade pipelines and real-time data services. Has owned end-to-end streaming + batch architectures on AWS and Azure, including web scraping systems, with quantified reliability gains (99.9% availability, 90%+ error reduction, 30% latency reduction) and strong observability/CI-CD practices.

View profile
SK

Mid-level Full-Stack Developer specializing in FinTech and enterprise web platforms

USA4y exp
JPMorgan ChaseChristian Brothers University

Financial-services AI engineer who shipped a production investment research assistant using RAG over internal research reports, SEC filings, and meeting transcripts, with a strong emphasis on truthfulness and guardrails. Built a structured evaluation loop (200+ golden test cases, RAG Triad metrics) that directly improved retrieval quality (e.g., fixing year-mismatch retrieval, boosting sensitive-query performance by 18% and cutting hallucinations to near zero) and scaled ingestion to ~10k messy documents with RabbitMQ + OpenTelemetry.

View profile
Wei Jiang - Junior Machine Learning Engineer specializing in MLOps and statistical modeling in Greenwood, SC

Wei Jiang

Screened

Junior Machine Learning Engineer specializing in MLOps and statistical modeling

Greenwood, SC3y exp
ES FoundryNortheastern University

Integration engineer at ES Foundry who led deployment of ELsentinel, a production EL image-based solar cell quality monitoring system using a Swin Transformer classifier (>0.8 F1 across 15+ classes) plus a live real-time prediction dashboard. Strong in solving messy labeling/data-quality problems with process-team collaboration and shipping ML systems despite limited compute/infrastructure.

View profile
Junhui Huang - Intern Machine Learning Engineer specializing in LLMs, MLOps, and NLP in Providence, RI

Junhui Huang

Screened

Intern Machine Learning Engineer specializing in LLMs, MLOps, and NLP

Providence, RI1y exp
Harvard UniversityBrown University

Built and deployed a production LLM-driven Dungeons & Dragons game where the model acts as a dungeon master, adding a structured combat system and a macro-state tree to ensure campaigns converge to a clear ending. Fine-tuned Gemini 2.5 Flash on Vertex AI and deployed on GCP with Kubernetes, using RAG over DnD rules/spells plus multi-agent orchestration (intent-based routing between narrative and combat agents) to reduce hallucinations and improve reliability.

View profile
Prasannakumar B Vardi - Senior Software Engineer specializing in low-latency ad targeting and distributed backend systems in Santa Clara, CA

Senior Software Engineer specializing in low-latency ad targeting and distributed backend systems

Santa Clara, CA9y exp
CardlyticsStony Brook University

Backend/platform engineer who built a high-scale audience segmentation and real-time targeting system using Spark/Glue + S3/Hudi and low-latency API services backed by Redis/relational stores. Demonstrates strong production rigor: Spark performance tuning to eliminate OOM failures, API idempotency/caching to cut p95 latency ~40%, and careful dual-run/feature-flag migrations with reconciliation and rollback runbooks. Experienced implementing layered security with JWT/OAuth, RBAC/ABAC, and database row-level security to prevent privilege escalation.

View profile
Sankalp Tiwari - Mid-Level Software Engineer specializing in backend microservices and FinTech data pipelines in New York, NY

Mid-Level Software Engineer specializing in backend microservices and FinTech data pipelines

New York, NY4y exp
Goldman SachsSan José State University

Backend engineer at Goldman Sachs who built LLM-powered reconciliation/reporting services and high-throughput Kafka pipelines (8M+ events/day). Strong in production-grade Python/FastAPI microservices on Kubernetes with GitOps-style CI/CD, plus experience migrating legacy reporting/settlement services onto an internal Kubernetes platform using shadow deployments and gradual cutovers.

View profile
Bhavyasree Chinthala - Mid-level Data Engineer specializing in cloud data pipelines and real-time streaming in USA, USA

Mid-level Data Engineer specializing in cloud data pipelines and real-time streaming

USA, USA5y exp
PNCSaint Peter's University

Data engineer with PNC Bank experience owning high-volume financial transaction pipelines end-to-end (Kafka/REST ingestion through Spark/Glue transformations to Redshift serving) for risk and fraud analytics. Built strong reliability and data quality practices (Great Expectations, reconciliation, Airflow alerting, idempotent retries, incremental/windowed processing), reporting 40% ingestion efficiency gains and ~99.9% data accuracy.

View profile
AS

Mid-level Software Engineer specializing in backend systems and AI automation

San Francisco, CA5y exp
For Women’s HealthUC Santa Cruz

Built a production Python microservice around Grafana Loki focused on reliability, with checkpointing, idempotency, replay tooling, tracing, and alerting to prevent data loss and silent lag. Also has hands-on experience hardening brittle Playwright automations against dynamic UIs, auth expiry, rate limits, MFA, and bot-detection constraints, plus turning tribal-knowledge SOPs into explicit state-machine-driven workflows.

View profile

Need someone specific?

AI Search