Vetted Data Ingestion Professionals

Pre-screened and vetted.

Manpreet Kour - Senior Data Scientist specializing in Generative AI and NLP in Seattle, USA

Manpreet Kour

Screened

Senior Data Scientist specializing in Generative AI and NLP

Seattle, USA6y exp
SOTIDr. B. R. Ambedkar National Institute of Technology, Jalandhar

ML/NLP engineer with recent Scotiabank experience building production-grade indexing automation over large-scale emails and customer databases, combining LLM fine-tuning (Mistral, XLM-R) with fuzzy matching to exceed 95% accuracy under strict banking constraints. Also built a RAG-based chat agent using Gecko embeddings, Vertex AI Search, Gemini, and cross-encoder reranking, and delivered a text-to-SQL chatbot at SOTI through iterative fine-tuning and benchmark-driven experimentation.

View profile
vineetha Pulipati - Mid-level Software Engineer specializing in backend microservices and cloud data pipelines in MO, USA

Mid-level Software Engineer specializing in backend microservices and cloud data pipelines

MO, USA4y exp
Morgan StanleyWebster University

Backend engineer with Morgan Stanley experience building and owning an end-to-end Python FastAPI microservice for high-volume market data used by trading and risk systems. Strong in performance tuning and reliability (PySpark, Redis caching, async APIs), real-time streaming with Kafka, and production operations (Docker/Kubernetes, GitOps-style CI/CD, monitoring). Has led cloud/on-prem migration work across AWS and Azure, including fixing Azure Synapse performance issues via query and pipeline redesign.

View profile
Madhu Sriram Sengottu Velan - Entry-Level Software Engineer specializing in backend systems and distributed services in Chennai, IN

Entry-Level Software Engineer specializing in backend systems and distributed services

Chennai, IN2y exp
Oracle Financial Services SoftwareShiv Nadar University

Backend/AI engineer from an early-stage Japan-based startup (WorkAI) who built a multi-tenant RAG system integrating Notion/Slack/Google Drive with Pinecone and OpenAI, including a chatbot retrieval workflow. Experienced in production reliability (rate limits, retries, verification layers), strong Python/FastAPI engineering practices, and PostgreSQL performance optimization; currently based in India and needs sponsorship.

View profile
Abhishek Saraswat - Mid-level Software Engineer specializing in Java backend microservices in Arlington, TX

Mid-level Software Engineer specializing in Java backend microservices

Arlington, TX4y exp
KPMGUniversity of Texas at Arlington

Backend/distributed-systems engineer focused on automation and near-real-time processing, building Java/Spring Boot microservices with Kafka, PostgreSQL, and AWS. Strong in scaling and reliability work—debugging tricky asynchronous messaging issues (delays, duplicates, out-of-order events) and improving resilience/observability with retries, fallbacks, logging, and monitoring. No production ROS/ROS2 experience yet, but has studied core ROS concepts and draws clear parallels to event-driven architectures.

View profile
AA

Agna Antony

Screened

Mid-level Data Engineer specializing in cloud-native healthcare and enterprise data platforms

Michigan, USA5y exp
MedStar HealthAPJ Abdul Kalam Technological University

Data Engineer (TCS) who owned an end-to-end CRM analytics pipeline for Bayer’s eSalesWeb integration, ingesting from Salesforce APIs/databases/S3 and serving analytics-ready datasets via PostgreSQL/S3 for Tableau. Drove measurable outcomes: ~60% reduction in manual data-quality effort, ~30% lower latency through SQL optimization, and ~35% improved stability via monitoring, retries, and idempotent processing.

View profile
WF

Wyatt Fong

Screened

Entry-level Full-Stack Software Engineer specializing in AI and healthcare tech

La Jolla, CA1y exp
University of California San DiegoUC San Diego

Built a Python pipeline to monitor and classify public posts from sources like Hacker News and Reddit for SWE/tech job opportunities, with a strong focus on reliability, observability, and recoverable failures. Also currently building a court queueing system for the UCSD Badminton Club, showing an ability to turn messy, informal real-world processes into practical automation through iterative user feedback.

View profile
Bhagyashree Patil - Intern Applied AI Engineer specializing in LLM systems and data engineering in Los Angeles, CA

Intern Applied AI Engineer specializing in LLM systems and data engineering

Los Angeles, CA1y exp
USC Auxiliary ServicesUSC

Full-stack engineer with hands-on production experience across both traditional SaaS and LLM-powered support tooling. They owned a real-time ecommerce order tracking dashboard that improved support response times by 40%, and helped ship an AI support assistant using the OpenAI GPT API that cut ticket handling time by 30% through strong prompt design, retrieval grounding, validation, and human-in-the-loop safeguards.

View profile
AJ

Anthony Jin

Screened

Junior Software Engineer specializing in full-stack, data, and AI systems

Remote2y exp
NVIGENUC Santa Barbara

Full-stack engineer who independently built and still operates a live e-commerce clothing site, owning everything from frontend UX to GraphQL backend, auth, and Stripe payments. Also helped ship an AI-powered survey follow-up product using GPT-4.1, with hands-on experience in prompt design, Lambda-based architecture, and pragmatic LLM integration in an early-stage startup environment.

View profile
MR

Senior Software Engineer specializing in cloud-native microservices (AWS, Java, Kafka)

Dallas, TX4y exp
AccentureUniversity of Houston

Backend engineer with hands-on experience modernizing high-volume transactional systems by decomposing monoliths into Spring Boot microservices on AWS, using Kafka for async workflows and Redis/SQL tuning for latency. Has built Python/FastAPI services with strong API contracts and production-grade security (OAuth2/JWT, RBAC, row-level security), and proactively hardened payment flows against race conditions and double-charging via idempotency.

View profile
SP

Surya Pavan

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and LLM applications

Baltimore, MD5y exp
AcerCalifornia State University, Northridge

GenAI engineer who has deployed production LLM/RAG chatbots for internal document search, focusing on reliability (hallucination reduction via prompt guardrails + retrieval filtering) and performance (latency improvements via caching). Experienced with LangChain/LangGraph orchestration for multi-step agent workflows and iterates using monitoring/logs and benchmark-driven evaluation while partnering closely with product and business teams.

View profile
VM

Mid-level Machine Learning & Full-Stack Engineer specializing in GenAI platforms

San Francisco, CA5y exp
WellDhanNortheastern University

LLM/agent builder who has shipped production AI systems in the wellness space, including an LLM-powered food tracking product used by 5000+ users and a voice/call-routing onboarding workflow using LangGraph/LangChain with LiveKit and Twilio. Strong focus on practical reliability work: latency reduction, retrieval/embedding tuning, and CI-driven evaluation with simulations and metrics.

View profile
SK

Mid-level Data Scientist specializing in real-time fraud detection and MLOps

San Francisco, CA5y exp
Charles SchwabCUNY Graduate Center

ML/NLP engineer with experience at Charles Schwab building an NLP + graph (Neo4j) entity-resolution system to unify fragmented user/device/transaction data and improve downstream model quality and analyst querying. Has applied embeddings (SentenceTransformers + FAISS) with domain fine-tuning to boost hard-case matching recall by ~12% while maintaining precision, and has a track record of hardening scalable Python/Spark pipelines and productionizing fraud models via A/B tests and shadow-mode monitoring.

View profile
AI

Intern Software Engineer specializing in AI systems and backend infrastructure

West Lafayette, IN2y exp
Acuvity AIPurdue University

Full-stack engineer with early-stage startup experience who shipped and owned production Next.js (App Router + TypeScript) features end-to-end, including auth-aware APIs, caching, and post-launch monitoring/iteration. Demonstrates strong performance and reliability chops across React UX optimization, Postgres analytics modeling/query tuning (validated via query plans), and durable ingestion workflows with retries/idempotency.

View profile
KR

Mid-Level Backend Engineer specializing in SaaS, FinTech, and AI document intelligence

San Francisco, CA3y exp
IntraEdgeNYU

Full-stack engineer who built an AI-driven document analysis and processing workflow end-to-end, including large-document ingestion, queued async processing, and low-latency retrieval for user-facing flows. Demonstrated practical performance tuning (moving heavy work off request path, polling, caching) and Postgres optimization validated with EXPLAIN ANALYZE, plus durable workflow resilience via retries and dead-letter queues.

View profile
VR

Mid-level Backend/AI Software Developer specializing in data pipelines for FinTech and healthcare

6y exp
TMV InvestmentsWright State University

Data engineer/backend data services builder with end-to-end ownership of production pipelines for a Pfizer client, combining Python/SQL ingestion and transformation with strong data quality controls. Delivered measurable performance gains (~30% faster queries) and improved reliability through monitoring/alerting (Splunk, Prometheus/Grafana), structured logging, and incident response; also built internal REST APIs with versioning and caching and set up GitLab-based CI/CD with containerized deployments.

View profile
AG

Mid-level Data Engineer specializing in cloud ETL and real-time streaming

New York, NY6y exp
PNCRochester Institute of Technology

Data engineer focused on AWS + Spark/Databricks pipelines, including an end-to-end nightly loan-data ingestion flow (~2.2M records) from Postgres/S3 through Glue and Databricks into a DWH with layered validation and alerting. Also built real-time streaming with Kafka + Spark Structured Streaming and a master’s project streaming Reddit data for sentiment analysis under ambiguous requirements and tight budget constraints.

View profile
Ram Usarty - Mid-level Full-Stack Software Engineer specializing in cloud-native distributed systems in USA

Ram Usarty

Screened

Mid-level Full-Stack Software Engineer specializing in cloud-native distributed systems

USA4y exp
OnesynergeeUniversity of Cincinnati

Backend/platform-focused engineer who has shipped production LLM agents for messy research dataset submissions, turning manual validation into an automated, reliable ingestion pipeline. Strong on production hardening (streaming large uploads, strict schema/function-calling outputs, idempotency, RBAC) plus eval/monitoring loops that improved data quality, reduced support burden, and increased adoption.

View profile
Srilekha Jakkula - Senior Data Engineer specializing in scalable data pipelines and API-driven data services in Chicago, IL

Senior Data Engineer specializing in scalable data pipelines and API-driven data services

Chicago, IL5y exp
Northern TrustNorthern Illinois University

Data engineer focused on building scalable, reliable end-to-end data pipelines and backend REST data services, spanning API ingestion plus batch/stream processing with Airflow, Kafka, Spark/PySpark, and SQL. Emphasizes strong data quality validation, monitoring/fault tolerance, and performance tuning for large datasets, with experience deploying in cloud environments using containerization and CI/CD.

View profile
AP

Axel Paredes

Screened

Mid-level Business Analyst specializing in operations data and reporting

Miami, FL6y exp
Cole, Scott & Kissane, P.A.Miami Dade College

Candidate has hands-on project experience in healthcare analytics, using SQL, Python, and Power BI to analyze CMS hospital readmissions and HRRP penalty risk in Florida. Their work centers on turning messy CMS flat files into reporting-ready datasets, benchmarking hospitals against national references, and surfacing financial risk through dashboards.

View profile
PN

Mid-level AI Engineer specializing in LLMs, RAG, and production ML systems

Oregon, USA3y exp
HexawareOregon State University

Backend engineer who built an AI-powered grant matchmaking platform for researchers and professors, combining semantic matching, embeddings, and Semantic Scholar enrichment with rule-based eligibility filters. Stands out for pragmatic AI engineering: they focused on reliability through confidence scoring, logging, manual validation, and production-minded backend design.

View profile
MC

Staff Software Engineer specializing in FinTech and payroll platforms

Albany, GA15y exp
WrapbookMississippi State University

Full-stack engineer with startup experience building real-time collaboration and meeting platforms for enterprise customers. Has worked across product ownership, React/TypeScript frontends, Go and Node.js backend services, PostgreSQL data modeling, and production performance optimization in B2B SaaS environments.

View profile
SP

Junior Software Engineer specializing in backend systems and AI infrastructure

Redwood City, CA2y exp
WindBorne SystemsEmory University

Backend/full-stack engineer with deep experience building weather and geospatial data systems at WindBorne, spanning Next.js/TypeScript frontends through PostgreSQL, Redis, Sidekiq, Rails, Rust, and object-storage-backed forecast pipelines. Particularly strong in production reliability work—self-healing jobs, zero-downtime migrations, query/index optimization, and event-driven ingestion architectures that reduce latency and operational waste.

View profile

Need someone specific?

AI Search