Vetted Data Ingestion Professionals

Pre-screened and vetted.


Raj Kale

Screened

Junior Software Engineer specializing in full-stack systems and FinTech

Los Angeles, CA · 3y exp
University of Southern California · USC

Full-stack engineer with experience building financial and hiring-product systems, spanning React/TypeScript dashboards, Flask/Kafka/Postgres backends, and multi-tenant configuration for 3,000+ clients. Stands out for combining deep technical debugging and performance work with product-minded UX improvements, including a 41% lift in resume matching accuracy and ~40% latency reduction through batching and query tuning.

AD

Junior AI Engineer specializing in ML, LLM systems, and RAG

Bangalore, India · 2y exp
NxtGen Cloud Technologies · University at Buffalo

Built and deployed an LLM/applied-ML system enabling efficient extraction of useful information from large unstructured multimodal datasets, owning the full pipeline from ingestion to inference and APIs with a strong emphasis on production reliability, latency, and monitoring. Also delivered a voice-based AI workflow for Hindi policy document access for the Election Commission of India by translating non-technical usability needs into iterative demos and a successful implementation.

SC

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

Atlanta, GA · 4y exp
Universal Health Services · University of New Haven

Built a production RAG-based healthcare chatbot to retrieve patient medical documents spread across multiple platforms, reducing manual and error-prone searching. Implemented semantic search with custom embeddings (Hugging Face) and Pinecone, deployed via FastAPI/Docker on AWS SageMaker with MLflow tracking, and optimized fine-tuning cost using LoRA while orchestrating retraining pipelines in Airflow.

TP

Thilak P

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and big data pipelines

5y exp
W. R. Berkley · Sacred Heart University

Backend/data engineer who builds Python (FastAPI) data-processing API services for internal analytics/reporting, emphasizing modular architecture, async performance tuning, and reliability patterns (health checks, retries, observability). Also migrated legacy on-prem ETL pipelines to Azure using ADF/Data Lake/Functions and implemented a near-real-time ingestion flow with Event Hubs plus watermarking to handle late events and deduplication.


Mid-level AI Engineer & Data Scientist specializing in LLMs, RAG, and multimodal systems

Tempe, AZ · 5y exp
HCLTech · Arizona State University

LLM/GenAI engineer who built a production AI-powered credit risk policy summarization and compliance alerting platform at HCL Tech, focused on factual accuracy and auditability for a financial client. Implemented a multi-retriever LangChain RAG architecture with citations-only prompting, fallback agents, and human-in-the-loop legal review, cutting manual review time by 35% and scaling to 12 teams.


Shivam Lahoti

Screened

Junior Full-Stack Software Engineer specializing in TypeScript/React and microservices

Boston, USA · 2y exp
Northeastern University · Northeastern University

Software engineer who built and owned an internal workflow automation + analytics platform end-to-end (TypeScript/React/Node) with a microservices, RabbitMQ-based async architecture. Drove adoption by shipping iterative prototypes and prioritizing reliability/performance (Redis caching, query optimization), delivering ~30–35% latency improvements and ~30–40% reduction in manual operational work.

Jitesh Kumar S

Junior Machine Learning Engineer specializing in NLP, computer vision, and MLOps

Lafayette, IN · 3y exp
Yaarcubes · University of Maryland, College Park

ML/LLM engineer with Meta experience building production AI systems for near real-time user-report classification and summarization under strict latency (<250ms), safety, cost, and privacy constraints. Has hands-on MLOps/orchestration experience (Airflow, Spark, MLflow, Kubernetes, Docker, GitHub Actions) plus observability (Prometheus/Grafana) and applies rigorous evaluation, staged rollouts, and A/B testing to keep agent workflows reliable in production.

Butchi Venkatesh Adari

Mid-level Machine Learning Engineer specializing in LLM platforms and robotic perception

New York, NY · 4y exp
Alpheva AI · Worcester Polytechnic Institute

Built and shipped a production multi-agent personal financial assistant at AlphevaAI on AWS ECS, combining FastAPI microservices, Redis/SQS orchestration, and Pinecone-based hybrid RAG (semantic + BM25) to ground financial guidance. Improved routing accuracy with an embedding-based SetFit + logistic regression intent classifier feeding an LLM router, and optimized UX with live streaming plus cost controls via model tiering and caching.

Snehitha Penumaka

Mid-level AI/ML Engineer specializing in predictive modeling and cloud ML pipelines

Dallas, TX · 3y exp
Cambard LLC · University of Texas at Dallas

LLM engineer/data engineer who has deployed production RAG systems for internal-document Q&A, building end-to-end ingestion, embedding, vector search, and FastAPI serving while actively reducing hallucinations and latency through rigorous retrieval tuning and caching. Also experienced in orchestrating cloud data pipelines (Airflow, AWS Glue, Azure Data Factory) and partnering with non-technical business teams to deliver AI solutions like automated document review.


Riya Chaddha

Screened

Mid-level Data Engineer and Business Analyst specializing in cloud ETL and analytics

Remote, US · 5y exp
Mellicell · Northeastern University

Data analyst with cross-industry experience spanning insurance analytics at L&T Infotech and experimental imaging analytics at Mylyser. Stands out for building scalable SQL/PySpark data pipelines, standardizing business-critical metrics like claims lifecycle and policy retention, and delivering measurable impact such as 50%+ faster query performance and a 15% reduction in claims settlement time.

SK

Intern Software Engineer specializing in cloud, full-stack, and AI systems

United States · 1y exp
Five9 · California State University, East Bay

Built a production LLM-assisted workflow for customer configuration data migrations, combining agentic parsing with deterministic validation and fail-safe pipeline design. Stands out for turning messy ERP and operational data into reliable, repeatable transformations while improving accuracy and cutting manual effort by more than 80%.

IK

Mid-level Software Engineer specializing in AI systems and distributed platforms

Alexandria, VA · 4y exp
Virginia Tech · Virginia Tech

Built OpenGPU features spanning React/TypeScript, Go orchestration, PostgreSQL, Redis, and Stripe, with a strong focus on reliability, transaction integrity, and low-latency distributed systems. Also shipped LLM product infrastructure, including persona-conditioned frameworks and reusable prompt/model abstractions, showing a blend of systems engineering and fast product iteration.

Chaitanya Annabathana

Mid-level Software Engineer specializing in AI pipelines and enterprise integrations

USA · 5y exp
AFBA Life Insurance · California State University, East Bay

Candidate has 4 years of experience and appears strongest in customer-facing implementation and AI-enabled workflow automation. They describe owning deployments end-to-end, putting an LLM support assistant with RAG and function calling into production, and improving support operations with a 30% reduction in resolution time and 25% gain in agent productivity.


Naveen K

Screened

Mid-level Full-Stack Software Engineer specializing in AI agents and RAG workflows

San Francisco, CA · 3y exp
Wells Fargo · University of Central Missouri

Candidate is highly focused on AI-native software development, using tools like GitHub Copilot and OpenAI models within structured plan-code-review-test workflows. They stand out for designing multi-agent coding systems with planner, coder, and tester roles, and for applying tech-lead style governance through constraints, quality gates, and validation-first practices.

Austin Pierce-Ptak

Mid-level Software Engineer specializing in full-stack and ETL systems

Seattle, WA · 5y exp
Broadridge · Loyola University Maryland

Backend engineer with end-to-end ownership experience across enterprise SaaS and high-volume data systems, including PostgreSQL/.NET services at Visual Lease and ETL pipelines at Broadridge processing millions of records for Fortune 500 clients. Stands out for combining production support, observability thinking, and pragmatic architecture tradeoffs, while also experimenting with LLM-powered job application automation using Claude.


Shahbaz Raza

Screened

Mid-level Software Engineer specializing in ML infrastructure and cloud-native data platforms

Lahore, Pakistan · 4y exp
Motive · National University of Computer and Emerging Sciences

Backend/data engineer focused on high-scale, event-driven AWS ingestion systems (SQS/Lambda/EKS) processing millions of events per day, with strong reliability patterns (idempotency, DLQs, bounded retries) and deep observability using Datadog distributed tracing. Has delivered Terraform/GitHub Actions CI/CD and improved secret rotation via Secrets Manager + IRSA, plus Glue-based ETL with schema-evolution handling and Postgres SQL optimization (including JSONB/GIN indexing). Candidate is currently living outside the US and states they do not have US work authorization.

SR

Mid-level Full-Stack Software Engineer specializing in cloud-deployed web apps and APIs

Dayton, OH · 3y exp
Wells Fargo · Wright State University

Software engineer who has shipped both core web platform features (secure user authentication/profile management) and production LLM systems. Built an internal documentation knowledge assistant using a full RAG pipeline (OpenAI embeddings, vector DB, semantic search, reranking) with evaluation loops and a scalable document-ingestion pipeline for PDFs/FAQs, iterating based on metrics and user feedback.

AG

Mid-level AI/ML Engineer specializing in MLOps and cloud-deployed ML systems

Austin, TX · 3y exp
Purevisitx · University of Illinois Springfield

ML/AI engineer who built and productionized an NLP system at PurevisitX, orchestrating end-to-end ML workflows with Airflow (S3 ingestion through auto-retraining) and optimizing for drift and low-latency inference. Also partnered with Citibank risk teams on a fraud detection model, translating results via dashboards and iterating thresholds based on stakeholder feedback.

VS

Mid-level Full-Stack Software Developer specializing in cloud-native microservices

WI, USA · 5y exp
HCLTech · Wright State University

Product-focused full-stack engineer (Spring Boot/Django + React/TypeScript) with deep experience building multi-tenant, enterprise workflow and supply-chain/order-tracking systems. Owned an end-to-end Workflow SLA Breach Prediction & Alerting feature integrating Azure ML for a cloud workflow platform used by ~10,000 enterprise users, and has hands-on AWS operations experience resolving real production latency/scaling incidents via query optimization and Redis caching.

GD

Mid-level GenAI/ML Engineer specializing in LLM systems and RAG chatbots

Houston, TX · 3y exp
University of Houston · University of Houston

Built and shipped a production agentic LLM analytics platform that lets non-SQL business users query relational databases in plain English via a RAG + LangChain/LangGraph workflow and FastAPI service. Emphasizes safety and reliability with guardrails (validation/access control), testing/evaluation frameworks, and performance optimization (caching, monitoring, Dockerized scalable deployment), reducing dependency on data teams and speeding analytics turnaround.

MK

Mid-level AI/ML Engineer specializing in Generative AI and MLOps

Arlington, TX · 4y exp
micro1 · University of Texas at Austin

Built and shipped a production RAG assistant using GPT-4, LangChain, and Pinecone/FAISS to search 50K+ institutional documents, with a strong focus on groundedness and hallucination reduction through retrieval optimization and re-ranking. Pairs this with a metrics-driven evaluation/monitoring approach (BLEU/ROUGE, manual sampling, logging) and workflow automation via Airflow, and has experience translating stakeholder needs into iterative AI prototypes.


Hansitha P

Screened

Mid-level Data Engineer specializing in scalable ETL/ELT and real-time streaming pipelines

USA · 4y exp
CVS Health · University of Cincinnati

Built and shipped a production LLM-powered customer support agent for an EV charging platform using RAG plus internal APIs, automating session/payment issues and ticket routing. Emphasizes production readiness via guardrails, schema validation, state-machine orchestration, monitoring, and continuous evals, delivering a reported 35–40% reduction in support tickets and improved customer satisfaction.


Dhairya Desai

Screened

Senior AI/ML Engineer specializing in healthcare NLP and predictive analytics

Chicago, IL · 13y exp
Optum · University of Texas at Dallas

ML/NLP engineer with healthcare and industrial IoT experience: built an Optum pipeline that converted 2M+ physician notes into structured entities and linked them with claims/pharmacy data to create an actionable patient timeline. Deep hands-on expertise in production NER, entity resolution, and hybrid search (Elasticsearch + embeddings/FAISS), plus robust data engineering practices (Airflow, Spark, data contracts, auditability) and experimentation-to-production rollout via shadow mode and feature flags.

Ponugoti Sushma

Mid-level Machine Learning Engineer specializing in IoT, edge AI, and enterprise ML

Texas, USA · 5y exp
Allstate · Texas A&M University-Corpus Christi

Built and productionized an LLM/RAG question-answering service over technical documentation, focusing on retrieval quality (reranking + IR metrics), latency, and scaling. Experienced orchestrating end-to-end ETL/ML workflows with Airflow/Prefect/AWS Step Functions and improving reliability via parallelism, retries, and shadow testing. Also delivered an explainable healthcare risk-flagging classifier with a stakeholder-friendly dashboard for a non-technical program manager.
