Vetted Pinecone Professionals

Pre-screened and vetted.

PN

Mid-level AI Engineer specializing in LLMs, RAG, and production ML systems

Oregon, USA3y exp
HexawareOregon State University

Backend engineer who built an AI-powered grant matchmaking platform for researchers and professors, combining semantic matching, embeddings, and Semantic Scholar enrichment with rule-based eligibility filters. Stands out for pragmatic AI engineering: they focused on reliability through confidence scoring, logging, manual validation, and production-minded backend design.

View profile
RM

Principal AI/ML Leader specializing in Generative AI, MLOps, and NLP

CA, USA11y exp
iBase-tNortheastern University

Founding member of Tausight, building AI systems to detect and protect PHI for healthcare organizations; helped take the company through post–Series A funding and exited after ~6 years. Drove a strategic collaboration with Intel’s OpenVINO team—becoming the first to deploy it in a real production system and improving model performance by ~30% on customer Intel-CPU machines.

View profile
SV

Mid-level Generative AI Engineer specializing in LLMs and RAG systems

5y exp
Summit Design and TechnologyNorthwest Missouri State University

Built and shipped a production RAG-based enterprise knowledge assistant to replace slow/inaccurate search across millions of documents, using LangChain orchestration with GPT-4/LLaMA and vector databases. Strong focus on production constraints—latency, hallucination control, and cost—using hybrid retrieval, guardrails, LLM-as-judge validation, and model routing, and has experience translating non-technical stakeholder pain points into measurable outcomes.

View profile
BS

Mid-level Data Engineer specializing in Lakehouse, Streaming, and ML/LLM data systems

Remote, USA3y exp
DiscoverUniversity of South Dakota

Built and productionized an enterprise retrieval-augmented generation platform for internal knowledge over large unstructured corpora, emphasizing trust via strict citation/grounding and hybrid retrieval (BM25 + FAISS + cross-encoder re-ranking). Demonstrates strong scaling and cost/latency optimization through incremental indexing/embedding and index partitioning, plus disciplined evaluation/observability practices. Has experience operationalizing pipelines with Airflow/Databricks/GitHub Actions and partnering closely with risk & compliance stakeholders on auditability requirements.

View profile
TT

Mid-level AI/ML Engineer specializing in MLOps and LLM applications

New York, NY4y exp
BNY MellonUniversity at Albany

BNY Mellon engineer who has built and operated production AI systems end-to-end: a LangChain/Pinecone RAG platform scaled via FastAPI + Kubernetes to 1000 RPM with 99.9% uptime, supported by monitoring and data-drift detection. Also deep in data/infra orchestration (Airflow, Dagster, Terraform on AWS/EMR/EC2), processing 500GB+ daily and delivering measurable reliability and performance gains, plus strong compliance-facing model explainability using SHAP and Tableau.

View profile
SV

Sathvik Vanja

Screened

Mid-level AI Engineer specializing in GenAI, LLM integration, and RAG pipelines

Overland Park, KS3y exp
HCA HealthcareVNR Vignana Jyothi Institute of Engineering and Technology

Built and led deployment of an autonomous, self-correcting multi-agent knowledge retrieval and validation system at HCA Healthcare to reduce heavy manual research/validation in clinical/compliance documentation. Deeply focused on production reliability and cost—used LangGraph StateGraph orchestration plus ONNX/CUDA/quantization to cut GPU costs by 25%, and partnered with the Compliance VP using real-time contradiction-rate dashboards to hit a 40% automation goal without compromising compliance.

View profile
HE

Mid-level AI/ML Engineer specializing in cloud data engineering and GenAI

Florida, USA6y exp
LexisNexisUniversity of South Florida

AI/LLM engineer with production experience in legal tech: built a GPT-4 + LangChain RAG summarization system at Govpanel that reduced legal case-file review time by 50%+. Previously at LexisNexis, orchestrated end-to-end Airflow data/AI pipelines processing 5M+ legal documents daily, improving ETL runtime by 35% with robust validation, monitoring, and SLAs.

View profile
NK

Nandini Kosgi

Screened

Mid-level AI/ML Engineer specializing in NLP, RAG systems, and real-time risk modeling

PA, USA4y exp
Capital OneRobert Morris University

AI/ML Engineer with 4+ years of experience (Capital One, Odin Technologies) and a master’s in Data Analytics (4.0 GPA) who has deployed LLM/RAG systems to production for compliance/risk and document review. Strong in orchestration and MLOps (Airflow, Kubernetes, MLflow, GitHub Actions) and in tackling real-world LLM constraints like latency, context limits, and data privacy, with measurable impact (20%+ manual review reduction; 33% faster release cycles).

View profile
Somil Shah - Mid-level AI/ML Engineer specializing in generative AI, RAG platforms, and LLM agents in San Francisco, CA

Somil Shah

Screened

Mid-level AI/ML Engineer specializing in generative AI, RAG platforms, and LLM agents

San Francisco, CA4y exp
INTERACT Animal LabNortheastern University

AI/LLM engineer who has shipped 10+ production applications, including InvestIQ on GCP—a production-grade RAG due-diligence engine that ethically scrapes web/PDF sources, builds a ChromaDB knowledge base, and delivers analyst-style dashboards plus a citation-backed chat copilot. Deep focus on reliability (evidence-only answers, hard citations, refusal gating), retrieval tuning, and orchestration (Airflow/Cloud Composer), plus multi-agent systems (CrewAI with 7 specialized finance agents).

View profile
Bala Venkateswarlu K - Mid-level Data Scientist specializing in Generative AI, NLP, and MLOps in USA

Mid-level Data Scientist specializing in Generative AI, NLP, and MLOps

USA5y exp
MetLifeHarrisburg University of Science and Technology

Built and deployed an LLM-powered claims-document summarization system (insurance domain) that cut agent review time from 4–5 minutes to under 2 minutes and saved 1,200+ hours per quarter. Hands-on across orchestration and production infrastructure (Airflow retraining DAGs, Kubernetes, SageMaker endpoints, FastAPI) and recent RAG workflows using n8n + Pinecone, with a strong focus on reliability, cost, and explainability for non-technical stakeholders.

View profile
CT

Mid-level AI Engineer specializing in LLMs, MLOps, and healthcare NLP

4y exp
HCA HealthcareUniversity of South Florida

Built a production, real-time clinical documentation system at HCA that converts doctor–patient conversations into structured clinical summaries using speech-to-text, LLM summarization, and RAG. Demonstrated measurable gains from medical-domain fine-tuning (clinical concept recall +18%, ROUGE-L 0.62 to 0.74) while meeting HIPAA constraints via PHI anonymization and encryption, and deployed via Docker/FastAPI with CI/CD and monitoring.

View profile
SN

Mid-level AI/ML Engineer specializing in GenAI, NLP, and financial systems

Texas, USA5y exp
CitibankConcordia University, St. Paul

GenAI/ML engineer with hands-on experience building production financial intelligence and document summarization systems at Citibank. Stands out for combining LLM fine-tuning, hybrid RAG, multi-agent workflows, and strong MLOps/observability practices to deliver measurable business impact, including 60% faster analyst retrieval, 31% higher precision, and 99%+ uptime.

View profile
NN

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

USA4y exp
VibeSeaCalifornia State University, Chico

Software engineer currently building AI-powered backend systems for interview analysis, with end-to-end ownership of an LLM-based monitoring platform. Stands out for combining practical product delivery in an ambiguous early-stage environment with measurable impact: over 40% reduction in manual review effort and roughly 20% lower inference cost.

View profile
Koushik Ravikumar - Mid-level Full-Stack Engineer specializing in cloud-native Java and AI platforms in Kyle, TX

Mid-level Full-Stack Engineer specializing in cloud-native Java and AI platforms

Kyle, TX4y exp
InfosysStevens Institute of Technology

Full-stack engineer with strong cloud and platform experience spanning React, Go, AWS, Kafka, and Terraform. Has led complex migrations from monolithic/containerized systems to microservices and cloud deployments, built compliance-oriented logging infrastructure, and improved a broken frontend codebase to achieve a 3x performance gain while making it easier for other developers to extend.

View profile
SG

Sadhana Guda

Screened

Mid-level Full-Stack AI Engineer specializing in LLM agents and RAG systems

3y exp
General MotorsUniversity of Bridgeport

AI product engineer with hands-on experience shipping enterprise LLM systems at General Motors, including NL2SQL analytics, RAG-based enterprise search, and multi-agent document analysis. Stands out for combining strong technical depth in LangChain/Vertex AI/Pinecone/Redshift with disciplined evals, human-in-the-loop design, and clear business impact such as 70% to 90%+ accuracy gains, 3x analyst throughput, and rapid MVP delivery in 6 weeks.

View profile
YM

Junior AI Engineer specializing in LLM systems and applied machine learning

San Francisco, CA2y exp
LangChainUniversity of the Pacific

Yogesh is an AI/full-stack engineer from LangChain who says he was the sole developer and core maintainer of OpenSWE/OpenSpeed, an asynchronous coding agent in LangSmith Cloud that turns requests from Slack, Linear, and GitHub into reviewable PRs. He emphasizes production-grade agent infrastructure: event-driven workflow design, typed run states, observability, retries, and latency improvements via pre-warmed sandboxes.

View profile
VG

Mid-level GenAI Engineer specializing in LLM fine-tuning, RAG, and MLOps

Glassboro, NJ5y exp
HCLTechRowan University

Healthcare-focused LLM engineer who deployed a production triage and clinical knowledge retrieval assistant using RAG and LangGraph-orchestrated multi-agent workflows. Emphasizes clinical safety and compliance with robust hallucination controls, HIPAA/PHI protections (tokenization, encryption, audit logging, zero-retention), and human-in-the-loop escalation; reports a 75% latency reduction in a healthcare agent system.

View profile
MY

Mid-level AI/ML Engineer specializing in Generative AI and RAG systems

6y exp
Elevance HealthMLR Institute of Technology

Built a production multi-agent orchestration platform to automate healthcare claims and HR workflows, combining LangChain/CrewAI/AutoGPT with RAG (FAISS/Pinecone) and fine-tuned open-source LLMs (LLaMA/Mistral/Falcon) in private Azure ML environments to meet HIPAA requirements. Emphasizes rigorous agent evaluation/observability (trajectory eval, adversarial testing, LLM-as-judge, drift monitoring) and reports measurable outcomes including 35% faster claims processing and 40% fewer chatbot errors.

View profile
VS

Senior AI/ML Engineer specializing in Generative AI, LLMs, and MLOps

Tampa, FL9y exp
VerizonJawaharlal Nehru Technological University

Telecom (Verizon) AI/ML practitioner who built a production multimodal system that ingests messy customer issue reports (calls, chats, emails, screenshots, videos) and turns them into confidence-scored incident summaries with reproducible steps and evidence links. Also built KPI/alarm-to-ticket correlation to rank likely root-cause domains (RAN/Core/Transport), cutting triage from hours to minutes and improving MTTR.

View profile
MP

Meghana P

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMs, and NLP

Illinois, USA5y exp
State FarmSaint Louis University

AI/ML engineer with forensic analytics and healthcare claims experience (Optum), building production LLM/RAG systems to surface context-driven fraud patterns from unstructured claim notes and explain risk to investigators. Strong in large-scale retrieval performance tuning, legacy API integration with reliability patterns (SQS, circuit breakers), and MLOps orchestration on Airflow/Kubernetes with rigorous testing, monitoring, and stakeholder-friendly interpretability.

View profile
HS

Harsha Sikha

Screened

Mid-level AI/ML Engineer specializing in Generative AI and data engineering

Armonk, New York4y exp
IBMSaint Peter's University

IBM engineer who built and deployed a production RAG-based LLM assistant using LangChain/FAISS with a fine-tuned LLaMA model, served via FastAPI microservices on Kubernetes, achieving 99%+ uptime. Demonstrates strong practical expertise in reducing hallucinations (semantic chunking + metadata-driven retrieval) and managing latency, plus mature MLOps practices (Airflow/dbt pipelines, MLflow tracking, monitoring, A/B and shadow deployments) and effective collaboration with non-technical stakeholders.

View profile
AS

Mid-level AI/ML Engineer specializing in Generative AI and production ML systems

United States5y exp
CVS HealthUniversity of Maryland, Baltimore County

At CVS Health, the candidate productionized a RAG-based LLM solution in a regulated healthcare setting, emphasizing reliable data pipelines, LoRA fine-tuning, monitoring, safety guardrails, and A/B testing. They have hands-on experience troubleshooting real-time RAG failures (e.g., chunking/embedding issues) and regularly lead developer-focused demos/workshops while translating technical architecture into business value for stakeholders.

View profile
Ishaan Umesh Mandliya - Mid-Level Full-Stack Software Engineer specializing in AI/ML and cloud-native systems in Los Angeles, CA

Mid-Level Full-Stack Software Engineer specializing in AI/ML and cloud-native systems

Los Angeles, CA3y exp
DevolvedAIUSC

At BondiTech, built and deployed customer-facing backend improvements for enterprise dashboards handling 1M+ records, redesigning a .NET/Entity Framework API with server-side pagination/filtering and feature-flagged rollout to cut latency from ~15s to ~2s. Experienced integrating customer systems into existing APIs, including stabilizing a legacy CRM sync by normalizing inconsistent IDs, handling strict rate limits with batching, and adding DLQs plus reconciliation reporting.

View profile
Srikanth Reddy - Mid-level AI/ML Engineer specializing in GenAI and financial risk & compliance analytics in Plainsboro, NJ

Mid-level AI/ML Engineer specializing in GenAI and financial risk & compliance analytics

Plainsboro, NJ7y exp
State StreetWilmington University

Built and deployed a production LLM-powered financial risk and compliance platform to reduce manual trade exception handling and speed up insights from regulatory documents. Implemented a LangChain multi-agent workflow with structured/unstructured data integration (Redshift + vector DB) and emphasized hallucination reduction for regulatory safety using Amazon Bedrock. Strong MLOps/orchestration background across Kubernetes, Airflow, Jenkins, and monitoring/testing with MLflow, Evidently AI, and PyTest.

View profile

Need someone specific?

AI Search