Vetted Latency Optimization Professionals

Pre-screened and vetted.

Latency Optimization, Python, Docker, CI/CD, SQL, AWS

DurgaPrasad Sakala

Mid-level Full-Stack Engineer specializing in AI-native cloud systems

4y exp

Johnson & Johnson | University at Buffalo

Amazon CloudWatch, Amazon DynamoDB, Amazon EC2, Amazon ECS, Amazon S3, Amazon SNS, +86 more

Anshul Joshi

Screened | References | Strong rec.

Mid-Level Software Engineer specializing in distributed systems and GenAI

Austin, TX | 4y exp

University of Texas at Austin | University of Texas at Austin

“Capgemini engineer with 4+ years building and deploying high-availability, low-latency fraud detection APIs and multi-cluster distributed systems for a Fortune 20 bank, including zero-downtime production rollouts and multi-layer (SQL/network/hardware) performance debugging. Also built a Python + OpenAI/LangChain LLM-powered grading workflow for Austin School for Women, cutting feedback time from 90 minutes to 5 minutes per submission for 200+ learners.”

Java, Python, C++, Go, TypeScript, Spring Boot, +126 more

Mayank Pratap

Screened

Intern Robotics Engineer specializing in autonomous navigation and SLAM

West Lafayette, IN | 1y exp

Nanyang Technological University | Purdue University

“Robotics software engineer with deep ROS2 Humble/Nav2 experience who built an SDF-based navigation system (RRT* global planning + gradient-based local avoidance) and implemented scan-matching localization. Proven real-time performance debugging and optimization on hardware (Unitree B1), including halving compute-cycle latency and resolving ROS2 jitter/message-drop issues through explicit QoS and executor/callback-group design.”

Gazebo, PyTorch, OpenCV, Python, C, MATLAB, +83 more

Lavrenti DeLavrenti

Screened | References | Strong rec.

Director-level Technology Leader specializing in cloud-native platforms, AI/ML, and SaaS

Remote | 15y exp

Alioni Tech Labs | Georgian Technical University

“Engineering leader (Director/VP level) who has repeatedly aligned product and engineering through ROI-driven quarterly roadmaps and strong stakeholder communication, including board presentations. Built a parallel cloud team to migrate an on-prem product to the cloud, credited with delivering $9M ARR, and led a Python monolith-to-serverless event-driven microservices transformation. Currently manages distributed teams across Mexico, India, and the US using pod-based structures, clear KPIs, and a supportive accountability culture.”

API Design, API Gateway, AWS, AWS Lambda, Budget Management, CI/CD, +220 more

Suparshwa Patil

Screened | References | Strong rec.

Mid-level Software Engineer specializing in Agentic AI and RAG systems

Remote, California | 4y exp

One Community | Purdue University

“Built and shipped a production AI-powered Q&A/RAG onboarding assistant at One Community Global that unified knowledge across Notion, Google Docs, and Slack, cutting volunteer onboarding time by 45%. Demonstrates strong end-to-end ownership: LangChain agent orchestration integrated into a FastAPI backend, rigorous evaluation (200-query dataset, ~85% accuracy), and production feedback/monitoring with source-attributed answers to build user trust.”

Python, Java, TypeScript, Go, SQL, FastAPI, +75 more

Abnik Ahilasamy

Screened | References | Moderate rec.

Intern LLM/GenAI Engineer specializing in RAG, agentic systems, and low-latency inference

Chennai, India | 0y exp

Larsen & Toubro | Arizona State University

“Interned at Larsen & Toubro where they built and deployed an agentic RAG document question-answering system to reduce time spent searching documents and improve trustworthiness. Implemented ReAct-style multi-step orchestration with LangChain/LlamaIndex plus evidence-bounded generation, grounding/citations, and rigorous evaluation—cutting latency ~40%, hallucinations ~35%, and unsafe outputs ~40% while collaborating closely with non-technical business/ops stakeholders.”

Python, PyTorch, TensorFlow, C++, SQL, Bash, +153 more

Julian Lee

Screened

Intern Software Engineer specializing in AI/LLMs and full-stack development

New York, New York | 1y exp

Highlight.AI | USC

“AI/ML infrastructure-focused engineer who has built production RAG systems from scratch (Supabase/pgvector + OpenAI embeddings) and iterated using formal eval metrics to improve retrieval quality. Also debugged real-time audio issues in a LiveKit-based pipeline by correlating packet loss with VAD behavior, and has deep experience building brittle, customer-specific financial platform integrations in Python/Playwright (2FA, redirects, token refresh, rate limits).”

Algorithms, API Integration, AWS, AWS Lambda, CI/CD, C#, +152 more

Sathwik Alavala

Screened

Mid-level Data Scientist specializing in AI/ML, MLOps, and LLM-powered analytics

Charlotte, NC | 6y exp

Bank of America | Campbellsville University

“Built and deployed a production LLM-powered document Q&A system enabling natural-language querying of large PDFs, focusing on retrieval quality (overlapped chunking) and low-latency performance (optimized embeddings + vector search). Experienced with scaling ML/LLM workflows using async/batch processing, caching, cloud storage, and orchestration via Apache Airflow with robust testing, monitoring, and failure handling.”

A/B Testing, Anomaly Detection, API Development, AWS, Azure Machine Learning, ChromaDB, +94 more

Sahithi Reddy

Screened

Mid-level Machine Learning Engineer specializing in LLM-powered products

Dallas, TX | 4y exp

Verizon | University of Massachusetts Dartmouth

“Verizon engineer who productionized an LLM-based personalization capability for a customer-facing digital platform, owning the path from success metrics through scalable APIs, A/B validation, and post-launch monitoring (latency/accuracy/drift). Experienced in diagnosing and fixing real-time LLM/RAG workflow issues under peak load, and in enabling adoption via tailored technical demos/workshops and sales support materials.”

Machine Learning, Artificial Intelligence, Deep Learning, PyTorch, TensorFlow, Keras, +110 more

Sai Chatrathi

Screened

Mid-level AI/ML Engineer specializing in healthcare analytics and MLOps

NY, USA | 4y exp

Humana | Syracuse University

“Built and deployed a production LLM-powered lesson adaptation platform for K–12 educators that personalizes content for multilingual and neurodiverse students using RAG and content transformation. Owned the full stack from FastAPI backend and OpenAI integration through reliability/safety controls, latency/cost optimization, and weekly shippable modular APIs, iterating directly with curriculum stakeholders to reduce hallucinations and improve educator trust.”

Python, Pandas, NumPy, Scikit-learn, SQL, TensorFlow, +77 more

Phanindra Kethamukkala

Screened

Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI

USA | 4y exp

GE HealthCare | Franklin University

“LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.”

A/B Testing, Agile, Apache Kafka, Apache Spark, AWS Glue, AWS Lambda, +170 more

Yaoxin Liu

Screened

Intern Full-Stack Software Engineer specializing in real-time web systems

New York, NY | 0y exp

VenuePilot | NYU

“Built and iterated an end-to-end virtual waiting room for a real-time ticketing prototype, making concrete architecture tradeoffs (polling + Redis Pub/Sub) and improving performance post-launch with Redis caching (+30% throughput, -15% p99 latency). Also has hands-on experience building Spark/HDFS ETL pipelines with strong reliability/observability patterns and running disciplined NLP model evaluation loops on review-rating classification.”

Python, Java, JavaScript, TypeScript, SQL, C, +89 more

Koti Sai Venkata Bhargav Edupuganti

Screened

Mid-level AI/ML Engineer specializing in Generative AI and LLMOps

USA | 6y exp

UnitedHealth Group | Kent State University

“Built and deployed a GPT-based RAG enterprise search system for healthcare clinicians, emphasizing low-latency performance and reduced hallucinations while maintaining end-to-end HIPAA compliance. Demonstrates deep applied experience with PHI-safe data governance (detection/redaction/de-identification), secure Azure ML deployment patterns, and orchestration of production LLM workflows using LangChain and Airflow.”

A/B Testing, Agile, AWS, Bash, BigQuery, CI/CD, +131 more

Krishna Kandlakunta

Screened

Mid-level Data Scientist specializing in MLOps, LLM/RAG applications, and deep learning

United States | 5y exp

Citigroup | University of North Texas

“Built and deployed a production compliance automation RAG system (at Citi) that generates citation-backed, schema-validated risk summaries for regulatory document review. Emphasizes regulated-environment reliability with retrieval-only grounding, abstention, confidence thresholds, and immutable audit logging, plus orchestration using LangChain/LangGraph and Airflow. Reported ~60% reduction in compliance review effort while maintaining high precision and traceability.”

A/B Testing, Agile, Anomaly Detection, Apache Hadoop, Apache Hive, Apache Kafka, +167 more

Manichandra Reddy Bethi

Screened

Mid-level GenAI Engineer specializing in production AI agents and evaluation pipelines

Overland Park, Kansas | 5y exp

Minutentag | Wilmington University

“Built and shipped a production LLM-powered internal operations automation platform using LangChain RAG (Pinecone) and FastAPI microservices, deployed on AWS EKS, serving 10k+ daily interactions. Implemented a rigorous evaluation/observability stack (golden datasets, prompt regression tests, MLflow, retrieval metrics, hallucination monitoring) that drove hallucinations below 2% and improved reliability, and partnered closely with non-technical ops leaders to cut manual lookup work by 60%+.”

A/B Testing, Alerting, AWS, AWS Lambda, BERT, CI/CD, +120 more

Ram Kottala

Screened

Mid-level Data & GenAI Engineer specializing in lakehouse, streaming, and RAG platforms

Michigan, USA | 5y exp

Ford | Webster University

“Built a production internal LLM-powered knowledge assistant using a RAG architecture (Python, LLM APIs, cloud services) that answers employee questions with sourced, grounded responses from internal documents. Demonstrates strong practical depth in retrieval tuning (chunking/metadata filters), orchestration with LangChain, and production reliability practices (latency optimization, automated embedding refresh, evaluation metrics, logging/monitoring) while partnering closely with non-technical operations teams.”

Python, PySpark, Scala, Java, R, SQL, +173 more

Samarth Saxena

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and content automation

Los Angeles, CA | 3y exp

Cloud9 | USC

“AI/LLM engineer who built a production autonomous GenAI content ecosystem that generates short-form scripts, extracts viral highlights from long-form video, and dubs content into 33+ languages. Focused on making LLM outputs production-safe via schema enforcement, token-to-time alignment, critic-agent verification, and scalable async orchestration—cutting manual workflows by ~90% and saving $200k+ annually.”

Python, SQL, Scala, TypeScript, Bash, Java, +162 more

Monisha DhanaVijeya

Screened

Junior Software Engineer specializing in AI, backend systems, and AWS cloud

Sunnyvale, CA | 2y exp

LinkedIn | Northeastern University

“Built and shipped a production multi-agent conversational AI platform (Monitor agent + RAG + 4 additional agents) with enterprise REST APIs, using ChromaDB-grounded WCAG knowledge to keep responses accurate while varying tone via personality modes and conversation memory. Has experience at LinkedIn delivering technical demos and pre-sales guidance to both engineering teams and C-level stakeholders, acting as a translator between sales and technical teams to drive adoption.”

Python, Java, C, TypeScript, JavaScript, SQL, +151 more

Aliasgar Zakir Merchant

Screened

Mid-level AI Engineer specializing in multi-agent LLM systems and multimodal tutoring

Boston, United States | 3y exp

Pearson | University of Illinois Urbana-Champaign

“LLM/agentic systems builder who has deployed multi-agent educational chatbots using LangChain + LangGraph, with LangFuse-based tracing and FastAPI hosting. Focused on production reliability and performance (latency reduction via agent decomposition and caching) and on evaluation/testing (routing test scenarios, LLM-as-judge). Partnered with product to add image understanding by parsing and storing images in S3, expanding chatbot coverage to 30+ books with images.”

Python, FastAPI, SQL, LangChain, LangGraph, Redis, +70 more

Vinay Kumar

Screened

Mid-level Backend Software Engineer specializing in Java microservices and AWS

Cincinnati, OH | 3y exp

Amazon | University of Cincinnati

“Backend/distributed-systems engineer (Amazon; also Bank of America) pivoting into robotics software. Built and owned an end-to-end cross-region event processing service for Aurora Global Databases, emphasizing correctness under latency/clock skew, fault tolerance, and strong observability; brings deep Docker/Kubernetes and CI/CD experience to robotics infrastructure and reliability work while ramping up on ROS 2.”

Java, Python, Spring Boot, Node.js, REST APIs, Microservices, +79 more

Varsha Hemakumar

Screened

Mid-level ML/AI Engineer specializing in NLP, RAG pipelines, and financial risk & fraud systems

USA | 3y exp

Finta | University at Buffalo

“Built and shipped LLM/RAG systems in finance and startup settings, including a Goldman Sachs document intelligence platform that indexed ~8TB of regulatory filings and delivered cited, conversational answers with <2s latency—cutting compliance research by ~4.5 hours per batch. Also developed LangChain-based agent workflows at Finta to automate CRM enrichment and investor lookup with strong testing, tracing (LangSmith), privacy guardrails, and auditability.”

Python, R, SQL, MongoDB, Pandas, NumPy, +95 more

Brian Weatherill

Screened

Executive Enterprise Architecture & Cloud Transformation Leader

Lakeland, FL | 20y exp

METRC | Brooklands College

“Technically oriented operator with experience driving a strategic migration to Microsoft Azure to modernize a company toward microservices and CI/CD, improving scalability and positioning for long-term optimization. Evaluates product ideas through an operational lens (efficiency, decision support, process optimization) and emphasizes building viable products with paying customers while maintaining revenue resilience.”

Infrastructure as Code, DevOps, CI/CD, Kanban, JIRA, Microservices, +93 more

Nidhish Rao Bairineni

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and MLOps

5y exp

Wells Fargo | Southern Methodist University

“Built and deployed a production RAG-based internal knowledge assistant that let analysts query company documents in natural language, using LangChain/LangGraph with Pinecone and a FastAPI service for integration. Emphasizes reliability in production through hallucination mitigation (retrieval tuning + prompt guardrails) and measurable evaluation/monitoring (accuracy, latency, task completion, hallucination rate), iterating based on user feedback.”

A/B Testing, Apache Airflow, Apache Kafka, Apache Spark, AWS, AWS Glue, +126 more

Hamidreza Lotfalizadeh

Screened

Mid-level AI/ML Engineer specializing in LLM agents, RAG, and ML systems

Bay Area, CA | 6y exp

Inertia Systems | Purdue University

“At Inertia Systems, built a production LLM-powered ingestion pipeline that converts heterogeneous sources (PDF/JSON/IFC/SQL and financial tables) into standardized text and uses GraphRAG to construct a knowledge graph with verified dependency relationships. Also has hands-on HPC orchestration experience with SLURM, including creating a custom wrapper process manager to improve resource utilization under restrictive scheduling policies.”

Anomaly Detection, Apache Spark, AWS, CI/CD, Classification, Cross-functional Collaboration, +93 more

