Reval Logo
Home Browse Talent Skilled in Latency Optimization

Vetted Latency Optimization Professionals

Pre-screened and vetted.

Latency OptimizationPythonDockerCI/CDSQLAWS
SR

Shivam Ratnani

Junior Software Engineer specializing in full-stack and AI/LLM products

2y exp
Expert.aiUniversity of Wisconsin–Madison
PythonJavaScriptTypeScriptJavaC#C+91
View profile
DS

Divya Sree G

Mid-level Generative AI Engineer specializing in LLMs, RAG, and semantic search

McLean, VA4y exp
Capital OneJawaharlal Nehru Technological University
AWSCI/CDClusteringData ingestionData pipelinesData preprocessing+75
View profile
ES

Eswar Sai Yemprala

Mid-Level Full-Stack Software Engineer specializing in cloud-native FinTech systems

Remote, USA5y exp
Capital OneUniversity of South Dakota
JavaPythonJavaScriptTypeScriptSQLC+80
View profile
NB

Nitin Bojji

AI/ML Intern specializing in LLMs and Vision-Language Models

Princeton, NJ2y exp
SiemensBoston University
API GatewayAuthenticationAWSAWS Step FunctionsCI/CDCode Reviews+94
View profile
AK

AJAY KUMAR NADIDE

Senior Software Engineer specializing in cloud-native Java microservices

Westlake, TX9y exp
Fidelity InvestmentsJawaharlal Nehru Technological University, Hyderabad
JavaSpring BootSpring MVCSpring SecurityHibernateREST+71
View profile
VB

Vamsi bhavana

Mid-level Backend Engineer specializing in JavaScript/TypeScript runtimes and OSS developer experience

Birmingham, AL6y exp
O'Neal IndustriesTrine University
JavaScriptTypeScriptNode.jsPythonAPI DesignREST APIs+218
View profile
SK

Shreya Kalyanaraman

Junior Robotics Engineer specializing in reinforcement learning and robot manipulation

College Park, MD1y exp
Maryland Robotics CenterUniversity of Maryland, College Park
PythonC++ROS 2GitLinuxPyTorch+72
View profile
VV

Vijay Vignesh Vijay Vignesh

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and computer vision

Zeeland, MI6y exp
GentexAnna University
AlgorithmsAmazon SageMakerAWSAWS LambdaAzure DevOpsBlender+68
View profile
SS

Sri Sai Durga Katreddi

Mid-level AI Engineer specializing in production LLM, RAG, and agentic AI systems

6y exp
Bank of America
A/B TestingAnomaly DetectionAnsibleArgo CDAudit LoggingAWS+217
View profile
SP

Saikrishna Paila

Screened

Junior AI Engineer specializing in RAG pipelines and agentic AI systems

San Francisco, CA2y exp
Avenio CorporationGeorge Washington University

“Built and shipped production RAG/agentic systems in high-stakes domains (biomedical and legal), including an enterprise biomedical document retrieval platform over ~10k scientific docs and a multilingual African-law assistant at the World Bank. Deep hands-on experience with LangChain/LangGraph/LlamaIndex and evaluation tooling (LLM-as-a-judge, safety/hallucination detection), with measurable gains in retrieval quality and hallucination reduction.”

PythonPyTorchTensorFlowHugging Face TransformersFastAPIDjango+81
View profile
RP

Ruudra Patel

Screened

Junior Data Scientist specializing in ML, LLMs, and RAG applications

Atlanta, GA3y exp
Georgia State UniversityGeorgia State University

“University hackathon finalist (2nd place) who built CareerSpark, a production-style multi-agent career guidance app in 24 hours using a hierarchical debate architecture with a moderator/judge agent. Has startup internship experience at LiveSpheres AI using LangChain for multi-LLM orchestration, and demonstrates a structured approach to testing/evaluation (golden sets, integration sims, latency/accuracy KPIs) plus strong non-technical stakeholder communication.”

PythonSQLRJavaJavaScriptReact+112
View profile
AM

Ankita Mungalpara

Screened

Mid-level Data Scientist specializing in Generative AI and multimodal systems

Irving, TX5y exp
University of Massachusetts DartmouthUniversity of Massachusetts Dartmouth

“Recent J&J intern who built a conversational RAG agent and led a shift from a monolithic model to a modular RAG workflow, cutting response time from several days to under a second by tackling data fragmentation, context retention, and embedding/latency optimization. Also worked on a large (7B-parameter) multimodal VQA pipeline for healthcare research and stays current via NeurIPS/ICLR and open-source contributions.”

A/B TestingAmazon BedrockAmazon EC2Amazon RDSAmazon RedshiftAmazon S3+154
View profile
AK

Akshay Khan

Screened

Mid-level Backend Software Developer specializing in cloud-native microservices

USA5y exp
American ExpressUniversity of Central Missouri

“Backend engineer with American Express experience maintaining an internal Python/Flask rewards simulation microservice used by product analysts and QA. Demonstrated strong performance and scalability work: moved batch simulations to Celery, added Redis caching to cut DynamoDB latency, and tuned Postgres/SQLAlchemy queries with EXPLAIN ANALYZE and composite indexes (bringing API responses under ~200ms by queueing jobs). Also has experience integrating ML via Flask-based model-serving APIs (scikit-learn/LightGBM packaged with joblib) and designing multi-tenant data isolation and tenant-specific configuration systems.”

JavaPythonJavaScriptSQLShell ScriptingSpring Boot+165
View profile
RS

Ramya Sree Kanijam

Screened

Junior AI/ML Engineer specializing in RAG systems and cloud-native MLOps

Austin, TX2y exp
UpstartTexas A&M University-Corpus Christi

“Built and shipped a production LLM-powered RAG system at Upstart enabling natural-language search across 50k+ scattered internal technical docs. Delivered sub-300ms p95 latency for ~50 active users with strong hallucination safeguards (retrieval-first, thresholds, citations) plus robust testing/monitoring and cost controls (prompt caching cutting API spend ~20%).”

PythonJavaRetrieval-Augmented Generation (RAG)LangChainPrompt EngineeringVector Search+149
View profile
OP

Ojasmitha Pedirappagari

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms

Jersey City, NJ5y exp
Nurture HoldingsUC Santa Cruz

“Built and shipped a production RAG-based assistant that lets parents ask natural-language questions about their child’s learning progress, using pgvector retrieval (child-id filtered) and Redis caching to hit ~180ms latency. Implemented real-world guardrails and compliance (Llama Guard, COPPA, retrieval thresholds, fallbacks) with 99.5% uptime, and ran human-in-the-loop eval loops that improved satisfaction from 3.8 to 4.2 while serving 60k+ monthly users and reducing costs significantly.”

PythonSQLC#TypeScriptJavaScriptAWS+83
View profile
BS

BHEEMA SABILLA

Screened

Mid-level Data Engineer specializing in Lakehouse, Streaming, and ML/LLM data systems

Remote, USA3y exp
DiscoverUniversity of South Dakota

“Built and productionized an enterprise retrieval-augmented generation platform for internal knowledge over large unstructured corpora, emphasizing trust via strict citation/grounding and hybrid retrieval (BM25 + FAISS + cross-encoder re-ranking). Demonstrates strong scaling and cost/latency optimization through incremental indexing/embedding and index partitioning, plus disciplined evaluation/observability practices. Has experience operationalizing pipelines with Airflow/Databricks/GitHub Actions and partnering closely with risk & compliance stakeholders on auditability requirements.”

PythonPySparkSQLScalaPandasNumPy+157
View profile
SP

SASI PAILA

Screened

Mid-level AI/ML Engineer specializing in Generative AI and production ML systems

PA, USA4y exp
BNY MellonFranklin University

“Built and deployed a production SecureAIChatBot (RAG-based) for secure internal information retrieval, using embeddings/vector search, GPT models, monitoring, and safety filters. Focused on real-world production challenges like latency and output consistency, applying caching, retrieval scoping, smaller models, and controlled prompting, and used LangChain to orchestrate the end-to-end workflow.”

CI/CDCross-Functional CollaborationData AnalyticsDockerDocumentationEmbeddings+56
View profile
CT

Chethan Thimapuram

Screened

Mid-level AI/ML Engineer specializing in LLM systems, RAG, and MLOps

5y exp
HCA HealthcareUniversity of South Florida

“Built a production, real-time clinical documentation system at HCA that converts doctor–patient conversations into structured clinical summaries using speech-to-text, LLM summarization, and RAG. Demonstrated measurable gains from medical-domain fine-tuning (clinical concept recall +18%, ROUGE-L 0.62 to 0.74) while meeting HIPAA constraints via PHI anonymization and encryption, and deployed via Docker/FastAPI with CI/CD and monitoring.”

Amazon CloudWatchApache AirflowApache KafkaApache SparkAWS GlueAWS IAM+125
View profile
GP

Gowri Priya Gorla

Screened

Junior Robotics & Embedded Software Engineer specializing in Linux-based distributed robotic systems

Minnesota, United States3y exp
Quantronic CorporationConcordia University

“Robotics software engineer focused on system-level C++/Linux stacks for multi-robot platforms, owning the communication layer and validation/testing infrastructure. Built Python simulation/replay and fault-injection tooling integrated with Docker + GitLab CI/CD, and debugged real-time localization issues by instrumenting IPC timing and refactoring multi-threaded pipelines for deterministic performance.”

C++PythonRoboticsROS 2LinuxDistributed Systems+70
View profile
1...111213...22

Related

Software EngineersMachine Learning EngineersData ScientistsAI EngineersResearch AssistantsSoftware DevelopersAI & Machine LearningEngineeringEducationData & Analytics

Need someone specific?

AI Search