Reval Logo

Vetted Observability Professionals

Pre-screened and vetted.

OS

Senior DevOps / Cloud / Site Reliability Engineer specializing in AWS and Kubernetes

United States (Remote)10y exp
Bank of AmericaRutgers University
View profile
VG

Mid-level Full-Stack Engineer specializing in cloud-native microservices and data integrations

5y exp
Johnson & JohnsonPurdue University
View profile
SG

Mid-Level Software Engineer specializing in AI platforms and backend systems

New York, NY6y exp
IndeedNYU
View profile
SS

Executive CTO/VP Engineering specializing in high-performance AI, data systems, and distributed infrastructure

Vancouver, Canada20y exp
Clustera
View profile
JN

JYOTHI N

Screened ReferencesStrong rec.

Senior Data Scientist specializing in analytics, experimentation, and BI on AWS

Austin, TX7y exp
AmazonJawaharlal Nehru Technological University

Data/ML practitioner focused on healthcare data quality and record linkage: analyzed 10M+ records, built anomaly detection and NLP-driven entity resolution, and automated AWS ETL/validation pipelines (Glue/Redshift/Lambda), cutting data errors by 40% and generating $500k in annual savings. Has hands-on experience with embeddings (Sentence Transformers/spaCy), FAISS vector search, and fine-tuning for domain-specific matching.

View profile
LX

Longyang Xu

Screened ReferencesStrong rec.

Junior Full-Stack Software Engineer specializing in cloud microservices and ML-driven products

Quincy, MA1y exp
GraniteCarnegie Mellon University

Backend engineer with hands-on ownership of Python/Flask microservices and recommendation systems across edtech and telecom. Deployed and operated real-time personalization/recommendation platforms on AWS EKS with Jenkins-based CI/CD, GitOps-style declarative configs, and strong observability practices. Has migration experience moving legacy mixed environments to modern containerized Kubernetes and built Kafka pipelines feeding ML services while managing schema evolution.

View profile
MM

Senior Software Engineer specializing in AI/ML backend and cloud infrastructure

Bentonville, AR11y exp
WalmartUniversity of Houston

Backend/data platform engineer with production experience at Walmart and Molina Healthcare, building Python microservices on AWS (EKS + Lambda) for real-time inventory and recommendation systems. Strong in reliability/observability and incident leadership, plus modernizing legacy healthcare workflows and building resilient AWS Glue/PySpark pipelines with schema evolution and data quality controls.

View profile
AP

Ajith P

Screened

Mid-level Backend Software Engineer specializing in AI workflow automation for finance and healthcare

4y exp
Goldman SachsUniversity of Central Missouri

Backend/AI engineer with healthcare domain experience who built a patient journey analytics API (FastAPI/PostgreSQL/Snowflake/Redis) and debugged peak-hour latency down from ~900ms to ~50ms via indexing and query optimization. Shipped an LLM-powered clinical summary/recommendation assistant end-to-end and designed a multi-step risk evaluation agent workflow with safety guardrails against hallucinations and unsafe outputs.

View profile
HM

Junior AI/ML & Cloud Software Engineer specializing in LLM applications

2y exp
Randomwalk.AIUniversity of Illinois Urbana-Champaign

AI engineer (2+ years; pursuing an online MS at UIUC) who has shipped an AI-powered voice screening platform end-to-end on GCP with strong production monitoring and measurable hiring-process impact (80% reduction in unqualified pass-through; ~50+ hours saved per role). Also built and deployed an AWS-based context-aware hybrid search system using OpenSearch as a vector store, and has hands-on experience with multi-agent LLM orchestration (ReAct) and structured-output guardrails.

View profile
VM

Senior Forward Deployed Systems Engineering Leader specializing in AI-native deployments

Oakland, California9y exp
Longshot Space TechnologiesSan Francisco State University

Built and productionized an LLM-enabled system visualization web app at Longshot, designed modularly to pivot quickly to a mobile-friendly interface as customer needs changed. Experienced in diagnosing LLM/agentic workflow failures using observability, deterministic replay, and fault-tree root cause analysis. Also delivers developer-focused demos and trainings (including robotics deployment/mapping at Meta) and partners with sales as a technical closer, including for government clients by demonstrating failure modes and system modularity.

View profile
OO

Senior DevOps/DevSecOps Engineer specializing in AWS & Azure cloud infrastructure

Fairfax, VA10y exp
Technatomy Digital SolutionUniversity of Lagos

Infrastructure/DevOps-focused engineer working across Linux-based enterprise platforms that include IBM Power/AIX in a broader OpenShift/Kubernetes and cloud ecosystem. Built Azure DevOps CI/CD for containerized deployments and resolved a production deployment failure by tracing ImagePullBackOff to outdated registry credentials in Kubernetes secrets. Uses Terraform (with modular structure) plus Ansible to provision and standardize production environments with pipeline-based validation.

View profile
SZ

Siliang Zhang

Screened

Intern Machine Learning Engineer specializing in LLMs, RAG, and vision-language systems

Shanghai, China2y exp
CarizonUSC

Robotics ML/software engineer focused on Vision-Language-Action control for 7-DoF robots, replacing tokenized action decoding with continuous regression heads (including a logit-weighted expectation approach) to improve stability and real-time behavior. Strong in ROS1/ROS2 systems integration and debugging closed-loop manipulation issues via latency instrumentation, QoS-aware distributed messaging, and sim-to-real validation using Gazebo/Unity, Docker, and CI pipelines.

View profile
MN

mahesh narne

Screened

Senior Full-Stack Software Engineer specializing in cloud-native microservices and web apps

San Jose, CA3y exp
PayPalUniversity of Central Missouri

Backend-focused engineer building customer support/order-tracking platforms with Java 17/Spring Boot microservices and a React/TypeScript frontend. Deep experience running event-driven systems on Kubernetes (Kafka, Redis, MySQL) with strong observability (Prometheus/Grafana/Splunk), SLOs, and safe deployment practices (feature flags, canaries). Also built an internal monitoring/debugging dashboard that consolidated metrics and logs for on-call engineers and was adopted by other teams to speed incident response.

View profile
VV

vishal varma

Screened

Mid-level Generative AI Engineer specializing in LLMs, RAG, and MLOps

6y exp
CVS HealthUniversity of Bridgeport

Built and deployed a production RAG-based LLM Q&A and summarization platform for internal documents, emphasizing grounded answers with structured prompting and citations to reduce hallucinations. Experienced orchestrating end-to-end LLM workflows with LangChain plus cloud pipelines (Azure ML Pipelines, AWS), and runs iterative evaluation using both metrics (accuracy/hallucination/latency/cost) and real user feedback to drive reliability.

View profile
AK

Alp Komban

Screened

Junior Machine Learning Engineer specializing in computer vision for medical imaging

Mountain View, CA2y exp
Smartlens Inc.Cornell University

Applied ML/LLM practitioner working in healthcare-facing products, using RAG and LoRA fine-tuning on medical data and implementing production monitoring (confidence scoring) for clinician oversight. Has hands-on experience debugging agentic/LLM pipelines (including OCR preprocessing fixes) and regularly delivers technical demos to doctors, investors, and conferences—contributing to adoption and even helping close a funding round through end-to-end pipeline walkthroughs.

View profile
HC

Hongxi Chen

Screened

Intern Software Engineer specializing in distributed systems and backend infrastructure

Beijing, China0y exp
Chinese Academy of SciencesUniversity of Nottingham

Backend engineer with deep experience building event-driven logistics systems (orders, warehouse execution, real-time delivery tracking) using Spring Boot/PostgreSQL/Redis and strong observability (Prometheus/Grafana). Led a zero-downtime migration from monolithic MySQL to a sharded architecture for ~2M users with dual-write, checksum validation, and fast auto-rollback, and has strong security expertise including PostgreSQL RLS for multi-tenant SaaS and robust OAuth/JWT handling.

View profile
AG

Ayush Gupta

Screened

Mid-level AI Engineer specializing in Agentic AI and Generative AI

6y exp
GeolabeDuke University

Built and deployed a live LLM-powered platform that takes a LinkedIn job URL + resume and generates job-specific resumes and personalized outreach at scale, with production-grade logging/monitoring/retries on Vercel + Railway. Experienced with agent orchestration (AWS Bedrock/Strands, LangGraph, CrewAI) and rigorous AI workflow testing, plus stakeholder-facing prototypes like data lineage/metadata and NL-to-SQL + dashboard generation.

View profile
HH

Mid-level Applied AI Engineer specializing in ML systems, MLOps, and industrial analytics

Toronto, Canada5y exp
FreelanceUniversity of Waterloo

Industrial AI/ML practitioner with experience deploying real-time monitoring and anomaly detection in a regulated Sanofi vaccine manufacturing facility, including root-cause workflows, logging/alerting, and SOP-aligned validation—achieving ~90% faster anomaly detection. Also built Python/NLP-style automation to accelerate instrumentation & control documentation (~40% faster) and delivered end-to-end predictive analytics for an agri-food operations/distribution client using close operator and leadership feedback loops.

View profile
AD

Aarati Dulal

Screened

Senior Full-Stack Java Engineer specializing in cloud-native microservices

Dallas, TX6y exp
Goldman SachsAvila University

Backend/platform engineer who owned high-volume Java/Spring Boot microservices on AWS (Kafka + RDS/DynamoDB) and has hands-on experience debugging complex production latency incidents across DB, JVM/GC, and async consumers. Also shipped applied AI features for ops, including an LLM-powered log analysis assistant and an incident-response agent with strong safety guardrails (schema-validated tool use, retries/backoff, and human-in-the-loop escalation).

View profile
AP

Akash Patil

Screened

Mid-Level Software Engineer specializing in backend systems and LLM/RAG applications

5y exp
IntuitNorthern Illinois University

Backend/AI engineer at Intuit who built a production AI-powered case assistant for support agents (FastAPI on AWS EKS) combining Postgres case data, OpenSearch retrieval with embedding reranking, and internal LLMs. Improved peak-season reliability by diagnosing P95/P99 timeout spikes and cutting P95 latency from ~800ms to <400ms via composite indexing, keyset pagination, connection pool tuning, and caching, while adding grounded-generation guardrails (evidence packs, confidence thresholds, fallbacks, human-in-the-loop).

View profile
AM

Senior Data Analyst specializing in audit analytics, automation, and financial data platforms

Malvern, PA6y exp
VanguardNYU

Full-stack engineer with strong Next.js App Router + TypeScript experience who built and owned a production internal analytics dashboard end-to-end, including server-component data fetching, route handlers for secure proxying, and post-launch monitoring/caching fixes. Also designed Postgres data models and performance-tuned analytics queries, and built reliable BullMQ/Redis-based order-fulfillment workflows with idempotency, retries, and compensating refunds—comfortable operating with high ownership in early-stage teams.

View profile
HS

Haider Shah

Screened

Principal AI/ML Architect specializing in GenAI, LLMs, RAG, and Agentic AI

California, USA13y exp
PineconePreston University

FinTech/AI engineer who has shipped an end-to-end discrepancy-detection product for financial managers using Next.js, FastAPI/GraphQL, Pinecone, and AWS (with dev/staging/prod, observability, A/B testing, and documentation). Also built an AI-native “AI Genesis” system with agentic cyclic workflows, routing, and tool use, and has experience modernizing legacy systems via the strangler fig pattern while coordinating with senior stakeholders on a 5G autonomous simulation platform.

View profile

Need someone specific?

AI Search