Vetted Observability Professionals

Pre-screened and vetted.

Henry Wu

Screened

Mid-level Software Engineer specializing in backend, cloud infrastructure, and AI systems

Baltimore, MD | 4y exp
Johns Hopkins University | Johns Hopkins University

Built and launched a production self-healing MLOps agent that autonomously diagnosed and fixed model training failures on Kubernetes GPU infrastructure. Combines deep AI infrastructure knowledge with full-stack product ownership, and has delivered measurable impact including 35% less infrastructure waste, nearly 50% less troubleshooting time, and 60% lower LLM API costs.

Kiran Kumar

Screened

Mid-level Software Engineer specializing in Java microservices and GenAI automation

USA | 4y exp
Airbnb | Auburn University at Montgomery

Software engineer (4+ years) with hands-on production GenAI experience: built an AI incident triage assistant that summarizes production logs for on-call engineers and iterated it using real incident metrics (time-to-signal, triage duration). Also shipped a RAG-based customer support knowledge assistant using embeddings + vector retrieval with strong guardrails (relevance thresholds/abstain, sanitization, auditing) and a formal eval loop (500-query gold set) that drove measurable retrieval improvements.

SM

Mid-level Full-Stack Developer specializing in Java/Spring Boot, React, and cloud-native AI automation

Long Beach, CA | 3y exp
Uber | California State University

Software engineer focused on reliability and scalable systems: built React/TypeScript dashboards backed by Java/Spring Boot APIs and designed Kafka-based microservices with strong contract/versioning discipline. Known for shipping incremental improvements with tight feedback loops and for creating internal observability tools that streamline on-call and incident diagnosis under high-traffic conditions.

Harish Gaddam

Screened

Mid-level AI/ML Engineer specializing in LLM agents and RAG systems

Dallas, TX | 5y exp
Verizon | University of Texas at Arlington

LLM/agentic systems builder at Verizon who deployed a LangGraph-orchestrated multi-agent ticket-automation platform with RAG (FAISS) to replace brittle rule-based bots. Improved routing correctness by ~30–40%, hit ~300ms latency targets via model routing, and reduced ops workload by ~60% through tight iteration with non-technical stakeholders and strong testing/observability practices.

JM

Mid-level Full-Stack Software Engineer specializing in Healthcare IT and FinTech

Dallas, TX | 6y exp
Johnson & Johnson | University of North Texas

Full stack engineer in the financial/thematic investing domain who built end-to-end applications on AWS. Notably redesigned a slow portfolio analytics workflow by offloading heavy computations to scheduled AWS Lambda jobs and caching results in DynamoDB (TTL), cutting API latency from ~5 seconds to under 300ms while supporting data-heavy daily market processing.

AG

Junior Cloud/DevOps Engineer specializing in Kubernetes, Terraform, and multi-cloud customer engineering

New York, NY | 2y exp
Seqera | University of Waterloo

Solutions Engineer focused on application and platform security for enterprise cloud-native deployments, advising customers on threat modeling and secure CI/CD practices across AWS and Kubernetes. Has implemented SCA/container scanning and vuln checks in pipelines, tuned thresholds to reduce false positives, and driven outcomes like faster security approvals and smoother production rollouts. Troubleshot high-load Kubernetes failures (OOMKills, registry throttling) and turned fixes into a standard tuning guide.

VS

Mid-Level Software Engineer specializing in LLM agents and real-time data streaming

8y exp
Amazon | Rutgers University–New Brunswick

Software engineer with experience at Striim and Amazon who ships end-to-end production systems across UI, backend, ML, and operations. Built a real-time PII detection capability for a streaming data platform by integrating Python ML inference into a Java monolith via gRPC sidecars, achieving ~3M events/hour throughput and ~93% accuracy, and helped drive enterprise adoption (Fiserv, CVS). Also modernized internal Amazon tooling for multi-region scale with modularization and fully automated deployments.

AS

Mid-level Java Full-Stack Developer specializing in cloud microservices

USA | 4y exp
Paychex | Trine University

Backend/platform engineer with payroll domain depth who built high-volume payroll processing microservices (Java/Spring Boot, Kafka, PostgreSQL, Redis) on AWS Kubernetes and debugged major peak-cycle latency by redesigning transaction boundaries and moving to async Kafka processing (>50% latency reduction). Also shipped an LLM-powered HR assistant using RAG with strong security/guardrails (RBAC, PII masking, audit logs) that cut support tickets by 40%, and designed reliable multi-step agent workflows with retries, circuit breakers, and idempotency.

Supritha V

Screened

Senior Backend Software Engineer specializing in financial workflow automation

San Francisco, CA | 4y exp
PayPal | University of Central Missouri

Backend/AI workflow engineer with PayPal experience building workflow-driven financial compliance systems (Python/Java, Postgres, AWS/EKS) at thousands of executions/day. Has shipped production LLM-powered document extraction with strict schema/rule validation, auditability, and human-in-the-loop fallbacks, and has deep expertise in reliability (idempotency, locking, state machines) and Postgres performance tuning.

PN

Mid-level Full-Stack Engineer specializing in scalable APIs, cloud infrastructure, and GenAI apps

San Francisco, CA | 6y exp
DoorDash | Cal State Chico

Backend/platform engineer with experience across edtech, logistics, and AWS internal systems—owned a production course recommender end-to-end (model serving + APIs + caching/observability), delivering +30% CTR and -20% latency. Has scaled real-time delivery visibility/rerouting on Kubernetes/EKS to sub-200ms P95 during demand spikes and built billion-events/day telemetry pipelines on AWS (Kinesis Firehose, Lambda, S3, Redshift) with schema evolution, dedupe, and replay support.

Akshay Koneti

Screened

Mid-Level Full-Stack Software Engineer specializing in AWS cloud and microservices

Dallas, TX | 6y exp
Amazon | University of North Texas

Backend/LLM engineer who built a production-critical Amazon Bedrock + RAG correction and compliance layer for employee communications, integrating tightly with existing Spring Boot/AWS microservices to reduce manual review while keeping outputs explainable and auditable. Also designed an event-driven system processing 10M+ events/day (SQS/Lambda/DynamoDB/Elasticsearch) and handled on-call incidents with strong observability and reliability patterns (idempotency, retries, hotspot mitigation).

Tanveer Nazir

Screened

Senior Cloud & DevOps Engineer specializing in enterprise cloud automation and Kubernetes

Remote, NY | 11y exp
Bank of America | College of Staten Island, CUNY

Infrastructure/DevOps engineer with primary ownership in enterprise Linux and AWS/Azure production environments (including financial systems). Built secure, repeatable CI/CD pipelines deploying containerized workloads to EKS/ECS and implemented Terraform/CloudFormation IaC with drift detection and rollback practices; lacks direct IBM Power/AIX/PowerHA experience.

RK

Mid-level AI/ML Engineer specializing in Generative AI, Conversational AI, and RAG systems

NJ, USA | 4y exp
Scale AI | Rowan University

Built and shipped a production enterprise RAG knowledge assistant that returns grounded, cited answers and uses confidence-based fallbacks (clarifying questions/abstention) with monitoring and compliance controls for sensitive data. Implemented end-to-end agent orchestration (function calling, structured JSON, state, retries/rate limits) plus eval/feedback loops, and achieved a reported 30–40% improvement in knowledge-task completion time while reducing hallucinations via retrieval improvements.

Lavanya Chilakalapudi

Mid-level Full-Stack Developer specializing in cloud-native web apps and APIs

Tampa, FL | 5y exp
Databricks | University of South Florida

Backend engineer with experience building microservice-based systems that integrate LLM workflows (code review suggestions, documentation generation, test scaffolding) using REST APIs, Celery/Redis, and OpenTelemetry for observability. Demonstrates hands-on database and performance optimization in PostgreSQL/SQLAlchemy (bulk inserts, lock mitigation, cursor-based pagination) plus multi-tenant data isolation via tenant-aware models, middleware scoping, and schema/row-level strategies.

Abhijoy Sarkar

Senior AI Engineer specializing in LLMs, agentic systems, and MLOps

San Francisco Bay Area, CA | 8y exp
Flipkart | IIT Ropar

Built and shipped PromptGuard, a production middleware proxy that secures GenAI RAG/agent systems against prompt injection and unsafe tool use using risk scoring, graded policy actions, and least-privilege tool gating. Also replaced LangChain abstractions with a custom state-machine runner for a production voice agent to reduce latency and improve traceability, and delivered a clinic call assistant by converting front-desk/doctor requirements into scenario-based guardrails and measurable evals.

Deepika Gotla

Screened

Senior Technical Support Engineer specializing in Azure Cloud & Generative AI

Bellevue, WA | 7y exp
Microsoft | SUNY New Paltz

Microsoft cloud/infra engineer with 5+ years supporting enterprise Azure environments, specializing in security-focused networking (private endpoints, DNS) and production troubleshooting across Azure Front Door/App Gateway WAF/AKS. Has implemented posture improvements via Defender for Cloud, Azure Policy, and RBAC tightening, and also designs secure AWS agent/scanner integrations and observability-enabled SDK rollouts built on EKS, GitHub Actions, and Secrets Manager.

Shobana Chandrasekaran

Mid-Level Software Engineer specializing in AI microservices and generative fashion

Sunnyvale, CA | 2y exp
The Fword.ai | USC

Backend/AI workflow engineer at a startup building production AI services for fashion workflows, including an AI-powered techpack generation API in Go (Gin) with MongoDB handling ~1k+ daily requests. Most recently has been implementing an image-to-3D dress generation feature end-to-end, integrating a Python FastAPI AI service with ComfyUI + Hunyuan, with strong emphasis on async orchestration, webhooks, and observability (OpenTelemetry + SigNoz).

Shriya Bannikop

Mid-level Software Engineer specializing in cloud platforms, data engineering, and distributed systems

Seattle, WA | 5y exp
Amazon Web Services | KLE Technological University

Full-stack engineer who built and owned an AI-assisted job-matching dashboard in Next.js App Router/TypeScript, keeping LLM logic server-side and improving performance via deduplication, caching/revalidation, and streaming (35% fewer duplicate LLM calls; 40% faster first render). Also has strong data/backend chops: designed Postgres models and optimized queries at million-record scale (1.8s to 120ms) and built durable AWS multi-region telemetry workflows with idempotency, retries, and monitoring.

Vidhi Upadhyay

Senior Software Engineer specializing in AI/ML, computer vision, and cloud-native systems

Remote | 8y exp
Saayam for All | Carnegie Mellon University

Independently built a production-grade, containerized enterprise agentic AI platform (stateful orchestration + RAG) focused on real-world reliability—guardrails, citation-based outputs, reranking, query rewriting, and evaluation harnesses to reduce hallucinations. Hands-on with OpenAI SDK, CrewAI, and LangGraph, and has delivered AI solutions for non-technical NGO stakeholders via demos and practical POCs.

Antara Bhavsar

Mid-level Software Engineer specializing in cloud-native systems and Android development

Bloomington, IN | 3y exp
Indiana University | Indiana University Bloomington

Application-focused software engineer with experience at Amazon and Motorola shipping production systems ranging from developer monitoring/on-call tooling (Alcazar, ~40% MTTR improvement) to consumer AI features used by 100K+ users. Currently building an AI/ML-driven platform with a Python/FastAPI backend on AWS (ECS/RDS/S3) and has handled real production latency/scaling incidents end-to-end.

Jignesh Desai

Screened

Director of Software Engineering specializing in fraud detection and payments platforms

San Francisco Bay Area, California, USA | 26y exp
Visa | Kennesaw State University

Engineering manager/player-coach who focuses on laying technical foundations and improving team execution: kickstarted Visa standard UI library adoption, standardized API exception handling, and introduced a test automation framework that enabled a shift from quarterly to monthly releases. Led a phased latency/throughput initiative that doubled throughput, and owned a production incident caused by a dependency’s expired certificate, driving monitoring and certificate-expiry notification improvements across dependent systems.

Krishna Guda

Screened

Principal Product Engineer specializing in FinTech platforms, experimentation, and AI workflows

Mountain View, CA | 24y exp
Credit Sesame | BITS Pilani

Fintech product engineer working on a large-scale credit monitoring platform (tens of millions of users) with deep experience in regulated banking integrations, PII security, and step-up/MFA flows. Has shipped customer-facing React/TypeScript experiences driven by Optimizely experimentation and built reliable partner-facing microservices/SDKs on AWS, including resolving production traffic loss caused by edge security (DataDome/CAPTCHA) conflicts with payment providers.

BC

Mid-level GenAI Engineer specializing in RAG, LLMs, and enterprise AI

4y exp
Cardinal Health | Rivier University

Built and shipped production LLM agents that automate document processing and decision workflows, with a strong focus on reliability, guardrails, and measurable business impact. Stands out for combining RAG, tool calling, evals/monitoring, and ERP integration to deliver 30-35% manual effort reduction and higher throughput without additional headcount.
