Vetted Observability Professionals

Pre-screened and vetted.

Akhil Kunala - Mid-level Software Engineer specializing in backend systems and cloud-native FinTech in Seattle, WA

Akhil Kunala

Screened

Mid-level Software Engineer specializing in backend systems and cloud-native FinTech

Seattle, WA5y exp
AmazonUniversity of North Texas

Amazon engineer with 5+ years of experience who built an AI-assisted log investigation and triage workflow that cut debugging time by about 30% during on-call incidents. Combines observability tooling like CloudWatch and Splunk with Python, prompt engineering, and RAG-based diagnostics, and has practical experience orchestrating agentic AI workflows with a strong human-in-the-loop reliability focus.

View profile
AN

Abhay Naik

Screened

Mid-level Data Engineer specializing in cloud-native analytics and enterprise integrations

Remote3y exp
The GrooveUC Berkeley

Built and productionized an LLM-powered clinical assistant at a healthcare startup, re-architecting a prototype into a robust RAG system on AWS with guardrails, citations, monitoring, and automated tests for clinical reliability. Works closely with clinicians to convert workflow feedback into evaluation criteria and iterative system improvements, and has hands-on experience debugging agentic systems in real time (including during live client demos).

View profile
AS

Staff-level Software Engineer specializing in identity, access management, and platform security

Grapevine, TX4y exp
PaycomRice University

Backend engineer focused on scalable, security-first platform architecture—recently built an end-to-end centralized access-control system that launched successfully with ~50k early adopters and was designed to support ~10x traffic growth. Experienced in production authn/authz (token verification, handoff/session migration), and in de-risking migrations via feature flags, phased rollouts, A/B testing, and Splunk-based monitoring.

View profile
PK

Junior Software Engineer specializing in full-stack systems and distributed log analytics

Miami, FL1y exp
NeocisCarnegie Mellon University

CMU candidate with hands-on experience taking LLM concepts from research prototypes toward production-ready designs (structured outputs, guardrails, failure-scenario evaluation). Also partnered with sales/customer teams at Mazecare to drive adoption with Dontia Alliance (largest dental clinic chain in Singapore) and engaged Singapore government stakeholders, bridging clinical workflow needs with IT security/integration concerns.

View profile
RK

Rutuja Kawade

Screened

Mid-level Software Engineer specializing in cloud infrastructure and distributed systems

Atlanta, GA3y exp
RakutenGeorgia Tech

Cloud infrastructure/product engineer with end-to-end ownership of cloud-native storage/observability products, including taking an internal CMS to Google Cloud Marketplace and scaling to ~40,000 deployments. Strong in Kubernetes-based platforms (Operators, microservices, RabbitMQ) and performance/scalability work (e.g., 200% cluster capacity increase) plus internal tooling that materially improved SRE/QA debugging and release velocity.

View profile
CS

Intern Data Scientist specializing in generative AI and forecasting

San Francisco, CA5y exp
Aurora AIUniversity of Chicago

ML/NLP practitioner working across healthcare and business/finance use cases: currently fine-tuning a domain-specific Llama 3.1 model for safe reasoning over EHRs/clinical notes using RAG + RL/DPO and RAGAS-based evaluation. Has built UMLS-driven entity normalization pipelines with quantified quality gains and developed embedding/vector-DB systems (FAISS) for semantic matching and forecasting/recommendation applications at Aurora AI and Banxico.

View profile
GV

Mid-level Full-Stack Software Engineer specializing in Java/Spring Boot and Angular

Frisco, TX5y exp
CiscoPurdue University

Full-stack engineer with Cisco supply-chain and Wipro internal platform experience, focused on customer-facing UI performance and secure backend services. Built a bulk Excel inventory upload feature (Spring Boot/Apache POI) that cut manual effort ~80%, and delivered high-scale Angular/React dashboards with strong reliability/observability (FastAPI, JWT, Docker, AWS, AppDynamics).

View profile
PS

Palak Siroya

Screened

Senior Site Reliability Engineer specializing in Azure cloud reliability and data analytics

Renton, WA10y exp
MicrosoftCentral Washington University

AppSec-focused customer advisor with hands-on experience integrating SAST/DAST/SCA into production CI/CD (Azure DevOps) and designing secure agent/scanning deployments in AWS (least-privilege IAM, private subnets, VPC endpoints). Demonstrates strong incident troubleshooting using logs/metrics/traces to diagnose load-related failures (timeouts/retry storms) and drive durable fixes, while tailoring risk/tradeoff communication across engineering, security, and leadership stakeholders.

View profile
JS

Mid-Level Software Engineer specializing in full-stack systems and developer tooling

Austin, TX3y exp
AppleCollege of the Sequoias

Built and productionized an AI extension for JetBrains IDEs providing coding assistance, testing, security sweeps, and documentation generation using both an internal LLM and third-party models (e.g., Gemini, Claude). Experienced in diagnosing customer issues in real time (Slack) with structured follow-through (GitHub Issues) and driving adoption through developer-oriented walkthroughs and video demos.

View profile
RM

Rakesh Munaga

Screened

Mid-level Full-Stack Engineer specializing in AI and FinTech platforms

TX, USA4y exp
JPMorgan ChaseUniversity of Texas at Arlington

Full-stack engineer building real-time internal banking operations dashboards (Java/Spring Boot microservices + React/TypeScript) with Kafka-based streaming and post-launch performance optimizations. Also shipped a production internal AI support assistant using RAG (Confluence/PDF/support docs ingestion, embeddings + vector DB retrieval) with guardrails, evaluation loops, and observability to reduce hallucinations and prevent regressions.

View profile
VD

vikhyath D

Screened

Mid-Level Software Development Engineer specializing in distributed microservices on AWS

Dallas, TX5y exp
AmazonUniversity of North Texas

LLM/agent engineer who has shipped multiple autonomous, multi-step agents to production (document-to-SOP conversion, test generation, code generation) using a custom Python DAG orchestrator with persistent state, tool-calling permissions, and structured outputs (Pydantic/JSON Schema). Demonstrates strong production hardening practices—semantic contracts, golden-dataset prompt regression tests, circuit breakers, and multi-level monitoring—and delivered large productivity wins (34 hours of manual writing reduced to ~20 minutes review; ~15–20 engineering hours/week saved).

View profile
Alex ZhuZhou - Intern Full-Stack Software Engineer specializing in AI/LLM platforms and data systems in Berkeley, CA

Alex ZhuZhou

Screened

Intern Full-Stack Software Engineer specializing in AI/LLM platforms and data systems

Berkeley, CA2y exp
EmbraerUC Davis

Backend/LLM engineer with experience productionizing RAG systems (legal-case natural language querying) and optimizing for latency/cost, including a reported ~40% reduction via Redis caching and batching. Built monitoring and real-time debugging workflows (FastAPI, structured logging, correlation IDs, sandbox repro) and regularly delivered technical demos/workshops. Also partners with BD/sales to translate LLM capabilities into business value, including ESG-metric extraction from corporate filings.

View profile
Vamshikrishna Bandi - Senior AI/ML Engineer specializing in Generative AI and agentic multi-agent systems

Senior AI/ML Engineer specializing in Generative AI and agentic multi-agent systems

6y exp
PayPalTrine University

Built and shipped a production LLM-powered multi-agent RAG system to automate complex internal support workflows, integrating tool execution (SQL/APIs) with validation guardrails to reduce hallucinations. Optimized for real-world latency and cost via model routing, caching, and async parallel tool calls, and enforced reliability with CI-gated golden test sets derived from anonymized production queries.

View profile
Praveen Nutulapati - Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems in New York, NY

Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems

New York, NY6y exp
JPMorgan ChaseUniversity of Central Missouri

Built and deployed a production multi-agent RAG system at JPMorgan Chase to automate regulated credit analysis and compliance clause discovery across large internal policy/document libraries. Implemented LangGraph-based supervisor orchestration with structured state management (Azure OpenAI) to support long-running, resumable workflows, plus hybrid retrieval + re-ranking and guardrails for reliability. Strong at evaluation/observability (trace logging, LLM-judge, HITL) and at communicating results to non-technical stakeholders via Power BI embeds and Streamlit prototypes.

View profile
Vagmin Yadav - Junior Software Engineer specializing in backend systems, ML pipelines, and DevOps in Pune, India

Vagmin Yadav

Screened

Junior Software Engineer specializing in backend systems, ML pipelines, and DevOps

Pune, India2y exp
Unbox RoboticsGeorgia Tech

TypeScript backend engineer in the robotics domain with hands-on experience building low-latency (20–40ms) production systems using RabbitMQ, Redis, and HA PostgreSQL (Patroni). Has owned end-to-end services supporting 15 clients via config-driven architecture, with strong CI/CD, automated testing, and observability (OpenTelemetry) practices, plus API versioning/deprecation using Keycloak auth.

View profile
Harshana Ragipindi - Mid-level Support/Software Engineer specializing in incident response, automation, and AWS monitoring in USA

Mid-level Support/Software Engineer specializing in incident response, automation, and AWS monitoring

USA4y exp
AmazonUniversity of North Texas

Built and owned end-to-end travel booking and baggage fee calculation platforms used by both customer support and customers, emphasizing fast iteration with automated guardrails and production visibility. Experienced designing TypeScript/React systems and operating RabbitMQ-based microservices at scale, including disciplined event contracts, idempotent consumers, and schema evolution strategies. Also created an internal real-time troubleshooting/pricing console that replaced fragmented tools and improved support resolution workflows through pilot-led adoption.

View profile
Arun Gampala - Mid-level Full-Stack Developer specializing in MERN and AWS microservices in TX, USA

Arun Gampala

Screened

Mid-level Full-Stack Developer specializing in MERN and AWS microservices

TX, USA4y exp
MetLifeSouthern Arkansas University

Backend engineer with experience at MetLife and Amazon focused on security and control for internal and customer-facing services. Emphasizes contract-first Python/FastAPI APIs with strong auth (JWT + RBAC/claims), data-layer isolation (RLS/tenant scoping), and reliability practices like incremental refactors, rollback planning, and idempotency to handle retry-driven failure modes.

View profile
Lamar Petty - Mid-level Full-Stack Product Engineer specializing in data-driven web apps and healthcare systems in San Francisco, CA

Lamar Petty

Screened

Mid-level Full-Stack Product Engineer specializing in data-driven web apps and healthcare systems

San Francisco, CA13y exp
Wikimedia FoundationGeorgia Tech

Full-stack engineer with production experience shipping a healthcare-focused web app (Pregnancy-Pal) using Next.js/TypeScript on GCP, integrating a Python/Flask middleware and FHIR server for patient/practitioner dashboards and messaging. Former Wikimedia Foundation Android engineer who led the end-to-end 'Year in Review' feature and built robust automated testing/CI practices (Espresso, GitHub Actions matrix). Strong emphasis on reliability via rigorous validation, comprehensive Postman testing, and detailed API documentation.

View profile
Venu Venkata Surendra reddy Erusu - Mid-Level Software Engineer specializing in AI/ML and Cloud-Native Microservices in Syracuse, NY

Mid-Level Software Engineer specializing in AI/ML and Cloud-Native Microservices

Syracuse, NY4y exp
Syracuse UniversitySyracuse University

Research assistant at Syracuse University who owned a Python/FastAPI analytics backend for user-uploaded large datasets, using S3 streaming uploads and background workers for heavy processing. Has hands-on experience deploying Dockerized Python/Java microservices to AWS EKS with Jenkins-based CI/CD, plus Kafka-based event-driven pipelines and practical migration patterns (dependency mapping, dual-write, reconciliation) to minimize downtime.

View profile
PS

Senior Software Engineer specializing in backend infrastructure, cloud automation, and reliability

Mountain View, CA8y exp
OracleStony Brook University

End-to-end deployment owner for Oracle document delivery/print services in a hospital-like production environment, focused on reliability/performance at scale (thousands of systems). Also describes implementing event-driven RAG/agentic LLM workflows with attention to embeddings/index consistency, latency, and measurable improvements in response relevance and operational efficiency.

View profile
Sunil Parikh - Executive enterprise architect specializing in cloud, cybersecurity, and platform modernization in Plano, TX

Sunil Parikh

Screened

Executive enterprise architect specializing in cloud, cybersecurity, and platform modernization

Plano, TX26y exp
Capital OneStevens Institute of Technology

Architect with early startup experience (1999-2000) who later worked with Capital One evaluating startup products, strategy, and roadmaps. Brings a structured approach to innovation through market research, competitor analysis, risk assessment, gap analysis, and proof-of-concept thinking.

View profile
YP

Mid-level Software Engineer specializing in backend, distributed systems, and AI infrastructure

Menlo Park, CA4y exp
SnowflakeUSC

Built Baioniq, an enterprise LLM platform for automating extraction from massive unstructured documents like contracts and insurance claims. They demonstrate unusually strong production depth in agentic AI—scaling to 100k+ requests/day, processing 1M+ claim documents, and improving extraction accuracy through rigorous RAG architecture, evaluation, and fallback design.

View profile
YJ

YASH JADHAV

Screened

Junior Data Scientist specializing in customer and growth analytics

New York, NY2y exp
Stanford AIMINew York University

Candidate combines fraud analytics experience at Citi with a clinical AI capstone involving reproducible ML pipelines for imaging and notes data. They stand out for turning messy, high-volume data into decision-ready reporting, automating evaluation workflows, and translating analytics into operational impact—from fraud rule changes to retention metric adoption.

View profile

Need someone specific?

AI Search