Vetted Observability Professionals

Pre-screened and vetted.

JA

Entry-Level AI/ML Engineer specializing in LLM automation and RAG systems

Remote, USA1y exp
BalancedTrustNortheastern University

AI Automation Engineer at BalancedTrust who single-handedly shipped production LLM features for FinTech compliance: a policy gap-analysis pipeline (SOC 2/GDPR) and a RAG-based regulatory chatbot. Deeply focused on reliability in high-stakes legal/compliance settings, with strong production engineering (edge functions, parallelized batching to cut latency, structured JSON outputs, guardrails, and monitoring) and close collaboration with non-technical compliance experts.

View profile
NM

Mid-level Machine Learning Engineer specializing in cloud-native generative AI for healthcare

Seattle, WA4y exp
Cleveland ClinicUniversity of the Cumberlands

AI engineer at Cleveland Clinic building production LLM/NLP systems for radiology documentation, focused on HIPAA-aware, real-time performance across ~298 campuses. Re-architected infrastructure with AWS event-driven services to handle scaling and improved SLA compliance ~40%, and complements this with a personal multi-agent debate system (CrewAI) using local Llama/Mistral plus rigorous evaluation (A/B tests, red teaming, observability).

View profile
ST

Mid-level Full-Stack Developer specializing in Healthcare and FinTech web applications

Remote, USA4y exp
Fairview Health ServicesUniversity of Dayton

Hands-on engineer focused on productionizing LLM-powered assistants: builds RAG pipelines with guardrails, response schemas, and citation-grounded outputs, then hardens them with explicit NFRs (latency, uptime, security, cost). Experienced diagnosing agentic/LLM workflow issues in real time using observability and stepwise isolation, and supports go-to-market via developer demos, workshops, and pre-sales technical evaluations in microservices/Spring Boot environments.

View profile
BB

Mid-level Backend Engineer specializing in microservices and event-driven systems

Mclean, VA4y exp
Restaurant Brands InternationalUniversity of Maryland, Baltimore County

Backend-leaning full-stack engineer who has built and operated event-driven microservices platforms (FastAPI/React/TypeScript, Kafka, Kubernetes) and internal DevOps tooling. Delivered measurable impact through user-feedback-driven iteration (WebSockets update mechanism cutting redundant API calls ~30%) and operational improvements (deployment monitoring dashboard reducing rollback time ~40%), with strong focus on reliability, observability, and data consistency at scale.

View profile
BS

Full-Stack Software Engineer specializing in Java, React, and AWS

Plano, TX3y exp
Progress SolutionsNorthwest Missouri State University

Backend-focused Python engineer who builds modular Flask services on AWS and specializes in performance/scalability work across data-heavy APIs. Has concrete wins in query optimization (1.5s to <200ms) and high-throughput async processing (Celery+Redis, ~40% throughput gain), plus experience serving scikit-learn text classification models via containerized REST services and designing multi-tenant data isolation strategies.

View profile
HK

Mid-level Full-Stack Developer specializing in cloud-native microservices and real-time data streaming

Remote, USA4y exp
BMODePaul University

Full-stack engineer who has owned React/TypeScript + Spring Boot dashboard products end-to-end, including real-time performance/alerts and data aggregation across services. Strong in shipping MVPs quickly with feature flags, automated testing and CI/CD, and using monitoring/click-path analytics to prioritize work—achieved a 40% page-load reduction. Experienced operating microservices with RabbitMQ at scale, addressing retries/idempotency/observability and fixing duplicate-processing incidents with idempotent consumer patterns and DLQs.

View profile
SK

Mid-level AI Developer & Machine Learning Engineer specializing in LLM and MLOps systems

Champaign, IL5y exp
CenteneEastern Illinois University

Built and deployed an enterprise RAG application at Centene to help clinical teams retrieve insights from large internal policy document sets, cutting manual research by 30–40%. Implemented custom domain-adapted embeddings (SageMaker + BERT transfer learning) and hybrid retrieval (BM25 + Pinecone) to drive a 22% relevance lift, and ran the system in production on AWS EKS with CI/CD, MLflow, and Prometheus monitoring (99% uptime, ~40% latency reduction).

View profile
VU

Junior Full-Stack Software Engineer specializing in cloud web apps and authentication

Richardson, Texas3y exp
CrowdDoingUniversity of Texas at Dallas

Full-stack engineer with Deloitte and CrowdDoing experience shipping production web platforms on AWS (EC2/RDS/S3/Fargate) using React/TypeScript and Node/Express/PostgreSQL. Built customer-facing authentication/SSO flows (OAuth2 + JWT) and state-specific US privacy consent workflows, and also delivered a Python/Flask LLM-based finance document parser chatbot with vector DB integration and latency optimizations.

View profile
NT

Mid-level DevOps Engineer specializing in AWS/GCP Kubernetes and Terraform

New York, NY6y exp
ReliaQuestLewis University

IBM Power/AIX infrastructure engineer who owned a very large production estate (12 Power9 E980 frames and 400+ AIX 7.2 LPARs) with deep hands-on expertise in VIOS/vHMC, DLPAR, and PowerHA. Demonstrated strong incident response (zero-downtime DLPAR fix; split-brain prevention during storage failure) and modernization skills spanning Jenkins/Ansible CI/CD and Terraform automation for IBM Power Virtual Server/PowerVC.

View profile
Ronak Jain - Mid-level Sensor Fusion Research Engineer specializing in autonomous vehicle perception in Auburn Hills, MI

Ronak Jain

Screened

Mid-level Sensor Fusion Research Engineer specializing in autonomous vehicle perception

Auburn Hills, MI2y exp
Magna InternationalKettering University

Robotics/perception engineer with experience at Magna International building and scaling a ROS2-based autonomous vehicle sensor-fusion stack from radar+camera to include LiDAR, addressing hard problems like PTP nanosecond synchronization and probabilistic data association. Also developed and deployed a real-time 3D LiDAR object detection pipeline (PointPillars-style) optimized with ONNX/TensorRT and FP16, with strong production bringup/monitoring and rigorous simulation-to-road testing practices.

View profile
Vivekananda Reddy - Mid-Level Full-Stack Software Engineer specializing in Java, React, and AWS in Dallas, TX

Mid-Level Full-Stack Software Engineer specializing in Java, React, and AWS

Dallas, TX4y exp
AIGUniversity of North Texas

Backend engineer focused on cloud-native microservices on AWS, owning Python/Flask ingestion services integrated with S3/Lambda and deployed via Docker/Kubernetes with CI/CD. Has led phased migrations from manually managed EC2 setups to automated CloudFormation + pipeline-driven releases, and designed event-driven near-real-time pipelines with idempotency, retry/backoff, and strong observability.

View profile
RT

Rakesh Thota

Screened

Mid-level Data Engineer specializing in multi-cloud real-time data pipelines

California, USA5y exp
Molina HealthcareUniversity at Buffalo

Data engineer with healthcare/clinical trial domain experience who owned a 100TB+/month AWS pipeline end-to-end (Glue/S3/Redshift/Airflow) and drove measurable outcomes (20% lower latency, 99.9% reliability, 40% less manual reporting). Also built production data services and API-based ingestion on GCP (Cloud Run/Functions/BigQuery) with strong validation, versioning, and safe migration practices, and launched an early-stage RAG solution (LangChain + GPT-4) for researchers.

View profile
SV

Junior Software Engineer specializing in distributed systems and cloud microservices

Bellevue, WA3y exp
SeekOutNortheastern University

Built and shipped an AI-driven interview evaluation pipeline at SeekOut that automated recruiter screening via a multi-stage LLM agent workflow (.NET backend, RabbitMQ orchestration, Python workers). Emphasizes production-grade reliability (idempotency, retries, strict JSON/schema validation), strong observability with OpenTelemetry, and measurable efficiency gains including ~40% reduction in token usage/cost.

View profile
Sai Nikhil Kanchukatla - Senior Software Engineer specializing in Golang microservices and IAM/SSO in Dallas, TX

Senior Software Engineer specializing in Golang microservices and IAM/SSO

Dallas, TX6y exp
HCLTechUniversity of Texas at Arlington

Backend engineer with experience at DigitalOcean and BNY Mellon, specializing in secure, highly available authentication and API platforms. Built an enterprise SSO system integrating Okta via OIDC with resilience patterns (gRPC contracts, circuit breakers, Kafka) and strong encryption, and led a careful monolith-to-Golang microservices migration using shadow traffic, dual writes, and feature flags to preserve data integrity.

View profile
Hy Serure - Executive Technology Leader & Architect specializing in enterprise digital transformation in Oakhurst, NJ

Hy Serure

Screened

Executive Technology Leader & Architect specializing in enterprise digital transformation

Oakhurst, NJ29y exp
Stanley Black & DeckerBaruch College

Longtime S-Corp founder (20+ years) who evolved from running a web agency to becoming a Shopify expert contracted by agencies/brands to upskill teams and lead enterprise eCommerce builds. Recently built and launched a payment-links/QR-code web app in weeks that gained 20 users on day one without advertising; now looking to pivot toward building a scalable product business not tied to hourly contracting.

View profile
Eric Guzman - Senior Solutions Architect specializing in MLOps and AI platform operations in New York, NY

Eric Guzman

Screened

Senior Solutions Architect specializing in MLOps and AI platform operations

New York, NY7y exp
AccentureCity College of New York (CUNY)

Audio/music editor and mixer with Symphony Space promotional work (e.g., Uptown Showdown, Selected Shorts), focused on shaping emotion and pacing through tempo automation, tension-building harmonic choices, and precise cut-to-music timing. Pro Tools certified (Institute of Audio Research) with hands-on mixing workflows across Logic, Reason, and Cubase, and experience iterating based on commercial/producer feedback.

View profile
Xin Li - Staff Software Engineer specializing in distributed systems and high-concurrency APIs in Santa Clara, CA

Xin Li

Screened

Staff Software Engineer specializing in distributed systems and high-concurrency APIs

Santa Clara, CA17y exp
V99 Inc.University of Electronic Science and Technology of China

Full-stack engineer in an insurance + telematics org who built a 0→1 multi-agent, AI-driven mobile regression execution system integrating Jira and n8n, emphasizing determinism (schema-validated outputs), traceability (state-machine style step archives), and scalable device scheduling. Also designed an event-driven microservices architecture using Kafka with standardized API/event contracts and has hands-on incident response experience improving resilience via rate limiting, circuit breaking, caching, and idempotency.

View profile
Bhavishyasai Chigurupati - Mid-Level Data/ML Engineer specializing in Generative AI and cloud data platforms in Overland Park, KS

Mid-Level Data/ML Engineer specializing in Generative AI and cloud data platforms

Overland Park, KS5y exp
CignaUniversity of Central Missouri

Built and productionized an LLM-based financial document analysis system using a RAG pipeline, including robust ingestion/chunking/embedding workflows, vector DB retrieval, and an AWS-deployed FastAPI service containerized with Docker. Demonstrates strong applied expertise in improving retrieval quality and latency at scale, plus hands-on experience debugging agentic/LLM workflows with monitoring and trace-based analysis while supporting demos and customer-facing adoption.

View profile
Umair Khokhar - Director-level Engineering Leader specializing in AI Platforms for Enterprise B2B SaaS in San Francisco, CA

Umair Khokhar

Screened

Director-level Engineering Leader specializing in AI Platforms for Enterprise B2B SaaS

San Francisco, CA15y exp
BaseIQWilmington University

Technical leader/player-coach who architected and shipped an end-to-end computer vision pricing system for a major North American auto seller, using Go + Ray + AWS SageMaker in a low-latency distributed inference architecture. Strong in production governance (logs/tracing/guardrails/AppSec), reliability incident ownership (DNS limits affecting 20% traffic), and measurable delivery acceleration (deployment cycle 16→4 days; delivery speed 5→2 days) through process optimization and AI-assisted enablement.

View profile
Rupini Polavarapu - Senior DevOps Engineer specializing in AWS cloud infrastructure and CI/CD automation in Atlanta, GA

Senior DevOps Engineer specializing in AWS cloud infrastructure and CI/CD automation

Atlanta, GA10y exp
Global PaymentsVignan University

AWS platform/infra engineer with hands-on ownership of EKS cluster lifecycle (upgrades, node scaling, networking/ingress, and EBS-backed stateful storage) and reliability validation using Datadog plus CI/CD smoke tests. Also supported on-prem VMware environments and operated a hybrid on-prem-to-AWS setup over site-to-site VPN, including incident response and implementing change-controlled firewall processes and proactive connectivity health checks.

View profile
Emmanuel Bakwowi - Senior DevSecOps Engineer specializing in multi-cloud Kubernetes and CI/CD automation in New York, NY

Senior DevSecOps Engineer specializing in multi-cloud Kubernetes and CI/CD automation

New York, NY10y exp
SiriusPointUniversity of Bridgeport

Cloud/DevOps engineer operating across AWS and Azure, running Kubernetes workloads with secure CI/CD (GitHub Actions/Azure DevOps) and Terraform IaC. Has supported AIX/PowerHA systems in hybrid environments—handling failover testing, incident recovery, and performance troubleshooting (including multipath/storage-path issues)—and has led cutovers by managing dependencies, rollback, and stabilization.

View profile
JP

Jeet Patel

Screened

Junior AI and Backend Engineer specializing in LLM systems

Massachusetts, USA3y exp
Boston Wholesale Outlet IncNortheastern University

AI/LLM engineer who has shipped production RAG copilots and multi-agent workflows, including a real-time Llama3 (Ollama) copilot backend handling 12k+ concurrent queries at 99.9% uptime. Deep on orchestration (Langflow/Airflow/Kubernetes), reliability evaluation (hallucination detection, p95 latency, token cost), and monitoring (Prometheus/Grafana), with demonstrated stakeholder-facing analytics delivery via Tableau.

View profile
TEJASWI GUDIMETLA - Mid-level Software Engineer specializing in backend systems, microservices, and AI pipelines in North Carolina, USA

Mid-level Software Engineer specializing in backend systems, microservices, and AI pipelines

North Carolina, USA4y exp
AIM for Composites – University of Delaware / Clemson UniversityUniversity at Buffalo

AI/LLM engineer focused on building reliable, scalable multi-agent and RAG-based pipelines across microservices. Stands out for combining practical experimentation with strong engineering discipline around schema validation, retries, observability, and structured API contracts to make LLM systems production-ready.

View profile
PN

Prabir Nandi

Screened

Staff Software Engineer specializing in cloud platforms, Kubernetes, and AI-driven engineering

San Francisco, CA20y exp
Levi Strauss & Co.University of Calcutta

Built a production AI stylist agent for retail store associates that pulls customer loyalty, address, and purchase-history data to generate in-store product recommendations. Demonstrates hands-on experience across agent orchestration, MCP-based tool integration, Vertex AI/GCP deployment, observability, resilience patterns, and pragmatic production tradeoffs like using PostgreSQL with pgvector instead of a standalone vector database.

View profile

Need someone specific?

AI Search