Vetted FastAPI Professionals

Pre-screened and vetted.

RK

Mid-level AI/ML Engineer specializing in FinTech risk and fraud systems

San Francisco, CA4y exp
PlaidSaint Louis University
View profile
XJ

Senior Software Engineer specializing in FinTech backend systems

Kirkland, WA8y exp
SoFiNortheastern University
View profile
KC

Staff Software Engineer specializing in Healthcare SaaS and real-time systems

Seattle, WA11y exp
AmazonMonash University
View profile
HT

Senior Full-Stack Software Engineer specializing in large-scale streaming platforms

Seattle, WA10y exp
DisneyNYU
View profile
QG

Staff Software Engineer specializing in FinTech and scalable distributed systems

Menlo Park, CA12y exp
RobinhoodAugusta University
View profile
MK

Mid-level Machine Learning Engineer specializing in generative AI, NLP, and MLOps

4y exp
NVIDIAFlorida State University
View profile
NS

Mid-level AI/ML Engineer specializing in LLM training, RAG, and low-latency inference

New York city, NY4y exp
PerplexityCleveland State University
View profile
WW

Senior AI Engineer specializing in machine learning, NLP, and generative AI

Madison, WI13y exp
AmazonCase Western Reserve University
View profile
Sumer Joshi - Senior Full-Stack Engineer specializing in backend systems and AI applications in Remote

Sumer Joshi

Screened ReferencesStrong rec.

Senior Full-Stack Engineer specializing in backend systems and AI applications

Remote13y exp
MercorSanta Clara University

Candidate is deeply focused on AI-native software development, using a deliberate planner/implementer agent workflow with tools like Cursor, Claude, and Kimi. They also built a personal project called Config Proctor, an AI-agent-driven Terraform/AWS self-healing system that identifies infrastructure configuration gaps and proposes fixes.

View profile
AS

Akshat Shah

Screened ReferencesStrong rec.

Entry-level Software Engineer specializing in full-stack and AI systems

Los Angeles, CA1y exp
Integrated Media Systems CenterUSC

Frontend-leaning full-stack engineer who described owning an artist search and detail experience across UI, backend integrations, and data modeling. They show practical strength in scalable React architecture, TypeScript safety, and performance tuning, with a product-minded approach to shipping 0→1 features quickly and iterating after launch.

View profile
Manaswini Gogineni - Mid-Level Software Engineer specializing in cloud infrastructure and full-stack web development in San Francisco, CA

Manaswini Gogineni

Screened ReferencesStrong rec.

Mid-Level Software Engineer specializing in cloud infrastructure and full-stack web development

San Francisco, CA2y exp
CiscoUniversity of Wisconsin–Madison

Backend engineer at Electric Hydrogen who built a serverless device-log ingestion and processing platform in Python/Flask, scaling throughput (4x peak ingestion) while keeping sub-300ms API latency. Strong in Postgres/SQLAlchemy performance (partitioning, materialized views) and production ML integration (ONNX model served via FastAPI microservice with async batch inference, Redis feature caching, and drift monitoring via S3/Lambda). Experienced designing secure multi-tenant systems with schema-per-tenant isolation and KMS-backed encryption.

View profile
BS

Engineering Manager specializing in AI/ML platforms and 0→1 product delivery

Cambridge, MA15y exp
ElsevierHarvard University

Player-coach engineer/lead on a high-scale research integrity platform ("Lighthouse") that flags fraud/manipulation signals across ~3M academic manuscripts per year. Owns architecture decisions (ADRs), implements across Go/Java/React services, and introduced NLP (SciBERT embeddings + human-in-the-loop) to assess out-of-context citations while also handling production incidents with a data-consistency-first approach.

View profile
Yuan-Hsuan Wen - Intern Software Engineer specializing in AI agents, RAG pipelines, and semiconductor systems in Taipei, Taiwan

Intern Software Engineer specializing in AI agents, RAG pipelines, and semiconductor systems

Taipei, Taiwan3y exp
NVIDIAUSC

Built a web-based interface that connects an internal bug system to an LLM for initial debugging and issue classification, aiming to boost QA and software engineer efficiency while balancing latency and accuracy. Worked as a one-person project and managed constraints like limited hardware and difficulty extracting team debugging context, relying on manager communication and rapid modeling to validate direction.

View profile
Christopher Bun - Executive AI/ML technology leader specializing in healthcare, biotech, and legal AI in Irvine, CA

Executive AI/ML technology leader specializing in healthcare, biotech, and legal AI

Irvine, CA17y exp
Augnition LabsUniversity of Chicago

Repeat founder and startup advisor with experience spanning academic, health tech, legal tech, sports, and gaming. Has participated in fundraising and due diligence and has built companies, engineering teams, and software platforms from scratch, with a strong product-design-first approach to product-market fit and market selection.

View profile
Nishitha Thummala - Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference in San Francisco, CA

Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference

San Francisco, CA6y exp
PerplexityUniversity of Nebraska Omaha

Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.

View profile
Kowshika M - Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety in Santa Clara, CA

Kowshika M

Screened

Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety

Santa Clara, CA5y exp
NVIDIAOregon State University

AI/LLM engineer with production experience at NVIDIA, where they fine-tuned and deployed a financial-services chatbot and cut latency ~50% using TensorRT + NVIDIA Triton, scaling via Docker/Kubernetes. Also has consulting experience at Accenture delivering a predictive maintenance solution for a logistics network, bridging non-technical stakeholders with actionable dashboards.

View profile
DS

Executive CTO and Founder specializing in AI platforms and hyper-scale SaaS

South San Francisco, CA26y exp
Deep OriginUC Berkeley

CTO-minded builder seeking to join a startup; previously created an AI-driven platform that abstracted away DevOps and infrastructure for drug discovery researchers. Emphasizes high-leverage, zero-to-one execution with managed cloud/open-source tooling, and a strong reliability/reproducibility mindset validated against existing scientific pipelines.

View profile
Nikhil Reddy - Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms in San Francisco, CA

Nikhil Reddy

Screened

Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms

San Francisco, CA5y exp
NVIDIASaint Louis University

Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.

View profile
Krishna Reddy - Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants in New York, NY

Krishna Reddy

Screened

Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants

New York, NY6y exp
StripeIndiana Wesleyan University

Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.

View profile
AD

Alfred Dwan

Screened

Intern Software Engineer specializing in AI, cloud-native systems, and MLOps

Hong Kong, China1y exp
PredictXNYU

Backend/full-stack engineer who has owned a production recruiting platform end-to-end (TypeScript/Node microservices for scraping/cleaning/serving job data, RabbitMQ for spike handling, MongoDB + Elasticsearch, AWS containers) with pragmatic CI, logging/alerts, and Docker Compose E2E tests. Also operated high-traffic event pipelines during a Binance internship using Kafka + Redis idempotency, with strong observability and failure-mode/rollback/degradation practices, and has experience designing developer-friendly REST APIs and resilient browser automation for E2E flows.

View profile
AK

Avinash K

Screened

Mid-level Software Engineer specializing in AI/LLM and distributed systems

Stony Brook, NY4y exp
Creao AIStony Brook University

Recent internship project at Google Workspace building an LLM-driven Python backend pipeline to extract/enrich NLP features from messy customer web domains and integrate them into a Domain Feature Store for personalization and promotions. Also has hands-on Kubernetes/Docker deployment experience for a Digital Signage SaaS backend with GitHub Actions CI, plus strong streaming-systems knowledge (Kafka exactly-once, schema evolution, Flink scaling) and built an information retrieval system handling 30,000+ cases.

View profile
Dexin Huang - Junior AI Engineer specializing in LLM systems, RAG, and full-stack automation in Guilford, CT

Dexin Huang

Screened

Junior AI Engineer specializing in LLM systems, RAG, and full-stack automation

Guilford, CT1y exp
Slothful LLC (Iris)Columbia University

Built and deployed an AI receptionist product for field-service businesses (HVAC/electrician), including real-time Jobber scheduling integrations and Twilio-based calling. Combines hands-on customer/operator shadowing with strong production engineering (queueing to handle API limits, rigorous testing/mocking, mirrored prod environment) and cross-layer troubleshooting, driving user adoption through review/override workflows.

View profile

Need someone specific?

AI Search