Vetted FastAPI Professionals

Pre-screened and vetted.

SS

Junior Software Engineer specializing in ML, distributed systems, and LLM applications

Austin, TX1y exp
ZondaUC San Diego

Interned at Zonda where he built an AI-driven semantic search solution over ~280M housing/builder records. Iterated from local LLMs via llama.cpp quantization to a vector-embedding retrieval system, then boosted semantic accuracy with a custom spaCy NER layer and re-ranking, optimizing for latency through precomputation. Collaborated with economics-focused stakeholders to reduce manual document/paperwork time by enabling natural-language search over internal data.

View profile
AS

Avijit Saha

Screened

Junior Software Engineer specializing in cloud-native microservices and AI/ML observability

Bedford, TX3y exp
JPMorgan ChaseUniversity of the Cumberlands

Engineer with banking and industrial/IoT experience who has deployed a payment-processing microservice with zero downtime, handling Protobuf schema evolution and sensitive data migration via dual-write/checksum techniques. Demonstrates strong cross-stack troubleshooting (pinpointed intermittent distributed timeouts to a failing ToR switch port) and customer-facing Python ETL customization using plugin-based parsers and Pydantic validation, plus hands-on monitoring/alerting improvements with operators.

View profile
SK

Mid-level Machine Learning Engineer specializing in NLP and cloud MLOps

CT, USA4y exp
ServiceNowRivier University

Built and deployed a production LLM-powered internal documentation assistant using embeddings, a vector database, and a RAG pipeline to reduce time spent searching PDFs/manuals. Experienced in orchestrating end-to-end LLM workflows with Airflow/LangChain, improving reliability via monitoring/error handling, and driving measurable quality through retrieval and hallucination-focused evaluation metrics.

View profile
JG

Junior Software Engineer specializing in AI, security, and cloud systems

Trondheim, Norway1y exp
Norwegian University of Science and TechnologyUniversity of Waterloo

Built and deployed an LLM + RAG + memory system on a Furhat social robot, adding continuous face/voice recognition embeddings over WebSockets to enable persistent, natural conversations across sessions. Experienced working around real-world hardware/latency constraints and uses Datadog plus structured debugging/rollback practices for stabilizing customer-facing LLM workflows.

View profile
RK

Rohit Khoja

Screened

Mid-level Full-Stack Engineer specializing in cloud microservices and NLP/LLM systems

Tempe, AZ4y exp
CitigroupArizona State University

Full-stack engineer with 3+ years using Java/Spring Boot (Citi) and React, who built a production observability dashboard monitoring 53 microservices across 17 clusters with real-time health/latency tracing and significant performance improvements (cut load time from ~10s). Also designed a serverless AWS face-recognition system (Lambda/S3/SQS) built to handle burst traffic (~1000 concurrent requests), demonstrating strength in scalable, event-driven architectures.

View profile
SV

Mid-level AI/ML Engineer specializing in Generative AI and Conversational AI

Remote5y exp
InfosysUniversity at Buffalo

GenAI Engineer at Infosys who built and deployed a production multi-agent RAG system for a top-tier bank, scaling to ~50,000 queries/day with 99.9% uptime. Drove measurable gains (45% accuracy improvement, 30% API cost reduction) through open-source LLM fine-tuning, Pinecone indexing/retrieval optimization, and AWS-based MLOps/monitoring, and has experience enabling adoption via developer workshops and customer-facing collaboration.

View profile
RK

Principal Software Engineer specializing in AI/ML and cloud-native backend systems

New York, NY16y exp
McKinsey & CompanyNJIT

McKinsey data/ML practitioner who led production deployment of an entity resolution + semantic search platform for unstructured finance and healthcare data, integrating with legacy systems under HIPAA constraints. Deep hands-on stack across transformers (spaCy/HF BERT), embeddings + FAISS, and production MLOps/workflow tooling (Airflow, Docker, CI/CD, Prometheus/Grafana), with reported gains of +30% decision speed and +25% search relevance.

View profile
SA

Mid-level Software Engineer specializing in cloud-native microservices and AI-powered web applications

Remote, USA5y exp
BigCommerceArizona State University

Backend engineer who built and owned an AI-powered SMS survey platform for a nonprofit serving at-risk communities (internet-limited users), using Cloudflare Workers + Twilio and a state-machine survey engine. Scaled it to ~10k active users with near-zero downtime, added English/Spanish support, and iteratively improved LLM behavior (Claude 3.7 Sonnet) to handle nuanced, real-world SMS responses reliably.

View profile
SU

Intern Software Engineer specializing in AWS cloud architecture and GenAI systems

Seattle, WA2y exp
Amazon Web ServicesSan José State University

AWS Solutions Architect intern who advised customers on securing a multi-tenant LLM-based SaaS, including isolation strategy tradeoffs and production guardrails against prompt injection. Has experience investigating a prompt-injection incident using logs/traces and TTP-style documentation, and designing scalable SDK/agent integrations via asynchronous worker architecture with prompt versioning.

View profile
GJ

Mid-level Machine Learning Engineer specializing in MLOps, NLP, and Computer Vision

USA5y exp
WalmartUniversity of New Haven

ML/AI engineer with production experience across retail and healthcare: built a real-time computer-vision shelf monitoring system at Walmart and optimized edge inference latency by ~30% using TensorRT/ONNX and pruning. Also partnered with CVS Health clinical/pharmacy teams to deliver a medication-adherence predictive model, using Streamlit explainability dashboards and achieving an 18% adherence improvement.

View profile
RH

Rahul Hatkar

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps

San Francisco, CA6y exp
Scale AIWebster University

AI/ML engineer who has shipped production AI systems end-to-end, including an automated multi-channel (Gmail/WhatsApp/voice) candidate interviewing workflow and an enterprise RAG knowledge search platform. Demonstrates strong production rigor (monitoring, A/B tests, guardrails, schema validation, shadow testing) with quantified impact: ~60–70% reduction in interview evaluation time and ~20–30% relevance gains in RAG retrieval.

View profile
AP

Mid-level Machine Learning Engineer specializing in fraud detection and LLM applications

Charlotte, NC5y exp
Bank of AmericaUniversity of North Carolina at Charlotte

Unreal Engine UI engineer focused on scalable, production-ready UI architecture (C++/Slate/UMG/CommonUI) with strong designer enablement via decoupled, interface-driven patterns and MVVM. Demonstrated measurable performance wins: replaced 200+ per-frame Blueprint bindings to cut UI prepass/paint from 4.2ms to 0.5ms and reduced VRAM by ~120MB using texture streaming proxies.

View profile
SS

Intern AI/ML Engineer specializing in GenAI pipelines and cloud automation

Tempe, AZ1y exp
Catalyst SolutionsArizona State University

Built and productionized a Python/LLM-based pipeline at Catalyst Solutions to automate healthcare RFP processing, turning unstructured documents into validated JSON/Excel with schema validation, confidence scoring, and human-review routing. Delivered major operational impact (hours-to-minutes processing, ~60% efficiency gain; 50+ RFPs processed) and modernized legacy scripts into a staged, more reliable architecture using incremental refactoring and fallback comparisons.

View profile
PG

Palash Gharde

Screened

Mid-level Software Development Engineer specializing in backend, data engineering, and ML systems

Arizona, USA5y exp
ServiceNowArizona State University

ML/Backend engineer with ServiceNow experience building production-grade inference services on FastAPI with Docker/Kubernetes (autoscaling, health checks) and strong reliability practices (monitoring, retries/timeouts, fallbacks). Delivered measurable improvements including 30% lower API latency and 18% higher model accuracy, and built A/B testing plus drift-triggered retraining loops to keep models stable in production.

View profile
John Chen - Junior Full-Stack & Data Scientist specializing in ML/NLP and analytics products in Redwood City, CA

John Chen

Screened

Junior Full-Stack & Data Scientist specializing in ML/NLP and analytics products

Redwood City, CA2y exp
ProfitPropsGeorgia Tech

Built and deployed profitprops.io, a sports betting player-props prediction product using ML/AI. Implemented backend APIs with FastAPI/Express.js and Supabase, trained models on AWS GPU (P3) using Docker + RAPIDS, and set up CI/CD with GitHub Actions while working around cost constraints and data-collection hurdles (EC2 proxy rotation/rate limits).

View profile
Monish Sri Sai Devineni - Mid-level Machine Learning Engineer specializing in financial AI, NLP, and MLOps in Boca Raton, FL

Mid-level Machine Learning Engineer specializing in financial AI, NLP, and MLOps

Boca Raton, FL5y exp
Morgan StanleyFlorida Atlantic University

AI/ML engineer with experience at Accenture and Morgan Stanley, building production LLM systems (GPT-3 summarization) and finance-focused ML models (credit risk and trading anomaly detection). Combines MLOps depth (Docker/Kubernetes, AWS SageMaker/Glue/Lambda, MLflow, A/B testing, drift monitoring) with practical domain adaptation techniques like few-shot prompting and RAG/knowledge-base integration.

View profile
Sayali Patil - Mid-level Python Full-Stack Developer specializing in Healthcare and FinTech in Everett, MA

Sayali Patil

Screened

Mid-level Python Full-Stack Developer specializing in Healthcare and FinTech

Everett, MA6y exp
Kaiser PermanenteHarrisburg University of Science and Technology

Backend engineer with hands-on experience building a fraud-transaction monitoring system in Python/Flask, architected as Dockerized microservices and integrated with Kafka for high-volume streaming. Demonstrates strong performance and reliability chops across PostgreSQL/SQLAlchemy tuning (EXPLAIN ANALYZE, N+1 fixes, bulk ops), multi-tenant data isolation, and scaling via background workers + Redis caching, plus real-time ML inference deployment using TensorFlow on AWS.

View profile
Junhui Huang - Intern Machine Learning Engineer specializing in LLMs, MLOps, and NLP in Providence, RI

Junhui Huang

Screened

Intern Machine Learning Engineer specializing in LLMs, MLOps, and NLP

Providence, RI1y exp
Harvard UniversityBrown University

Built and deployed a production LLM-driven Dungeons & Dragons game where the model acts as a dungeon master, adding a structured combat system and a macro-state tree to ensure campaigns converge to a clear ending. Fine-tuned Gemini 2.5 Flash on Vertex AI and deployed on GCP with Kubernetes, using RAG over DnD rules/spells plus multi-agent orchestration (intent-based routing between narrative and combat agents) to reduce hallucinations and improve reliability.

View profile
Vaibhav Sharma - Mid-level Software Engineer specializing in AI/ML and data platforms in Remote, USA

Mid-level Software Engineer specializing in AI/ML and data platforms

Remote, USA5y exp
GoogleIndiana University Bloomington

AI/ML engineer who built a production agentic system to automate computational research experiments (simulation execution, parameter exploration, and numerical analysis) and mitigated context-window failures using constrained tool-calling/prompt-chaining patterns in LangChain with OpenAI tool-enabled models. Also has adtech/big-data pipeline experience at InMobi, orchestrating Spark jobs in Airflow to filter bot-like user IDs and publish clean IDs to an online NoSQL store for live serving, plus Apache open-source collaboration experience.

View profile
Prasannakumar B Vardi - Senior Software Engineer specializing in low-latency ad targeting and distributed backend systems in Santa Clara, CA

Senior Software Engineer specializing in low-latency ad targeting and distributed backend systems

Santa Clara, CA9y exp
CardlyticsStony Brook University

Backend/platform engineer who built a high-scale audience segmentation and real-time targeting system using Spark/Glue + S3/Hudi and low-latency API services backed by Redis/relational stores. Demonstrates strong production rigor: Spark performance tuning to eliminate OOM failures, API idempotency/caching to cut p95 latency ~40%, and careful dual-run/feature-flag migrations with reconciliation and rollback runbooks. Experienced implementing layered security with JWT/OAuth, RBAC/ABAC, and database row-level security to prevent privilege escalation.

View profile
Yun-Ting Chiou - Junior Full-Stack Software Engineer specializing in TypeScript, React, and Java microservices in Chicago, IL

Junior Full-Stack Software Engineer specializing in TypeScript, React, and Java microservices

Chicago, IL2y exp
Prospect EquitiesUniversity of Chicago

Software engineer with finance-domain experience who built an internal transaction management system end-to-end at Prospect Equities (TypeScript/React Native + Java Spring Boot microservices on AWS), delivering 40% lower query latency and 73% operational efficiency gains. Has also designed Terraform-provisioned, SQS-based distributed systems and scaled workloads to 10,000+ concurrent users, including monolith-to-SOA modernization that cut internal review time by 47%.

View profile
Pranav Chand - Senior AI/ML Engineer specializing in Generative AI and LLM platforms in ServiceNow, CA

Pranav Chand

Screened

Senior AI/ML Engineer specializing in Generative AI and LLM platforms

ServiceNow, CA5y exp
ServiceNowCalifornia State University, Fullerton

Backend engineer focused on multi-tenant enterprise AI personalization and recommendation platforms, combining ML/LLM intent extraction with deterministic policy guardrails for compliance and auditability. Has hands-on AWS experience (ECS/Lambda/DynamoDB/S3) and led a careful DynamoDB single-table migration using dual write/read, canary + feature-flag rollouts, and strong observability/security (JWT/OAuth2, RBAC, Postgres RLS).

View profile
Sankalp Tiwari - Mid-Level Software Engineer specializing in backend microservices and FinTech data pipelines in New York, NY

Mid-Level Software Engineer specializing in backend microservices and FinTech data pipelines

New York, NY4y exp
Goldman SachsSan José State University

Backend engineer at Goldman Sachs who built LLM-powered reconciliation/reporting services and high-throughput Kafka pipelines (8M+ events/day). Strong in production-grade Python/FastAPI microservices on Kubernetes with GitOps-style CI/CD, plus experience migrating legacy reporting/settlement services onto an internal Kubernetes platform using shadow deployments and gradual cutovers.

View profile
Akshit Modi - Mid-level AI/ML Engineer specializing in healthcare NLP and MLOps in Remote, USA

Akshit Modi

Screened

Mid-level AI/ML Engineer specializing in healthcare NLP and MLOps

Remote, USA5y exp
TempusArizona State University

Healthcare/clinical ML practitioner who built and productionized ClinicalBERT-based pipelines to extract and standardize oncology EHR data, improving downstream model F1 from 0.81 to 0.92 while controlling training cost via LoRA/QLoRA. Experienced orchestrating real-time AWS ETL/ML workflows (Glue, Lambda, SageMaker) and partnering with clinicians using SHAP-based interpretability, contributing to an 18% reduction in readmissions and full adoption.

View profile

Need someone specific?

AI Search