Pre-screened and vetted.
Junior Software Engineer specializing in ML, distributed systems, and LLM applications
“Interned at Zonda where he built an AI-driven semantic search solution over ~280M housing/builder records. Iterated from local LLMs via llama.cpp quantization to a vector-embedding retrieval system, then boosted semantic accuracy with a custom spaCy NER layer and re-ranking, optimizing for latency through precomputation. Collaborated with economics-focused stakeholders to reduce manual document/paperwork time by enabling natural-language search over internal data.”
Junior Software Engineer specializing in cloud-native microservices and AI/ML observability
“Engineer with banking and industrial/IoT experience who has deployed a payment-processing microservice with zero downtime, handling Protobuf schema evolution and sensitive data migration via dual-write/checksum techniques. Demonstrates strong cross-stack troubleshooting (pinpointed intermittent distributed timeouts to a failing ToR switch port) and customer-facing Python ETL customization using plugin-based parsers and Pydantic validation, plus hands-on monitoring/alerting improvements with operators.”
Mid-level Machine Learning Engineer specializing in NLP and cloud MLOps
“Built and deployed a production LLM-powered internal documentation assistant using embeddings, a vector database, and a RAG pipeline to reduce time spent searching PDFs/manuals. Experienced in orchestrating end-to-end LLM workflows with Airflow/LangChain, improving reliability via monitoring/error handling, and driving measurable quality through retrieval and hallucination-focused evaluation metrics.”
Junior Software Engineer specializing in AI, security, and cloud systems
“Built and deployed an LLM + RAG + memory system on a Furhat social robot, adding continuous face/voice recognition embeddings over WebSockets to enable persistent, natural conversations across sessions. Experienced working around real-world hardware/latency constraints and uses Datadog plus structured debugging/rollback practices for stabilizing customer-facing LLM workflows.”
Mid-level Full-Stack Engineer specializing in cloud microservices and NLP/LLM systems
“Full-stack engineer with 3+ years using Java/Spring Boot (Citi) and React, who built a production observability dashboard monitoring 53 microservices across 17 clusters with real-time health/latency tracing and significant performance improvements (cut load time from ~10s). Also designed a serverless AWS face-recognition system (Lambda/S3/SQS) built to handle burst traffic (~1000 concurrent requests), demonstrating strength in scalable, event-driven architectures.”
Mid-level AI/ML Engineer specializing in Generative AI and Conversational AI
“GenAI Engineer at Infosys who built and deployed a production multi-agent RAG system for a top-tier bank, scaling to ~50,000 queries/day with 99.9% uptime. Drove measurable gains (45% accuracy improvement, 30% API cost reduction) through open-source LLM fine-tuning, Pinecone indexing/retrieval optimization, and AWS-based MLOps/monitoring, and has experience enabling adoption via developer workshops and customer-facing collaboration.”
Principal Software Engineer specializing in AI/ML and cloud-native backend systems
“McKinsey data/ML practitioner who led production deployment of an entity resolution + semantic search platform for unstructured finance and healthcare data, integrating with legacy systems under HIPAA constraints. Deep hands-on stack across transformers (spaCy/HF BERT), embeddings + FAISS, and production MLOps/workflow tooling (Airflow, Docker, CI/CD, Prometheus/Grafana), with reported gains of +30% decision speed and +25% search relevance.”
Mid-level Software Engineer specializing in cloud-native microservices and AI-powered web applications
“Backend engineer who built and owned an AI-powered SMS survey platform for a nonprofit serving at-risk communities (internet-limited users), using Cloudflare Workers + Twilio and a state-machine survey engine. Scaled it to ~10k active users with near-zero downtime, added English/Spanish support, and iteratively improved LLM behavior (Claude 3.7 Sonnet) to handle nuanced, real-world SMS responses reliably.”
Intern Software Engineer specializing in AWS cloud architecture and GenAI systems
“AWS Solutions Architect intern who advised customers on securing a multi-tenant LLM-based SaaS, including isolation strategy tradeoffs and production guardrails against prompt injection. Has experience investigating a prompt-injection incident using logs/traces and TTP-style documentation, and designing scalable SDK/agent integrations via asynchronous worker architecture with prompt versioning.”
Mid-level Machine Learning Engineer specializing in MLOps, NLP, and Computer Vision
“ML/AI engineer with production experience across retail and healthcare: built a real-time computer-vision shelf monitoring system at Walmart and optimized edge inference latency by ~30% using TensorRT/ONNX and pruning. Also partnered with CVS Health clinical/pharmacy teams to deliver a medication-adherence predictive model, using Streamlit explainability dashboards and achieving an 18% adherence improvement.”
Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps
“AI/ML engineer who has shipped production AI systems end-to-end, including an automated multi-channel (Gmail/WhatsApp/voice) candidate interviewing workflow and an enterprise RAG knowledge search platform. Demonstrates strong production rigor (monitoring, A/B tests, guardrails, schema validation, shadow testing) with quantified impact: ~60–70% reduction in interview evaluation time and ~20–30% relevance gains in RAG retrieval.”
Mid-level Machine Learning Engineer specializing in fraud detection and LLM applications
“Unreal Engine UI engineer focused on scalable, production-ready UI architecture (C++/Slate/UMG/CommonUI) with strong designer enablement via decoupled, interface-driven patterns and MVVM. Demonstrated measurable performance wins: replaced 200+ per-frame Blueprint bindings to cut UI prepass/paint from 4.2ms to 0.5ms and reduced VRAM by ~120MB using texture streaming proxies.”
Intern AI/ML Engineer specializing in GenAI pipelines and cloud automation
“Built and productionized a Python/LLM-based pipeline at Catalyst Solutions to automate healthcare RFP processing, turning unstructured documents into validated JSON/Excel with schema validation, confidence scoring, and human-review routing. Delivered major operational impact (hours-to-minutes processing, ~60% efficiency gain; 50+ RFPs processed) and modernized legacy scripts into a staged, more reliable architecture using incremental refactoring and fallback comparisons.”
Mid-level Software Development Engineer specializing in backend, data engineering, and ML systems
“ML/Backend engineer with ServiceNow experience building production-grade inference services on FastAPI with Docker/Kubernetes (autoscaling, health checks) and strong reliability practices (monitoring, retries/timeouts, fallbacks). Delivered measurable improvements including 30% lower API latency and 18% higher model accuracy, and built A/B testing plus drift-triggered retraining loops to keep models stable in production.”
Junior Full-Stack & Data Scientist specializing in ML/NLP and analytics products
“Built and deployed profitprops.io, a sports betting player-props prediction product using ML/AI. Implemented backend APIs with FastAPI/Express.js and Supabase, trained models on AWS GPU (P3) using Docker + RAPIDS, and set up CI/CD with GitHub Actions while working around cost constraints and data-collection hurdles (EC2 proxy rotation/rate limits).”
Mid-level Machine Learning Engineer specializing in financial AI, NLP, and MLOps
“AI/ML engineer with experience at Accenture and Morgan Stanley, building production LLM systems (GPT-3 summarization) and finance-focused ML models (credit risk and trading anomaly detection). Combines MLOps depth (Docker/Kubernetes, AWS SageMaker/Glue/Lambda, MLflow, A/B testing, drift monitoring) with practical domain adaptation techniques like few-shot prompting and RAG/knowledge-base integration.”
Mid-level Python Full-Stack Developer specializing in Healthcare and FinTech
“Backend engineer with hands-on experience building a fraud-transaction monitoring system in Python/Flask, architected as Dockerized microservices and integrated with Kafka for high-volume streaming. Demonstrates strong performance and reliability chops across PostgreSQL/SQLAlchemy tuning (EXPLAIN ANALYZE, N+1 fixes, bulk ops), multi-tenant data isolation, and scaling via background workers + Redis caching, plus real-time ML inference deployment using TensorFlow on AWS.”
Intern Machine Learning Engineer specializing in LLMs, MLOps, and NLP
“Built and deployed a production LLM-driven Dungeons & Dragons game where the model acts as a dungeon master, adding a structured combat system and a macro-state tree to ensure campaigns converge to a clear ending. Fine-tuned Gemini 2.5 Flash on Vertex AI and deployed on GCP with Kubernetes, using RAG over DnD rules/spells plus multi-agent orchestration (intent-based routing between narrative and combat agents) to reduce hallucinations and improve reliability.”
Mid-level Software Engineer specializing in AI/ML and data platforms
“AI/ML engineer who built a production agentic system to automate computational research experiments (simulation execution, parameter exploration, and numerical analysis) and mitigated context-window failures using constrained tool-calling/prompt-chaining patterns in LangChain with OpenAI tool-enabled models. Also has adtech/big-data pipeline experience at InMobi, orchestrating Spark jobs in Airflow to filter bot-like user IDs and publish clean IDs to an online NoSQL store for live serving, plus Apache open-source collaboration experience.”
Senior Software Engineer specializing in low-latency ad targeting and distributed backend systems
“Backend/platform engineer who built a high-scale audience segmentation and real-time targeting system using Spark/Glue + S3/Hudi and low-latency API services backed by Redis/relational stores. Demonstrates strong production rigor: Spark performance tuning to eliminate OOM failures, API idempotency/caching to cut p95 latency ~40%, and careful dual-run/feature-flag migrations with reconciliation and rollback runbooks. Experienced implementing layered security with JWT/OAuth, RBAC/ABAC, and database row-level security to prevent privilege escalation.”
Junior Full-Stack Software Engineer specializing in TypeScript, React, and Java microservices
“Software engineer with finance-domain experience who built an internal transaction management system end-to-end at Prospect Equities (TypeScript/React Native + Java Spring Boot microservices on AWS), delivering 40% lower query latency and 73% operational efficiency gains. Has also designed Terraform-provisioned, SQS-based distributed systems and scaled workloads to 10,000+ concurrent users, including monolith-to-SOA modernization that cut internal review time by 47%.”
Senior AI/ML Engineer specializing in Generative AI and LLM platforms
“Backend engineer focused on multi-tenant enterprise AI personalization and recommendation platforms, combining ML/LLM intent extraction with deterministic policy guardrails for compliance and auditability. Has hands-on AWS experience (ECS/Lambda/DynamoDB/S3) and led a careful DynamoDB single-table migration using dual write/read, canary + feature-flag rollouts, and strong observability/security (JWT/OAuth2, RBAC, Postgres RLS).”
Mid-Level Software Engineer specializing in backend microservices and FinTech data pipelines
“Backend engineer at Goldman Sachs who built LLM-powered reconciliation/reporting services and high-throughput Kafka pipelines (8M+ events/day). Strong in production-grade Python/FastAPI microservices on Kubernetes with GitOps-style CI/CD, plus experience migrating legacy reporting/settlement services onto an internal Kubernetes platform using shadow deployments and gradual cutovers.”
Mid-level AI/ML Engineer specializing in healthcare NLP and MLOps
“Healthcare/clinical ML practitioner who built and productionized ClinicalBERT-based pipelines to extract and standardize oncology EHR data, improving downstream model F1 from 0.81 to 0.92 while controlling training cost via LoRA/QLoRA. Experienced orchestrating real-time AWS ETL/ML workflows (Glue, Lambda, SageMaker) and partnering with clinicians using SHAP-based interpretability, contributing to an 18% reduction in readmissions and full adoption.”