Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in LLM agents, RAG retrieval, and IoT ML systems
“Built production LLM-driven products including a job-hunt AI (job ranking + resume optimization) and an InterviewAI agentic pipeline using LangChain. Focused on practical deployment concerns like securing OpenAI usage via rate limiting and tiered quotas, and demonstrates an applied approach to choosing models, retrieval methods (RAG), and prompting strategies.”
Intern Machine Learning Engineer specializing in Generative AI and RAG systems
“Early-career AI/LLM builder who created and deployed a multi-agent news analysis system (Patrakarita) using CrewAI, coordinating researcher/analyst roles to turn noisy article URLs into structured, prioritized outputs (claims, tone, verification questions, opposing views). Strong focus on orchestration debugging and reliability evaluation, including measuring hallucination/redundancy and improving reasoning by refactoring pipeline sequencing.”
Junior Machine Learning Engineer specializing in LLM agents, RAG, and MLOps
“AI/ML engineer who has shipped production systems across computer vision and conversational agents: built a YOLOv8-based wheel fitment pipeline at a Techstars-backed automotive startup, focusing on sub-second latency, monitoring, and robust fallback mechanisms that drove 2–3x page view growth and +5–6k users. Also built a voice-based interview platform orchestrating Deepgram + GPT-4 Mini + OpenAI TTS with FSM-driven reliability, and has hands-on RAG experience (LangChain, hybrid retrieval, cross-encoder reranking, custom pseudo-query generation).”
Mid-level AI/ML Engineer specializing in Generative AI and RAG systems
“Currently at ProShare and reports building an AI/LLM-powered system deployed to production, aimed at helping with status-related difficulties and reducing misunderstandings across transactions. Also cites prior collaboration at Porsche with marketing teams, focusing on translating marketing goals into technical requirements and communicating solutions clearly to non-technical stakeholders.”
Mid-level GenAI Engineer specializing in LLM agents and RAG systems
“Built and deployed a production RAG-based LLM assistant that answers day-to-day operational questions from internal PDFs/SOPs, with strong emphasis on data consistency (metadata versioning, confidence thresholds, conflict handling) and low-latency retrieval at scale. Experienced designing and orchestrating multi-agent LLM workflows (retrieval/validation/generation) and pipeline orchestration for ingestion/embedding/vector-store updates, plus iterative delivery with non-technical operations/business stakeholders.”
Junior AI Engineer specializing in LLM evaluation, prompt engineering, and AI orchestration
“LLM workflow builder who has deployed a personalized GPT experience (including Delphi AI-based knowledge ingestion) and built a LangChain/LangGraph job-aggregation pipeline that ingests, normalizes/dedupes, filters, then uses an LLM to rank and summarize matches. Emphasizes production reliability with structured outputs, retries/fallbacks, metric-driven evaluation, logging/prompt versioning, and A/B testing, and collaborates with non-technical stakeholders through demo-driven iteration.”
Junior Full-Stack Software Engineer specializing in React, Node.js, AWS, and Generative AI
“Built and production-deployed a Streamlit-based PDF RAG chatbot using LangChain (FAISS, embeddings, prompt templates) and OpenAI, working around Streamlit’s stateless rerun behavior by caching the vector DB and chat history to cut latency and API cost. Demonstrates a rigorous evaluation mindset (gold datasets, unit tests, LLM-as-judge, groundedness KPIs) and has experience communicating privacy/accuracy safeguards (RBAC, data masking, citations) to a non-technical client at Kalven Technologies.”
Senior Full-Stack Software Engineer specializing in cloud-native web, mobile, and AI features
“Frontend lead for a consumer-facing social platform, owning architecture through release. Built scalable React/TypeScript systems (Redux Toolkit, Remix) with a shared Storybook component library and strong quality gates (CI, Jest/Cypress). Experienced modernizing legacy codebases incrementally with feature flags and shipping major dashboard features with staged rollouts and close QA collaboration.”
Junior Data/AI Engineer specializing in MLOps, real-time pipelines, and LLM applications
“Built an LLM-driven MLOps agent at SBD Technologies that automated an EV-charging prediction workflow end-to-end, integrating with real-time Kafka/FastAPI systems supporting 120K+ chargers at 99.99% event delivery. Addressed frequent schema drift by implementing SQLAlchemy/Flyway validation (60% reduction in drift issues) and deployed as Kubernetes microservices with GitHub Actions CI/CD; also has Airflow-based ingestion/crawling experience into Snowflake and stakeholder-facing delivery via a Fleetcharge PWA.”
Junior Software Engineer specializing in ML, RAG systems, and safety-critical risk modeling
“Backend/cloud engineer from Resilient Tech with hands-on experience deploying REST APIs and database migrations into a live ERP used by real customers while maintaining 99% uptime. Has traced intermittent AWS container timeouts to security group/load balancer misconfigurations, and has extended Python in an ERPNext system to meet GST/e-invoicing compliance requirements with strong customer collaboration.”
Intern Data Scientist specializing in GenAI agents, RAG, and ML platforms
“LLM/agent systems builder who deployed a production hybrid router for immerso.ai that dynamically selects retrieval vs reasoning vs generative pathways, achieving an 82% factual-accuracy lift. Deep hands-on experience optimizing local Mistral 7B inference (4–5 bit GGUF quantization, KV-cache reuse) and building reliable RAG/agent workflows with LangChain/LangGraph/AutoGen across GCP Cloud Run and AWS (ECS/Lambda).”
Senior Software Engineer specializing in full-stack systems, big data, and applied AI
“Built and deployed ForensicLLM, a local domain-specific LLaMA-3.1-8B model for digital forensic investigators using RAFT + RAG over 1000+ curated research papers, with citation-aware responses and rigorous evaluation (BERTScore/G-Eval). Deployed via vLLM and Docker and validated through a chatbot survey with 80+ participants; published at DFRWS EU 2025.”
Junior Software Engineer specializing in full-stack web development and test automation
“Full-stack engineer who built and owned a production workflow/kanban-style drag-and-drop system in Next.js (App Router) with Postgres/Prisma, including reusable component abstractions, Cypress E2E coverage, and post-launch performance/bug ownership. Notable for measurable impact (25% faster UI dev, ~30% query perf improvement) and for leading an incremental Express→NestJS migration that reduced technical debt (~40%) through better structure, docs, and team enablement.”
Junior Full-Stack Software Engineer specializing in cloud-native web apps and AI tooling
“Software engineer with experience across edtech, live gaming, and an AI document intelligence platform, delivering end-to-end customer-facing features and production backends. Built secure, automated live-session scheduling integrating Zoom and TalentLMS (JWT/RBAC, idempotency, transactions) cutting setup time from ~3 minutes to under 1 minute, and optimized real-time gaming dashboards/APIs with query tuning, caching, and CDN improvements (~60% latency reduction under peak load) on AWS.”
Mid-level Python Full-Stack Engineer specializing in AI microservices and cloud data platforms
“Backend-leaning full-stack engineer in fintech/payments who shipped an end-to-end Stripe payments + webhook system for a financial microservices platform, emphasizing ledger accuracy via idempotency, transactional writes, retries, and DLQs. Also delivered a real-time React/TypeScript payment status dashboard informed by user interviews, and reduced production p95 latency by 35% through PostgreSQL tuning and Redis caching on AWS.”
Junior AI/ML Engineer specializing in Generative AI, NLP, and MLOps
“LLM engineer who has deployed a production RAG system (LangChain/FAISS/FastAPI) for enterprise semantic search, tackling real-world latency with LoRA/PEFT fine-tuning and grounding outputs with retrieval. Brings strong MLOps (Docker, AWS EKS, CI/CD, MLflow) plus stakeholder-facing explainability experience using SHAP to align ML-driven financial guidance with non-technical domain experts.”
Senior Full-Stack AI/ML Engineer specializing in MLOps and GenAI
“Senior backend/data engineer who has built and maintained HIPAA-compliant, real-time clinical FastAPI services on AWS, orchestrating ML/LLM and vector DB calls with strong reliability patterns (auth, timeouts/retries, graceful degradation, idempotency). Also delivered AWS IaC/CI-CD (Terraform/Helm/GitHub Actions) across EKS/Lambda/SageMaker and built Glue/Spark ETL with schema evolution and data quality controls, plus demonstrated large SQL performance wins (15 min to <9 sec) and hands-on incident ownership.”
Mid-level Applied AI Engineer specializing in LLM services, RAG, and OCR/NLP extraction
“Backend/platform engineer who built and evolved a large-scale healthcare document processing system (OCR + LLM orchestration) in Python/FastAPI on Google Cloud (Cloud Run, GCS, Firestore), processing ~1.5M files per batch and tens of millions overall. Emphasizes reliability and operational safety via deterministic IDs, idempotent state machines, strong observability, and self-healing reconciliation, plus disciplined migrations using dual-run validation and incremental rollouts.”
Junior Backend Engineer specializing in cloud APIs and AI-enabled systems
“Built and shipped ‘OnCall Copilot,’ a production Slack-based RAG assistant that answers on-call questions from runbooks and postmortems with citations using a FAISS vector index. Emphasizes reliability and measurable performance via strict guardrails (‘no evidence, no answer’), evaluation metrics, drift monitoring, and operational hardening with Docker, logging, health checks, and offline fallback.”
Mid-level Full-Stack & XR Developer specializing in GenAI and immersive AR/VR systems
“Built and deployed a ‘personal second brain’ product (CloneMind) with an end-to-end RAG pipeline for retrieving information across PDFs, URLs, images, and audio using Next.js/Node.js/Postgres/Supabase/Redis. Demonstrates strong practical depth in retrieval quality tuning, latency reduction via caching, and stateful orchestration with LangChain/LangGraph, plus experience persuading a non-technical professor stakeholder by shipping a working prototype.”
Intern Software Engineer specializing in backend systems and distributed data pipelines
“LLM engineer with production experience building end-to-end document processing workflows that unify layout analysis, OCR, and downstream LLM reasoning. Has implemented reliability features (retries, robust error handling, OpenTelemetry logging) and built agentic systems using LangChain/CrewAI, including a student research-paper assistant, while collaborating closely with PMs and non-technical end users to reduce technical debt and simplify architectures.”
Entry-level GenAI/LLM Engineer specializing in agentic systems and RAG
“LLM/AI agent engineer with consulting/contract experience (Kanhaiya Consulting LLC) who deployed a production AI agent to automate BIM list workflows end-to-end, from database understanding and data cleaning to automated visualizations/dashboards. Worked around restricted real-time data access by generating synthetic data and improving outputs via supervised fine-tuning, and uses AWS-based LLMOps observability (Opic/OPEC) plus hybrid retrieval (vector+BM25 with reranking) to optimize relevance, latency, and cost.”
Junior Full-Stack Software Engineer specializing in AI/ML platforms and microservices
“Graduate-school lab engineer who built and owned the final architecture of a Microservices Hub that integrates REST APIs, issues API keys, monitors 10+ Linux servers, and visualizes service dependencies via a topology graph. Strong in bridging legacy and modern stacks (Dockerized and non-Dockerized services like Apache/screen) using deep Linux/networking knowledge, plus practical real-time audio streaming for STT/TTS and experience mentoring others.”
Senior Full-Stack & AI Developer specializing in Python/React, AWS, and LLM/RAG systems
“Backend Python engineer who owned the full backend build of an AI-driven platform for UK golf clubs, including FastAPI microservices, vector search, and a tuned LangChain+Pinecone RAG pipeline focused on cost and hallucination reduction. Experienced deploying Django/FastAPI/Flask stacks on AWS-backed Kubernetes with GitOps/ArgoCD-style delivery, plus executing legacy-to-AWS migrations and building Kafka-based real-time analytics pipelines.”