Pre-screened and vetted.
Junior Machine Learning Engineer specializing in cloud-based ML and automation
“Built and shipped a production multi-agent LLM system at Solena that automated internal project intake, validation, reporting, and stakeholder communications using Python, SQL, and LangChain, with strong emphasis on reliability (structured validation, safe defaults, logging, and state tracking). Also used LangGraph to orchestrate a multi-step video summarization pipeline, and has experience partnering with non-technical stakeholders to define “completion” criteria and reporting needs.”
Senior Software Engineer specializing in Backend Systems and Generative AI (RAG)
“Backend engineer with experience building an end-to-end civic tech AI platform that ingests city council meeting videos, transcribes them with Whisper, and enables natural-language Q&A via a LangChain/FAISS RAG pipeline. Demonstrated strong systems thinking by tuning retrieval for accuracy/latency/memory (cutting response time ~3s→1s and memory ~500MB→25MB) and by safely migrating an ERP from monolith toward services using dual writes, reconciliation, and idempotency to protect financial workflows.”
Mid-level AI/ML Software Engineer specializing in GPU-optimized LLM inference and cloud microservices
“Built and deployed a production RAG-based multilingual analytics assistant for healthcare operations, enabling non-technical teams to query claims/EHR and risk metrics with grounded explanations. Demonstrates strong end-to-end LLM system engineering (retrieval tuning, re-ranking, hallucination controls, verification layers) plus workflow orchestration (Airflow/Composer/Step Functions) and stakeholder-driven iteration via prototypes and dashboards.”
Mid-level AI/ML Engineer specializing in LLM agents, RAG retrieval, and IoT ML systems
“Built production LLM-driven products including a job-hunt AI (job ranking + resume optimization) and an InterviewAI agentic pipeline using LangChain. Focused on practical deployment concerns like securing OpenAI usage via rate limiting and tiered quotas, and demonstrates an applied approach to choosing models, retrieval methods (RAG), and prompting strategies.”
Intern Machine Learning Engineer specializing in Generative AI and RAG systems
“Early-career AI/LLM builder who created and deployed a multi-agent news analysis agent (Patrakarita) using CrewAI, coordinating researcher/analyst roles to turn noisy article URLs into structured, prioritized outputs (claims, tone, verification questions, opposing views). Strong focus on orchestration debugging and reliability evaluation, including measuring hallucination/redundancy and improving reasoning by refactoring pipeline sequencing.”
Mid-level GenAI Engineer specializing in LLM agents and RAG systems
“Built and deployed a production RAG-based LLM assistant that answers day-to-day operational questions from internal PDFs/SOPs, with strong emphasis on data consistency (metadata versioning, confidence thresholds, conflict handling) and low-latency retrieval at scale. Experienced designing and orchestrating multi-agent LLM workflows (retrieval/validation/generation) and pipeline orchestration for ingestion/embedding/vector-store updates, plus iterative delivery with non-technical operations/business stakeholders.”
Junior Machine Learning & Backend Engineer specializing in LLM systems and ML infrastructure
“Built and deployed production RAG-based document search/Q&A systems (DocChat and an internship marketing RAG), using a React + FastAPI stack on GCP with docs stored in GCP buckets and retrieval via embeddings/vector DB. Emphasizes cost/performance tradeoffs (reported ~40% cost reduction) and ships via Docker (Railway), with load/API testing using JMeter and Swagger; regularly collaborates with a CEO stakeholder to iterate and push changes to production.”
Junior AI Engineer specializing in LLM evaluation, prompt engineering, and AI orchestration
“LLM workflow builder who has deployed a personalized GPT experience (including Delphi AI-based knowledge ingestion) and built a LangChain/LangGraph job-aggregation pipeline that ingests, normalizes/dedupes, filters, then uses an LLM to rank and summarize matches. Emphasizes production reliability with structured outputs, retries/fallbacks, metric-driven evaluation, logging/prompt versioning, and A/B testing, and collaborates with non-technical stakeholders through demo-driven iteration.”
Junior Data/AI Engineer specializing in MLOps, real-time pipelines, and LLM applications
“Built an LLM-driven MLOps agent at SBD Technologies that automated an EV-charging prediction workflow end-to-end, integrating with real-time Kafka/FastAPI systems supporting 120K+ chargers at 99.99% event delivery. Addressed frequent schema drift by implementing SQLAlchemy/Flyway validation (60% reduction in drift issues) and deployed as Kubernetes microservices with GitHub Actions CI/CD; also has Airflow-based ingestion/crawling experience into Snowflake and stakeholder-facing delivery via a Fleetcharge PWA.”
Intern Data Scientist specializing in GenAI agents, RAG, and ML platforms
“LLM/agent systems builder who deployed a production hybrid router for immerso.ai that dynamically selects retrieval vs reasoning vs generative pathways, achieving an 82% factual-accuracy lift. Deep hands-on experience optimizing local Mistral 7B inference (4–5 bit GGUF quantization, KV-cache reuse) and building reliable RAG/agent workflows with LangChain/LangGraph/AutoGen across GCP Cloud Run and AWS (ECS/Lambda).”
Mid-level Python Full-Stack Engineer specializing in AI microservices and cloud data platforms
“Backend-leaning full-stack engineer in fintech/payments who shipped an end-to-end Stripe payments + webhook system for a financial microservices platform, emphasizing ledger accuracy via idempotency, transactional writes, retries, and DLQs. Also delivered a real-time React/TypeScript payment status dashboard informed by user interviews, and improved production performance by 35% p95 latency through PostgreSQL tuning and Redis caching on AWS.”
Mid-level Full-Stack & XR Developer specializing in GenAI and immersive AR/VR systems
“Built and deployed a "personal second brain" product (CloneMind) with an end-to-end RAG pipeline for retrieving information across PDFs, URLs, images, and audio using Next.js/Node.js/Postgres/Supabase/Redis. Demonstrates strong practical depth in retrieval quality tuning, latency reduction via caching, and stateful orchestration with LangChain/LangGraph, plus experience persuading a non-technical professor stakeholder by shipping a working prototype.”
Entry-Level GenAI/LLM Engineer specializing in agentic systems and RAG
“LLM/AI agent engineer with consulting/contract experience (Kanhaiya Consulting LLC) who deployed a production AI agent to automate BIM list workflows end-to-end—from database understanding and data cleaning to automated visualizations/dashboards. Worked around restricted real-time data access by generating synthetic data and improving outputs via supervised fine-tuning, and uses AWS-based LLMOps observability (Opic/OPEC) plus hybrid retrieval (vector+BM25 with reranking) to optimize relevance, latency, and cost.”
Mid-level AI Engineer specializing in GenAI, agentic workflows, and RAG systems
“Built a production multi-agent RAG assistant using LangChain/LangGraph with OpenAI embeddings and FAISS, focusing on retrieval quality and latency (Redis caching, parallel retrieval, precomputed embeddings). Experienced orchestrating ETL/ML pipelines with Airflow and Databricks Workflows, and has delivered an AI assistant for business ops to extract insights from policy/compliance documents through close non-technical stakeholder collaboration.”
Junior AI/ML Engineer specializing in Generative and Agentic AI
“Built and deployed a production-grade LLM agent for credit management and accounts receivable automation, integrating ERP/MySQL data via a RAG pipeline and exposing services through FastAPI with Pydantic-validated outputs on AWS Bedrock. Emphasizes reliability and compliance for financial operations using schema validation and human-in-the-loop review, reporting ~32% reduction in manual work and ~41% improvement in response time/reliability.”
Intern Data Scientist specializing in machine learning, NLP, and LLM fine-tuning
“Built a production-style AI meeting summarization and action-item extraction system (Azure Speech-to-Text + transformer summarization/NER) exposed via a Flask REST API, with explicit guardrails to prevent hallucinated tasks. Strong focus on reliability: modular agent/workflow design, precision-first evaluation with human-validated golden notes, and practical orchestration patterns (tool-augmented agents; ready to scale into Airflow/LangGraph/Prefect).”
Mid-level AI/ML Engineer specializing in GenAI, NLP, and production MLOps
“AI/LLM engineer who built and deployed a production healthcare RAG chatbot ("DoctorBot") with strict medical safety guardrails, an 85% confidence-gated verification layer, and latency optimizations that cut responses from ~8s to ~2–3s. Also worked on finflow.ai to generate finance/banking test cases from BRDs, collaborating closely with non-technical domain stakeholders, and has hands-on orchestration experience with LangChain/LangGraph and agentic evaluation/monitoring practices.”
“Built a production AI-powered university marking system that automates question generation and grading from PDF course materials using a RAG pipeline (S3 + Pinecone) orchestrated with LangChain/LangGraph and deployed on AWS ECS via Docker/ECR and GitHub Actions CI/CD. Addressed a key real-world LLM challenge—grading consistency—by implementing rubric-based scoring, retrieval re-ranking, and standardized context summarization, validated against human instructors.”
Junior AI Engineer specializing in LLM agents, RAG, and MLOps
Mid-Level Full-Stack Software Engineer specializing in cloud and analytics platforms
Mid-level Generative AI Engineer specializing in LLMs and RAG for enterprise and FinTech
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and agentic automation
Junior Applied AI Engineer specializing in conversational and voice agent platforms