Pre-screened and vetted.
Senior Machine Learning Engineer specializing in LLMs, RAG, and agentic AI systems
“LLM/RAG practitioner who has taken a support-ticket triage automation system from prototype to production, building the full pipeline (fine-tuned models, FastAPI inference services, vector storage, monitoring) and delivering measurable impact (~40% reduction in triage time). Demonstrates strong operational troubleshooting of LLM/agentic workflows (observability-driven debugging, fixing agent routing/looping) and supports adoption through tailored demos and sales-aligned technical communication.”
Junior AI Engineer specializing in Generative AI, RAG, and NLP
“AI/LLM engineer who has shipped a production RAG platform at Ticker Inc. on GCP (Qdrant + Postgres) delivering sub-second retrieval over 550k+ items, with measurable gains in latency and answer quality (HNSW optimization, MMR re-ranking). Also built an asynchronous LangChain/LangGraph multi-agent research system (10x faster cycles) and partnered with Indiana University doctors on synthetic patient records and ML error analysis using clinician-friendly F1/loss dashboards.”
Junior AI/ML Developer specializing in GenAI, LLM agents, and RAG systems
“Built and shipped an agentic RAG chatbot module for NexaCLM to answer questions across large volumes of contracts while minimizing hallucinations and incorrect legal interpretations. Implemented routing between vector retrieval and ReAct-style agent retrieval plus an automated grading/validation layer (cosine-similarity thresholds, retries) and deployed via GitHub Actions to Azure Container Apps, partnering closely with legal stakeholders to define risk/clause-focused objectives.”
Mid-level AI/ML Engineer specializing in Generative AI and LLM-powered NLP
“LLM/AI engineer who built a production automated document-understanding pipeline on Azure using a grounded RAG layer, designed to reduce manual review time for unstructured financial documents. Demonstrates strong real-world scaling and reliability practices (Service Bus queueing, Kubernetes autoscaling, observability, retries/circuit breakers) plus rigorous evaluation (shadow testing, replaying traffic, multilingual edge-case suites) and stakeholder-friendly, evidence-based explainability.”
Junior Data Analyst specializing in BI, analytics, and machine learning
“Analytics professional with hands-on experience turning messy Excel-based operational data into SQL/Python pipelines and Power BI dashboards, including a production bottleneck project that improved workflow efficiency by 20%. Also brings applied machine learning experience from a Databricks/PySpark loan risk scoring project using logistic regression and XGBoost on large-scale S3 data.”
Mid-level AI/Data Engineer specializing in agentic AI and data platforms
“AI/LLM engineer who built a production resume-parsing and candidate-matching platform at Quadrant Technologies, combining agentic LangChain workflows, VLM-based document template extraction (~85% accuracy), and a hybrid RAG backend for resume-to-JD search. Notably integrated automated LLM evals and metric-based CI/CD quality gates to catch silent prompt/model regressions, and led a 3-person team across frontend/backend/testing.”
Mid-Level Software Engineer specializing in backend, cloud, and scalable APIs
“Backend Python engineer who has built an LLM agentic tutoring/assignment helper with a custom pipeline for parsing visually complex textbooks (integrating AlibabaResearch VGT and implementing missing preprocessing from the paper), improving RAG grounding with ~90% cleaner extracted text. Also led major platform scaling work by refactoring monolithic image processing into Celery-based async microservices on AWS (GPU/CUDA + S3), and implemented Kafka streaming for payment webhooks with strict ordering, idempotency, and multi-zone fault tolerance.”
Mid-level Data Engineer specializing in AI/ML, RAG systems, and cloud data pipelines
“Built a production lead-generation system using AI agents that researches the internet for relevant leads and integrates RAG-based contact enrichment/shortlisting aligned to existing CRM data, enabling sales reps to focus more on selling. Also has hands-on AWS data orchestration experience (Glue, Step Functions) moving raw data into Redshift and evaluates agent performance with human-in-the-loop plus BLEU/perplexity metrics.”
Mid-Level Software Engineer specializing in AWS microservices and distributed systems
“CloudData engineer who productionized an LLM assistant for a warehouse/logistics customer by wrapping it as a versioned, containerized API with guardrails, deterministic post-processing, and full observability. Experienced diagnosing real-time RAG/agentic incidents (latency spikes and confident-wrong answers) using trace-based isolation, replay in staging, retrieval tuning, and canary releases. Regularly runs technical demos/workshops and partners with sales on security/IAM, SLAs, and pilot rollouts to drive adoption.”
Mid-level Data Scientist specializing in ML, LLM pipelines, and MLOps
“Built and deployed a production LLM-driven document understanding pipeline using LangChain/LangGraph, focusing on reliability via step-by-step prompting, validation checks, and monitoring. Also partnered with non-technical marketing stakeholders at Heartland Community Network to deliver an XGBoost targeting model surfaced in Power BI, improving campaign conversion by 12%.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“Built and shipped a production real-time content moderation platform for Zoom/WebEx-style meetings, combining Whisper speech-to-text with fast NLP classifiers and REST APIs to flag hate speech, bias, and HIPAA-related content under strict latency constraints. Demonstrates strong MLOps/infra depth (Airflow, Kubernetes, Terraform/Helm, observability) and a pragmatic approach to reducing false positives via threshold tuning, context validation, and hard-negative data—while partnering closely with compliance and product stakeholders.”
Intern Data Scientist specializing in machine learning and NLP
“Analytics-focused early-career candidate with internship experience owning reporting and system performance analysis projects end to end. They combine SQL data preparation, Python automation, and dashboard delivery with measurable impact, including roughly 50% less manual reporting and about 20% better forecast accuracy.”
Mid-level AI/ML Engineer specializing in LLM systems and MLOps
“Built and deployed an AI tutoring assistant end-to-end at Nexora School, spanning discovery with school districts, multi-agent LangGraph/RAG architecture, AWS Bedrock migration, and post-launch stabilization. Stands out for combining hands-on LLM systems engineering with strong educator-facing trust building, FERPA-driven architecture decisions, and disciplined production practices around evals, logging, and messy document ingestion.”
Junior Data Engineer specializing in LLM agents and RAG pipelines
“Built and deployed “ApartmentFinder AI,” a multi-agent system using Google ADK, Gemini, and Google Maps MCP to automate apartment shortlisting and commute-time analysis, cutting a 45–70 minute user workflow down to ~30 seconds. Also has strong delivery/process chops from serving as an SDLC Release Coordinator, managing 52+ releases and reducing SDLC issues by 84%.”
Mid-level Software & ML Engineer specializing in agentic LLM systems and ML infrastructure
“Built and deployed an LLM-to-SQL automation system in a closed/internal environment, using a retriever–reranker–validator architecture on Kubernetes with strong security controls (semantic + rule-based validation and RBAC), achieving 99% uptime and cutting manual query time ~40%. Also worked on genomic sequence classification and semantic search workflows, orchestrating data prep with Airflow, tracking/deploying with MLflow, and optimizing distributed multi-GPU training on a university Kubernetes cluster.”
Mid-level Full-Stack Software Engineer specializing in cloud, data pipelines, and GenAI
“Full-stack engineer currently building an employee management system end-to-end with React, Node/Express, and PostgreSQL, including JWT auth and RBAC. Previously worked at TCS on large-scale State Bank of India web applications, applying Redis caching, server-side pagination/filtering, and async job offloading to improve performance and reliability.”
Mid-level Data Scientist specializing in NLP, recommender systems, and ML deployment
“At Provenbase, built and shipped a production LLM-powered semantic search and candidate matching platform (RAG with GPT-4/Gemini, multi-agent orchestration, Elasticsearch vector search) to scale sourcing across 10M+ candidate records and 1000+ data sources. Drove sub-second performance, cut LLM spend 30% with routing/caching, and improved recruiting outcomes (+45% sourcing accuracy; +38% visibility of underrepresented talent) through bias-aware ranking and tight collaboration with recruiting stakeholders.”
Junior Machine Learning Engineer specializing in predictive modeling and GenAI RAG systems
“LLM engineer who built and deployed an emotionally intelligent AAC communication system using an emotion-aware RAG pipeline (Empathetic Dialogues + GoEmotions) and a PEFT-adapted model. Experienced with LangChain/LangGraph and custom Python orchestration, focusing on reliability (guards, schema validation, fallbacks), latency optimization, and rigorous evaluation (automatic metrics + human-in-the-loop), with a reported 18% user satisfaction improvement.”
Mid-level AI/ML Engineer specializing in production ML, MLOps, and NLP
“Built and deployed a transformer-based clinical document classification system that processes unstructured clinical notes in a HIPAA-compliant healthcare setting, served via FastAPI on AWS and integrated into an Airflow/S3 pipeline. Demonstrates strong end-to-end MLOps skills (data quality remediation, low-latency inference optimization, monitoring with MLflow/CloudWatch) and effective collaboration with clinicians to drive adoption.”
Junior Machine Learning Engineer specializing in multimodal systems and LLMs
“Built and productionized a domain-specific LLM-powered RAG knowledge assistant at JerseyStem for answering questions over large internal document corpora, owning the full stack from FAISS retrieval and LoRA/QLoRA fine-tuning to AWS autoscaling GPU deployment. Drove measurable gains (28% accuracy lift, 25% latency reduction) and improved reliability through hybrid retrieval, grounded decoding, preference-model reranking, and Airflow-orchestrated pipelines (35% faster runtime), while partnering closely with non-technical stakeholders to define success metrics and ensure adoption.”
Mid-Level Backend Software Engineer specializing in scalable cloud systems and LLM automation
“JavaScript engineer with open-source experience on a database visualization library, focused on real-time rendering performance for large datasets (virtualized DOM rendering, requestAnimationFrame/debouncing, memoization) and on raising project quality via tests and CI performance benchmarks. Also built Kafka-based messaging documentation and sample producer/consumer apps to speed onboarding, and has experience diagnosing production issues including concurrency-related duplicate data problems.”
Junior Machine Learning Engineer specializing in LLM fine-tuning and semantic retrieval
“Backend engineer with legal-tech and AI workflow experience: built JurisAI, an end-to-end legal research system using OCR + embeddings + Pinecone vector search to deliver citation-grounded LLM answers with safe failure modes (~90% recall@K). Also led a GW Law metadata migration into Caspio with batch validation and parallel rollout, and has strong FastAPI/GCP production reliability and observability practices.”
Mid-level AI Engineer & Researcher specializing in healthcare AI and multimodal LLM systems
“Backend/ML engineer focused on clinical AI transparency who built ShifaMind, an explainability-enforced clinical ML system using UMLS/MIMIC-IV/PubMed data with RAG, GraphSAGE, and cross-attention. Demonstrated strong production engineering via FastAPI API design and safe migrations (feature flags/shadow inference), plus HIPAA-aligned auth/RLS patterns; also delivered a real-time comet detection system reaching 97.7% accuracy.”
Entry-level Full-Stack Engineer specializing in AI and distributed systems
“Full-stack engineer who built an AI-based inventory/procurement query system at Botlily/Botlerly using Flask and Google Sheets as a live knowledge base, overcoming Sheets latency with caching and structured in-memory models. Demonstrated strong LLM product engineering (40% accuracy improvement via preprocessing/prompting) and customer-driven iteration with bar/restaurant owners, evolving the tool into a more comprehensive inventory management and forecasting solution.”