Pre-screened and vetted.
Mid-level AI Engineer specializing in NLP and production ML systems
“AI/LLM engineer who has shipped production RAG chatbots using LangChain/OpenAI with FAISS and FastAPI, focusing on real-world constraints like context windows, concurrency, and latency (reported ~40% latency reduction and <2s average response). Experienced orchestrating AI pipelines with Celery and fault-tolerant long-running workflows with Temporal, and has applied NLP model tradeoff testing (Word2Vec vs BERT) to drive measurable accuracy gains.”
Junior Software Engineer specializing in backend, cloud, and LLM-powered search
“Python backend engineer (BetterWorld Technology) who owns microservice systems end-to-end on Azure, including Kubernetes deployments, CI/CD, and production monitoring/alerting. Has hands-on experience integrating SQL/NoSQL (including Cosmos DB with vector search/graph workflow) and has built a Kafka + Spark Streaming pipeline to Snowflake with a reported 40% latency reduction.”
Mid-level Data Scientist specializing in insurance, healthcare, and cloud analytics
“Built a production-style LLM document summarization/generation workflow that mitigates token limits and reduces hallucinations using semantic chunking, FAISS-based embedding retrieval (top-k via cosine similarity), and section-wise generation. Orchestrated the end-to-end pipeline with AWS Step Functions and aligned outputs with sales stakeholders through demos, visuals, and documentation.”
Mid-level AI/ML Engineer specializing in Generative AI and LLM-powered NLP
“LLM/AI engineer who built a production automated document-understanding pipeline on Azure using a grounded RAG layer, designed to reduce manual review time for unstructured financial documents. Demonstrates strong real-world scaling and reliability practices (Service Bus queueing, Kubernetes autoscaling, observability, retries/circuit breakers) plus rigorous evaluation (shadow testing, replaying traffic, multilingual edge-case suites) and stakeholder-friendly, evidence-based explainability.”
Senior Machine Learning Engineer specializing in LLMs, RAG, and agentic AI systems
“LLM/RAG practitioner who has taken a support-ticket triage automation system from prototype to production, building the full pipeline (fine-tuned models, FastAPI inference services, vector storage, monitoring) and delivering measurable impact (~40% reduction in triage time). Demonstrates strong operational troubleshooting of LLM/agentic workflows (observability-driven debugging, fixing agent routing/looping) and supports adoption through tailored demos and sales-aligned technical communication.”
Mid-level AI/ML Engineer specializing in Generative AI and RAG systems
“LLM/RAG engineer who has built and shipped production assistants, including a RAG-based teaching assistant (Marvel AI) using LangChain/LlamaIndex/ChromaDB with OpenAI embeddings and Redis vector search, achieving ~30% accuracy gains and ~35% latency reduction. Also deployed FastAPI services on Google Cloud Run with observability and prompt-level monitoring, and partnered with non-technical ops stakeholders to deliver an internal policy-document RAG assistant.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“Built and shipped a production real-time content moderation platform for Zoom/WebEx-style meetings, combining Whisper speech-to-text with fast NLP classifiers and REST APIs to flag hate speech, bias, and HIPAA-related content under strict latency constraints. Demonstrates strong MLOps/infra depth (Airflow, Kubernetes, Terraform/Helm, observability) and a pragmatic approach to reducing false positives via threshold tuning, context validation, and hard-negative data—while partnering closely with compliance and product stakeholders.”
Junior AI Engineer specializing in LLMs, RAG, and MLOps
“At ReferU.AI, designed and deployed an agentic RAG pipeline that automates multi-jurisdiction legal document drafting, emphasizing hallucination reduction through hybrid retrieval, validation agents, guardrails, and iterative regeneration. Experienced with orchestration frameworks (especially CrewAI) and rigorous testing/evaluation practices including human-in-the-loop review, adversarial testing, and production metrics/logging.”
Mid-level Backend Software Engineer specializing in Python/FastAPI and cloud-native microservices
“Backend engineer who evolved Coca-Cola bottlers' Trade Promotion Optimization platform at Coke One North America, building domain-focused microservices in Node.js and Python (Flask/FastAPI) with PostgreSQL. Experienced in multi-tenant security (OAuth2/JWT, RBAC, row-level scoping by bottler/region), API contract/versioning discipline, and Azure DevOps-driven incremental rollouts with strong observability.”
Mid-level Full-Stack Engineer specializing in cloud-native FinTech analytics
“Full-stack/ML-leaning engineer who has shipped production-grade real-time analytics and an internal AI support assistant using RAG over enterprise documentation. Demonstrates strong systems thinking across scalability, reliability, observability, and LLM safety/evaluation (thresholded retrieval, RBAC, response validation, regression-gated evals), with concrete iteration based on performance metrics and user feedback.”
Mid-level Full-Stack Developer specializing in cloud-native microservices and AI/ML
“Full-stack/AI engineer who has shipped production systems spanning real-time analytics dashboards and an internal LLM-powered knowledge assistant. Experienced with RAG pipelines (embeddings/vector DB, semantic retrieval, query rewriting) plus evaluation loops and guardrails, and builds observable Kafka-based data pipelines monitored with Prometheus/Grafana.”
Intern AI/ML Engineer specializing in LLMs, RAG, and agentic automation
“Built and deployed production NLP/LLM systems including a multilingual (5-language) health misinformation detection pipeline with latency optimization (batching/quantization/caching) and explainability (gradient-based attention visualizations). Experienced orchestrating end-to-end AI workflows with Airflow and Prefect, and partnering with customer support ops to deliver an AI agent for ticket summarization and priority classification with clear, measurable acceptance criteria.”
Mid-Level Software Engineer specializing in LLM applications, RAG, and OCR automation
“At Trellis, built and shipped a production multi-agent, authenticated GenAI chatbot for sensitive financial account inquiries (loan/payment lookups), using dynamic model routing to control latency and cost while improving accuracy. Implemented prompt-injection defenses (Meta Prompt Guard), RAG with LangChain, and LLM-as-a-judge evaluation; the system cut manual support call volume by 40%+ and was refined through close collaboration with QA-driven user testing.”
Mid-level GenAI Engineer specializing in LLM automation, RAG, and document intelligence
“Built and deployed a production GenAI resume screening and matching system for Florida Atlantic University, focused on improving recruiter efficiency and search relevance. Demonstrates strong RAG engineering (embeddings, query rewriting, metadata filtering, threshold tuning) plus practical reliability work (grounding constraints, fallbacks, and evaluation using real user queries) using Python REST APIs and orchestration frameworks like LangChain and LlamaIndex.”
Mid-level Software & ML Engineer specializing in agentic LLM systems and ML infrastructure
“Built and deployed an LLM-to-SQL automation system in a closed/internal environment, using a retriever–reranker–validator architecture on Kubernetes with strong security controls (semantic + rule-based validation and RBAC), achieving 99% uptime and cutting manual query time ~40%. Also worked on genomic sequence classification and semantic search workflows, orchestrating data prep with Airflow, tracking/deploying with MLflow, and optimizing distributed multi-GPU training on a university Kubernetes cluster.”
Mid-level Data Scientist specializing in NLP, recommender systems, and ML deployment
“At Provenbase, built and shipped a production LLM-powered semantic search and candidate matching platform (RAG with GPT-4/Gemini, multi-agent orchestration, Elasticsearch vector search) to scale sourcing across 10M+ candidate records and 1000+ data sources. Drove sub-second performance, cut LLM spend 30% with routing/caching, and improved recruiting outcomes (+45% sourcing accuracy; +38% visibility of underrepresented talent) through bias-aware ranking and tight collaboration with recruiting stakeholders.”
Mid-Level Software Engineer specializing in distributed systems and cloud microservices
“Built and productionized a RAG-based semantic search system for video-derived data, focusing on measurable success metrics (p95 latency, reliability, cost/request) and strong observability (prompt versions, retrieved docs, tool calls, token usage). Experienced in diagnosing real-time issues in LLM/agentic workflows and in supporting go-to-market efforts through tailored technical demos, rapid POCs, and post-close onboarding.”
Junior Software Engineer specializing in backend microservices and AI/ML automation
Mid-level Prompt Engineer specializing in NLP, LLMs, and RAG systems
Senior Software Engineer specializing in cloud-native platforms, microservices, and AI/ML systems
Mid-level Customer Success/Support Engineer specializing in Healthcare IT and automation
Mid-Level Software Engineer specializing in LLM and RAG applications