Pre-screened and vetted.
Mid-level Machine Learning Engineer specializing in computer vision and reinforcement learning
“Early-stage engineer with hands-on embedded prototyping experience (Arduino/Raspberry Pi) who helped build an award-winning smart glasses project enabling phone notifications via Bluetooth. Strong computer vision performance optimization background, including accelerating 120 FPS inference by moving from TensorFlow to PyTorch and deploying through ONNX + TensorRT quantization, plus Docker-based GPU deployment and CI/ML practices.”
Intern Software & AI Engineer specializing in distributed systems and LLM applications
“Stony Brook Fall 2024 capstone contributor who built a ROS2-based warehouse mobile robot prototype, owning perception and SLAM integration end-to-end. Strong in real-time robotics optimization on Jetson Orin (TensorRT/CUDA, ROS2 tracing/Nsight) and in distributed ROS2 communications (DDS discovery/QoS, MAVLink-to-ROS2 bridging), with a full simulation/testing/deployment toolchain (Gazebo, CI tests, Docker/K3s).”
Mid-level AI/ML Engineer specializing in Generative AI and LLM-powered NLP
“LLM/AI engineer who built a production automated document-understanding pipeline on Azure using a grounded RAG layer, designed to reduce manual review time for unstructured financial documents. Demonstrates strong real-world scaling and reliability practices (Service Bus queueing, Kubernetes autoscaling, observability, retries/circuit breakers) plus rigorous evaluation (shadow testing, replaying traffic, multilingual edge-case suites) and stakeholder-friendly, evidence-based explainability.”
Mid-level AI/Data Engineer specializing in agentic AI and data platforms
“AI/LLM engineer who built a production resume-parsing and candidate-matching platform at Quadrant Technologies, combining agentic LangChain workflows, VLM-based document template extraction (~85% accuracy), and a hybrid RAG backend for resume-to-JD search. Notably integrated automated LLM evals and metric-based CI/CD quality gates to catch silent prompt/model regressions, and led a 3-person team across frontend/backend/testing.”
Mid-level Software Engineer specializing in AI, backend systems, and cloud platforms
“Full-stack engineer who helped build and launch an internal genAI platform called GAIL, supporting multiple LLMs, confidential document upload for RAG pipelines, and collaborative chat. Worked across FastAPI, React/TypeScript, AWS/DynamoDB, and Azure, with notable ownership of backend RAG logic, MCP integration architecture, and frontend fixes that improved chat usability.”
Mid-Level Software Engineer specializing in backend, cloud, and scalable APIs
“Backend Python engineer who has built an LLM agentic tutoring/assignment helper with a custom pipeline for parsing visually complex textbooks (integrating AlibabaResearch VGT and implementing missing preprocessing from the paper), improving RAG grounding with ~90% cleaner extracted text. Also led major platform scaling work by refactoring monolithic image processing into Celery-based async microservices on AWS (GPU/CUDA + S3), and implemented Kafka streaming for payment webhooks with strict ordering, idempotency, and multi-zone fault tolerance.”
Junior AI Engineer specializing in LLMs, RAG, and MLOps
“At ReferU.AI, designed and deployed an agentic RAG pipeline that automates multi-jurisdiction legal document drafting, emphasizing hallucination reduction through hybrid retrieval, validation agents, guardrails, and iterative regeneration. Experienced with orchestration frameworks (especially CrewAI) and rigorous testing/evaluation practices including human-in-the-loop review, adversarial testing, and production metrics/logging.”
Mid-level AI/ML Engineer specializing in healthcare ML, MLOps, and LLM/RAG systems
“Healthcare-focused ML/LLM engineer who built a production hybrid RAG workflow to automate prior authorization by retrieving from medical guidelines/historical cases (FAISS) and generating grounded rationales for clinicians. Strong in operationalizing ML with Airflow/Kubeflow/MLflow on SageMaker, optimizing latency (ONNX/quantization/async), and reducing hallucinations via evidence-only prompting; also partnered closely with clinical ops to deploy a readmission prediction tool used in daily rounds.”
Mid-level Data Engineer specializing in AI/ML, RAG systems, and cloud data pipelines
“Built a production lead-generation system using AI agents that researches the internet for relevant leads and integrates RAG-based contact enrichment/shortlisting aligned to existing CRM data, enabling sales reps to focus more on selling. Also has hands-on AWS data orchestration experience (Glue, Step Functions) moving raw data into Redshift and evaluates agent performance with human-in-the-loop plus BLEU/perplexity metrics.”
Mid-level Full-Stack Engineer specializing in cloud-native FinTech analytics
“Full-stack/ML-leaning engineer who has shipped production-grade real-time analytics and an internal AI support assistant using RAG over enterprise documentation. Demonstrates strong systems thinking across scalability, reliability, observability, and LLM safety/evaluation (thresholded retrieval, RBAC, response validation, regression-gated evals), with concrete iteration based on performance metrics and user feedback.”
Mid-level Backend & Blockchain Engineer specializing in Cosmos SDK and EVM
“Built and productionized an LLM+RAG lending assistant on AWS to help loan officers quickly answer questions from credit policies and prior decisions, tackling hallucinations with retrieval-only responses and a no-context fallback. Also automated end-to-end ETL and model retraining/deployment using Apache Airflow, and has experience translating clinical stakeholder needs (doctors/care managers) into ML features, metrics, and dashboards.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“Built and shipped a production real-time content moderation platform for Zoom/WebEx-style meetings, combining Whisper speech-to-text with fast NLP classifiers and REST APIs to flag hate speech, bias, and HIPAA-related content under strict latency constraints. Demonstrates strong MLOps/infra depth (Airflow, Kubernetes, Terraform/Helm, observability) and a pragmatic approach to reducing false positives via threshold tuning, context validation, and hard-negative data—while partnering closely with compliance and product stakeholders.”
Junior Software Engineer specializing in AI platforms, distributed systems, and cloud infrastructure
“Software engineer with limited robotics background but deep experience building end-to-end document ingestion and image understanding systems, including a CAD-specific pipeline using a custom model to extract components and bounding boxes for user-facing visualization and Q&A. Also brings strong infrastructure/DevOps skills (Docker, Kubernetes, GitHub Actions, Terraform) with emphasis on reliability, cost optimization, and uptime.”
Mid-Level Software/AI Engineer specializing in backend systems, data pipelines, and RAG automation
“Backend engineer with experience modernizing high-traffic subscription and payment systems (TCS) by moving to event-driven Spring Boot microservices with Kafka, adding idempotency/state management to eliminate duplicate processing. Built and scaled FastAPI services for AI automation workflows (360DMMC) with versioned contracts, JWT security, and strong observability, and has led live refactors using feature flags, parallel runs, and data reconciliation.”
Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps
“Built and deployed a production LLM/RAG intelligent document understanding platform for healthcare clinical documents (notes, discharge summaries, diagnostic reports), integrating spaCy entity extraction, Pinecone vector search, and a Spring Boot API on AWS with monitoring and guardrails. Demonstrates strong MLOps/orchestration (LangChain, Airflow, Kubeflow/Kubernetes) and a metrics-driven evaluation approach, and partnered with a healthcare operations manager to cut manual review time by 80%.”
Mid-level AI/ML Engineer specializing in LLM systems and MLOps
“Built and deployed an AI tutoring assistant end-to-end at Nexora School, spanning discovery with school districts, multi-agent LangGraph/RAG architecture, AWS Bedrock migration, and post-launch stabilization. Stands out for combining hands-on LLM systems engineering with strong educator-facing trust building, FERPA-driven architecture decisions, and disciplined production practices around evals, logging, and messy document ingestion.”
Intern AI/ML Engineer specializing in LLMs, RAG, and agentic automation
“Built and deployed production NLP/LLM systems including a multilingual (5-language) health misinformation detection pipeline with latency optimization (batching/quantization/caching) and explainability (gradient-based attention visualizations). Experienced orchestrating end-to-end AI workflows with Airflow and Prefect, and partnering with customer support ops to deliver an AI agent for ticket summarization and priority classification with clear, measurable acceptance criteria.”
Junior AI Engineer & Full-Stack Developer specializing in AI agents and RAG systems
“Full-stack TypeScript/React/Next.js builder who created an end-to-end customer-facing product (AI Job Master) that generates personalized outreach from resumes and job descriptions. Demonstrates strong product + engineering ownership with rapid MVP iteration, instrumentation-driven prioritization, and pragmatic reliability patterns (microservices, queues, correlation IDs, retries) while tackling a key AI challenge: user trust and output consistency.”
Mid-level Software & ML Engineer specializing in agentic LLM systems and ML infrastructure
“Built and deployed an LLM-to-SQL automation system in a closed/internal environment, using a retriever–reranker–validator architecture on Kubernetes with strong security controls (semantic + rule-based validation and RBAC), achieving 99% uptime and cutting manual query time ~40%. Also worked on genomic sequence classification and semantic search workflows, orchestrating data prep with Airflow, tracking/deploying with MLflow, and optimizing distributed multi-GPU training on a university Kubernetes cluster.”
Entry AI Engineer specializing in LLM agents, RAG, and computer vision
“Robotics/AV-focused candidate who contributed to an F1TENTH autonomous vehicle college project, building key autonomy components from raw sensor data to driving commands. Strong in perception and state estimation (visual odometry, particle-filter localization), plus mapping (occupancy grids) and planning/control (RRT, Gap Follow, PID), with hands-on ROS tooling and simulation validation in Gazebo/RViz and ROS environment containerization using Docker.”
Mid-level GenAI Engineer specializing in LLM automation, RAG, and document intelligence
“Built and deployed a production GenAI resume screening and matching system for Florida Atlantic University, focused on improving recruiter efficiency and search relevance. Demonstrates strong RAG engineering (embeddings, query rewriting, metadata filtering, threshold tuning) plus practical reliability work (grounding constraints, fallbacks, and evaluation using real user queries) using Python REST APIs and orchestration frameworks like LangChain and LlamaIndex.”
Mid-Level Software Engineer specializing in distributed systems and cloud microservices
“Built and productionized a RAG-based semantic search system for video-derived data, focusing on measurable success metrics (p95 latency, reliability, cost/request) and strong observability (prompt versions, retrieved docs, tool calls, token usage). Experienced in diagnosing real-time issues in LLM/agentic workflows and in supporting go-to-market efforts through tailored technical demos, rapid POCs, and post-close onboarding.”
Junior Software Engineer specializing in AI and full-stack development
“Junior web developer turned applied AI builder who has shipped both user-facing web UX improvements (Vue.js + Drupal/Twig) and production LLM systems. Built a Google Cloud-hosted Llama/Ollama RAG customer-service chatbot with citation-based guardrails and a metrics-driven eval loop, and also delivered a large-scale Python pipeline analyzing 14M Amazon consumer reviews for flavor-trend detection.”
Mid-level Deployment Engineer specializing in AI integrations and data pipelines
“Built and owned enterprise data/integration deployments and production AI workflows, including a Python-based migration pipeline that moved 2M records with major improvements in onboarding speed, error rate, latency, and uptime. Also shipped a financial RAG assistant over 50K documents with sub-second p95 latency, showing a strong blend of customer-facing deployment ownership, data engineering, and LLM systems expertise.”