Pre-screened and vetted.
Mid-level Full-Stack Developer specializing in healthcare and scalable web platforms
“Software engineer experienced delivering customer-facing, real-time industrial monitoring dashboards (motors/shafts/turbines) by partnering directly with end users to refine charts, alerts, and performance. Strong in API/platform integrations and production troubleshooting—uses feature flags, logging, validation/mapping, containerization, and performance testing to keep systems stable while iterating quickly.”
Senior Machine Learning Engineer specializing in LLMs, RAG, and agentic AI systems
“LLM/RAG practitioner who has taken a support-ticket triage automation system from prototype to production, building the full pipeline (fine-tuned models, FastAPI inference services, vector storage, monitoring) and delivering measurable impact (~40% reduction in triage time). Demonstrates strong operational troubleshooting of LLM/agentic workflows (observability-driven debugging, fixing agent routing/looping) and supports adoption through tailored demos and sales-aligned technical communication.”
Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and agentic workflows
“Applied AI/ML engineer with hands-on production experience building a RAG-based AI assistant for pharmaceutical maintenance troubleshooting using LangChain + FAISS/Pinecone, including a custom normalization layer to handle inconsistent terminology and duplicate document revisions. Also built Airflow-orchestrated pipelines for document ingestion/embeddings and predictive maintenance workflows (SCADA ETL, drift-based retraining), and partnered closely with production supervisors/quality engineers via Power BI dashboards and real-time alerts.”
Mid-level AI/ML Engineer specializing in Generative AI and RAG systems
“LLM/RAG engineer who has built and shipped production assistants, including a RAG-based teaching assistant (Marvel AI) using LangChain/LlamaIndex/ChromaDB with OpenAI embeddings and Redis vector search, achieving ~30% accuracy gains and ~35% latency reduction. Also deployed FastAPI services on Google Cloud Run with observability and prompt-level monitoring, and partnered with non-technical ops stakeholders to deliver an internal policy-document RAG assistant.”
Mid-level Software Engineer specializing in Generative AI automation and secure platforms
“Backend/security-focused engineer from VeroTX who built an IdP service (Spring Boot + MongoDB on GCP) for an AI workflow platform and drove major latency improvements via caching and query/index optimization. Also shipped an AI loan-processing agent using LangChain/LangGraph, owning the document ingestion + vector database layer and designing a reliable multi-step workflow with retries, monitoring, and human-in-the-loop safeguards.”
Junior AI Engineer specializing in Generative AI, RAG, and NLP
“AI/LLM engineer who has shipped a production RAG platform at Ticker Inc. on GCP (Qdrant + Postgres) delivering sub-second retrieval over 550k+ items, with measurable gains in latency and answer quality (HNSW optimization, MMR re-ranking). Also built an asynchronous LangChain/LangGraph multi-agent research system (10x faster cycles) and partnered with Indiana University doctors on synthetic patient records and ML error analysis using clinician-friendly F1/loss dashboards.”
Mid-level AI Engineer specializing in causal inference and LLM research
“LLM engineer who has deployed a production system combining LLMs with causal inference (DoWhy) to enable counterfactual “what-if” analysis for experimental research, including a robust variable-mapping/validation layer to reduce hallucinations. Also partnered with non-technical operations leadership at Irriion Technologies to deliver an AI-assisted onboarding workflow that cut onboarding time by 50% and reduced manual errors by ~40%.”
Intern Backend Developer specializing in AI, multi-agent systems, and computer vision
“Backend-focused Python engineer who built core systems for an AI beauty-advice product: converting facial-recognition landmarks into usable facial measurements and dynamically shaping chatbot context for personalized guidance. Also worked on high-volume data ingestion at AINVESTgroup, improving agent context selection via a RAG database when upstream tags were unreliable, and has strong Git/GitOps + automated testing practices from rapid-deadline delivery environments.”
Junior AI/ML Developer specializing in GenAI, LLM agents, and RAG systems
“Built and shipped an agentic RAG chatbot module for NexaCLM to answer questions across large volumes of contracts while minimizing hallucinations and incorrect legal interpretations. Implemented routing between vector retrieval and ReAct-style agent retrieval plus an automated grading/validation layer (cosine-similarity thresholds, retries) and deployed via GitHub Actions to Azure Container Apps, partnering closely with legal stakeholders to define risk/clause-focused objectives.”
Mid-Level Software Engineer specializing in cloud-native microservices
“Built and shipped both a solo real-time multiplayer Spades game (TypeScript monorepo with shared client/server engine) and a production internal LLM-powered document Q&A tool for a SaaS company. Demonstrates strong RAG pipeline design (Pinecone + embeddings + reranking), rigorous eval/regression practices, and pragmatic data ingestion/observability work across Confluence, Notion, and messy PDFs/OCR—backed by clear metric improvements (P@1 61%→78%, escalations 40%→22%).”
Intern Software & AI Engineer specializing in distributed systems and LLM applications
“Stony Brook Fall 2024 capstone contributor who built a ROS2-based warehouse mobile robot prototype, owning perception and SLAM integration end-to-end. Strong in real-time robotics optimization on Jetson Orin (TensorRT/CUDA, ROS2 tracing/Nsight) and in distributed ROS2 communications (DDS discovery/QoS, MAVLink-to-ROS2 bridging), with a full simulation/testing/deployment toolchain (Gazebo, CI tests, Docker/K3s).”
Entry-Level Full-Stack Software Engineer specializing in serverless AWS and AI applications
“Built and deployed serverless AWS applications (Lambda/S3/RDS Proxy) including a NASA L’Space React + Python data analysis tool, focusing on performance for large datasets. Demonstrates strong cloud troubleshooting across compute and networking (CloudWatch-driven diagnosis, EC2 scaling, security group fixes) and a user-driven iteration loop that improved product usability with dynamic filtering and interactive UI.”
Mid-level AI/ML Engineer specializing in Generative AI and LLM-powered NLP
“LLM/AI engineer who built a production automated document-understanding pipeline on Azure using a grounded RAG layer, designed to reduce manual review time for unstructured financial documents. Demonstrates strong real-world scaling and reliability practices (Service Bus queueing, Kubernetes autoscaling, observability, retries/circuit breakers) plus rigorous evaluation (shadow testing, replaying traffic, multilingual edge-case suites) and stakeholder-friendly, evidence-based explainability.”
Mid-level AI/Data Engineer specializing in agentic AI and data platforms
“AI/LLM engineer who built a production resume-parsing and candidate-matching platform at Quadrant Technologies, combining agentic LangChain workflows, VLM-based document template extraction (~85% accuracy), and a hybrid RAG backend for resume-to-JD search. Notably integrated automated LLM evals and metric-based CI/CD quality gates to catch silent prompt/model regressions, and led a 3-person team across frontend/backend/testing.”
Junior Software Engineer specializing in AI platforms and backend systems
“Built and shipped AI products at Humanitarians AI, including a full-stack multi-agent platform that consolidated six faculty AI tools into one interface and achieved 100+ user adoption, 70% less workflow switching, and a 6x latency improvement. Also designed a grounded document parser using FAISS and structured LLM outputs that reduced hallucinations by 60%, showing strong depth in both product-minded engineering and production AI systems.”
Mid Software Engineer specializing in backend distributed systems and AI/RAG platforms
“Full-stack engineer with hands-on ownership of a production AI knowledge assistant used by 10,000+ daily users. Combines React/Next.js frontend work with FastAPI, AWS serverless, and RAG architecture using GPT-4, LangChain, and Pinecone, with measurable impact on relevance, latency, uptime, and support deflection.”
Mid-Level Software Engineer specializing in backend, cloud, and scalable APIs
“Backend Python engineer who has built an LLM agentic tutoring/assignment helper with a custom pipeline for parsing visually complex textbooks (integrating AlibabaResearch VGT and implementing missing preprocessing from the paper), improving RAG grounding with ~90% cleaner extracted text. Also led major platform scaling work by refactoring monolithic image processing into Celery-based async microservices on AWS (GPU/CUDA + S3), and implemented Kafka streaming for payment webhooks with strict ordering, idempotency, and multi-zone fault tolerance.”
Junior Machine Learning Engineer specializing in LLMs, NLP, and MLOps
“Developed and productionized VL-Mate, a vision-language, LLM-powered assistant aimed at helping visually impaired users understand their surroundings and query internal knowledge. Emphasizes reliability and safety via confidence thresholds, uncertainty-aware fallbacks, hallucination grounding checks, and rigorous offline + user-in-the-loop evaluation, with experience orchestrating multi-step LLM pipelines (LangChain-style and custom Python async) and deploying on containerized infrastructure.”
Mid-level Robotics Engineer specializing in simulation-to-real ML control
“Robotics/ML engineer who benchmarks and adapts open-source robot action models, building synthetic datasets in Isaac Sim and modifying vendor code to scale training across multiple GPUs. Also built a production-style computer vision pipeline at Zortag—training a tiny YOLO-based classifier for fake-vs-real label detection and deploying it in a real-time iOS app with additional display/spoof detection.”
Mid-level AI/ML Engineer specializing in healthcare ML, MLOps, and LLM/RAG systems
“Healthcare-focused ML/LLM engineer who built a production hybrid RAG workflow to automate prior authorization by retrieving from medical guidelines/historical cases (FAISS) and generating grounded rationales for clinicians. Strong in operationalizing ML with Airflow/Kubeflow/MLflow on SageMaker, optimizing latency (ONNX/quantization/async), and reducing hallucinations via evidence-only prompting; also partnered closely with clinical ops to deploy a readmission prediction tool used in daily rounds.”
Mid-level Backend & Blockchain Engineer specializing in Cosmos SDK and EVM
“Built and productionized an LLM+RAG lending assistant on AWS to help loan officers quickly answer questions from credit policies and prior decisions, tackling hallucinations with retrieval-only responses and a no-context fallback. Also automated end-to-end ETL and model retraining/deployment using Apache Airflow, and has experience translating clinical stakeholder needs (doctors/care managers) into ML features, metrics, and dashboards.”
Intern AI/GenAI Engineer specializing in NLP, RAG, and Snowflake Cortex
“Built and deployed a production AI invention/patent review platform that compares invention submissions against patent rules to provide instant feedback, reportedly cutting legal team review time by ~80%. Learned Snowflake Cortex LLMs and production deployment (Docker + AWS) on the job, and validated system quality through human-in-the-loop testing with experienced legal stakeholders.”
Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps
“Built and deployed a production LLM/RAG intelligent document understanding platform for healthcare clinical documents (notes, discharge summaries, diagnostic reports), integrating spaCy entity extraction, Pinecone vector search, and a Spring Boot API on AWS with monitoring and guardrails. Demonstrates strong MLOps/orchestration (LangChain, Airflow, Kubeflow/Kubernetes) and a metrics-driven evaluation approach, and partnered with a healthcare operations manager to cut manual review time by 80%.”