Pre-screened and vetted.
Junior AI Engineer specializing in Generative AI, RAG, and NLP
“AI/LLM engineer who has shipped a production RAG platform at Ticker Inc. on GCP (Qdrant + Postgres) delivering sub-second retrieval over 550k+ items, with measurable gains in latency and answer quality (HNSW optimization, MMR re-ranking). Also built an asynchronous LangChain/LangGraph multi-agent research system (10x faster cycles) and partnered with Indiana University doctors on synthetic patient records and ML error analysis using clinician-friendly F1/loss dashboards.”
Mid-level AI Engineer specializing in ML, NLP, and Generative AI
“AI/LLM engineer with production experience building an LLM-powered investment recommendation system using RAG and chatbots, deployed via Docker/CI/CD and scaled on Kubernetes. Demonstrated measurable performance wins (sub-200ms latency) through QLoRA fine-tuning and TensorRT INT8/INT4 quantization, plus strong MLOps/orchestration background (Airflow ETL + scoring, MLflow monitoring) and stakeholder-facing delivery using demos and Tableau dashboards.”
Entry-Level Backend Engineer specializing in analytics automation and cloud data pipelines
“Forward Deployment Engineer focused on application security and production integrations, with hands-on experience hardening API-driven ticketing systems (JWT/RBAC/rate limiting/log redaction) and implementing CI/CD security controls (Bandit SAST, SCA, container hardening). Strong in diagnosing peak-load production issues using logs/metrics/infra signals and driving durable fixes like adaptive throttling and backoff, while aligning engineering, business, and leadership stakeholders on risk and SLA impact.”
Mid-Level Software/AI Engineer specializing in backend systems, data pipelines, and RAG automation
“Backend engineer with experience modernizing high-traffic subscription and payment systems (TCS) by moving to event-driven Spring Boot microservices with Kafka, adding idempotency/state management to eliminate duplicate processing. Built and scaled FastAPI services for AI automation workflows (360DMMC) with versioned contracts, JWT security, and strong observability, and has led live refactors using feature flags, parallel runs, and data reconciliation.”
Mid-Level Software Engineer specializing in backend, cloud, and scalable APIs
“Backend Python engineer who has built an LLM agentic tutoring/assignment helper with a custom pipeline for parsing visually complex textbooks (integrating AlibabaResearch VGT and implementing missing preprocessing from the paper), improving RAG grounding with ~90% cleaner extracted text. Also led major platform scaling work by refactoring monolithic image processing into Celery-based async microservices on AWS (GPU/CUDA + S3), and implemented Kafka streaming for payment webhooks with strict ordering, idempotency, and multi-zone fault tolerance.”
Junior AI Engineer specializing in LLMs, RAG, and MLOps
“At ReferU.AI, designed and deployed an agentic RAG pipeline that automates multi-jurisdiction legal document drafting, emphasizing hallucination reduction through hybrid retrieval, validation agents, guardrails, and iterative regeneration. Experienced with orchestration frameworks (especially CrewAI) and rigorous testing/evaluation practices including human-in-the-loop review, adversarial testing, and production metrics/logging.”
Mid-level Robotics Engineer specializing in simulation-to-real ML control
“Robotics/ML engineer who benchmarks and adapts open-source robot action models, building synthetic datasets in Isaac Sim and modifying vendor code to scale training across multiple GPUs. Also built a production-style computer vision pipeline at Zortag—training a tiny YOLO-based classifier for fake-vs-real label detection and deploying it in a real-time iOS app with additional display/spoof detection.”
Intern AI/GenAI Engineer specializing in NLP, RAG, and Snowflake Cortex
“Built and deployed a production AI invention/patent review platform that compares invention submissions against patent rules to provide instant feedback, reportedly cutting legal team review time by ~80%. Learned Snowflake Cortex LLMs and production deployment (Docker + AWS) on the job, and validated system quality through human-in-the-loop testing with experienced legal stakeholders.”
Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps
“Built and deployed a production LLM/RAG intelligent document understanding platform for healthcare clinical documents (notes, discharge summaries, diagnostic reports), integrating spaCy entity extraction, Pinecone vector search, and a Spring Boot API on AWS with monitoring and guardrails. Demonstrates strong MLOps/orchestration (LangChain, Airflow, Kubeflow/Kubernetes) and a metrics-driven evaluation approach, and partnered with a healthcare operations manager to cut manual review time by 80%.”
Mid-level Data Engineer specializing in AI/ML, RAG systems, and cloud data pipelines
“Built a production lead-generation system using AI agents that researches the internet for relevant leads and integrates RAG-based contact enrichment/shortlisting aligned to existing CRM data, enabling sales reps to focus more on selling. Also has hands-on AWS data orchestration experience (Glue, Step Functions) moving raw data into Redshift and evaluates agent performance with human-in-the-loop plus BLEU/perplexity metrics.”
Junior Full-Stack Engineer specializing in LLM-powered products
“Built multiple systems from scratch at DSSD and Aglint, including an NGO sustainability reporting dashboard and a production LLM-powered phone screening agent using Twilio/Retell AI with RAG grounded in PostgreSQL candidate/job data. Strong focus on real-world reliability: guardrails, monitoring, and lightweight eval/regression loops that reduced recruiter score overrides by ~30%. Currently on OPT through May 2026 (plans STEM OPT extension) and committed to relocating to NYC for in-person work; seeking $90k–$120k base with meaningful equity for founding engineer roles.”
Junior Robotics Engineer specializing in AI, perception, and autonomous navigation
“Robotics software engineer with 2+ years of ROS/ROS2 experience who built a mobile robot stack from scratch (Fusion 360 → URDF → ROS) and integrated teleop, SLAM, and navigation. Worked in an ASU lab applying deep learning for person tracking on a TurtleBot setup, and solved real deployment issues like Raspberry Pi video-stream latency via compression and on-board processing. Also reports experience with CI/CD tooling (Jenkins) and Kubernetes.”
Mid-level AI Engineer & Researcher specializing in healthcare AI and multimodal LLM systems
“Backend/ML engineer focused on clinical AI transparency who built ShifaMind, an explainability-enforced clinical ML system using UMLS/MIMIC-IV/PubMed data with RAG, GraphSAGE, and cross-attention. Demonstrated strong production engineering via FastAPI API design and safe migrations (feature flags/shadow inference), plus HIPAA-aligned auth/RLS patterns; also delivered a real-time comet detection system reaching 97.7% accuracy.”
Mid-level GenAI Engineer specializing in LLM automation, RAG, and document intelligence
“Built and deployed a production GenAI resume screening and matching system for Florida Atlantic University, focused on improving recruiter efficiency and search relevance. Demonstrates strong RAG engineering (embeddings, query rewriting, metadata filtering, threshold tuning) plus practical reliability work (grounding constraints, fallbacks, and evaluation using real user queries) using Python REST APIs and orchestration frameworks like LangChain and LlamaIndex.”
Mid-level AI/ML Engineer specializing in production ML, MLOps, and NLP
“Built and deployed a transformer-based clinical document classification system that processes unstructured clinical notes in a HIPAA-compliant healthcare setting, served via FastAPI on AWS and integrated into an Airflow/S3 pipeline. Demonstrates strong end-to-end MLOps skills (data quality remediation, low-latency inference optimization, monitoring with MLflow/CloudWatch) and effective collaboration with clinicians to drive adoption.”
Mid-level Data Scientist specializing in NLP, recommender systems, and ML deployment
“At Provenbase, built and shipped a production LLM-powered semantic search and candidate matching platform (RAG with GPT-4/Gemini, multi-agent orchestration, Elasticsearch vector search) to scale sourcing across 10M+ candidate records and 1000+ data sources. Drove sub-second performance, cut LLM spend 30% with routing/caching, and improved recruiting outcomes (+45% sourcing accuracy; +38% visibility of underrepresented talent) through bias-aware ranking and tight collaboration with recruiting stakeholders.”
Mid-Level Software Engineer specializing in distributed systems and cloud microservices
“Built and productionized a RAG-based semantic search system for video-derived data, focusing on measurable success metrics (p95 latency, reliability, cost/request) and strong observability (prompt versions, retrieved docs, tool calls, token usage). Experienced in diagnosing real-time issues in LLM/agentic workflows and in supporting go-to-market efforts through tailored technical demos, rapid POCs, and post-close onboarding.”
Mid-Level Software/ML Engineer specializing in NLP, OCR, and fraud detection in FinTech
Junior Full-Stack Software Engineer specializing in cloud microservices and AI compliance
Junior Software Engineer specializing in Cloud, DevOps, and AI/ML
Junior AI Engineer specializing in LLM agents and computer vision
Junior Software Engineer specializing in backend systems and ML infrastructure
Junior NLP/ML Engineer specializing in LLMs and retrieval-augmented generation