Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in data engineering, LLM/RAG pipelines, and recommender systems
“Research assistant at St. Louis University who built and deployed a production document-intelligence RAG system (Python/TensorFlow, vector DB, FastAPI) on AWS, focusing on grounding to reduce hallucinations and latency optimization via caching/async/batching. Also developed a personalized recommendation system for the Frenzy social platform and partnered closely with product/UX to define metrics and iterate on hybrid recommenders and cold-start handling.”
Mid-level Machine Learning Engineer specializing in computer vision and reinforcement learning
“Early-stage engineer with hands-on embedded prototyping experience (Arduino/Raspberry Pi) who helped build an award-winning smart glasses project enabling phone notifications via Bluetooth. Strong computer vision performance optimization background, including accelerating 120 FPS inference by moving from TensorFlow to PyTorch and deploying through ONNX + TensorRT quantization, plus Docker-based GPU deployment and CI/ML practices.”
Senior Full-Stack Engineer specializing in cloud-native microservices and AI/ML integration
Senior Full-Stack Developer specializing in React, Node.js, and AWS
“Backend/data engineer with hands-on production experience across Python/Flask microservices and AWS serverless/data platforms (Lambda, DynamoDB, S3, Glue/PySpark). Demonstrated strong reliability and operations mindset (JWT/RBAC, retries/timeouts/circuit breakers, CloudWatch/SNS alerting) and measurable performance wins (SQL report runtime cut from 10 minutes to 30 seconds). Seeking ~$150k base and cannot travel for onsite meetings for the next 5–6 months due to family medical constraints.”
Intern Software & AI Engineer specializing in distributed systems and LLM applications
“Stony Brook Fall 2024 capstone contributor who built a ROS2-based warehouse mobile robot prototype, owning perception and SLAM integration end-to-end. Strong in real-time robotics optimization on Jetson Orin (TensorRT/CUDA, ROS2 tracing/Nsight) and in distributed ROS2 communications (DDS discovery/QoS, MAVLink-to-ROS2 bridging), with a full simulation/testing/deployment toolchain (Gazebo, CI tests, Docker/K3s).”
Intern Data Scientist specializing in Generative AI and NLP
“Backend/AI engineer with internship experience building an AI-powered financial insights platform (FastAPI, Redis, BigQuery) and prior HCL experience leading a monolith-to-microservices refactor (Flask, Kafka) using blue-green deployments. Demonstrates strong performance/security focus (OAuth/JWT/RBAC, encryption) and measurable impact on latency, downtime, and ML model reliability; MVP was submitted to Google’s accelerator program.”
Mid-level AI Engineer specializing in NLP and production ML systems
“AI/LLM engineer who has shipped production RAG chatbots using LangChain/OpenAI with FAISS and FastAPI, focusing on real-world constraints like context windows, concurrency, and latency (reported ~40% latency reduction and <2s average response). Experienced orchestrating AI pipelines with Celery and fault-tolerant long-running workflows with Temporal, and has applied NLP model tradeoff testing (Word2Vec vs BERT) to drive measurable accuracy gains.”
Junior AI/ML Developer specializing in GenAI, LLM agents, and RAG systems
“Built and shipped an agentic RAG chatbot module for NexaCLM to answer questions across large volumes of contracts while minimizing hallucinations and incorrect legal interpretations. Implemented routing between vector retrieval and ReAct-style agent retrieval plus an automated grading/validation layer (cosine-similarity thresholds, retries) and deployed via GitHub Actions to Azure Container Apps, partnering closely with legal stakeholders to define risk/clause-focused objectives.”
Junior AI Engineer specializing in Generative AI, RAG, and NLP
“AI/LLM engineer who has shipped a production RAG platform at Ticker Inc. on GCP (Qdrant + Postgres) delivering sub-second retrieval over 550k+ items, with measurable gains in latency and answer quality (HNSW optimization, MMR re-ranking). Also built an asynchronous LangChain/LangGraph multi-agent research system (10x faster cycles) and partnered with Indiana University doctors on synthetic patient records and ML error analysis using clinician-friendly F1/loss dashboards.”
Intern Backend Developer specializing in AI, multi-agent systems, and computer vision
“Backend-focused Python engineer who built core systems for an AI beauty-advice product: converting facial-recognition landmarks into usable facial measurements and dynamically shaping chatbot context for personalized guidance. Also worked on high-volume data ingestion at AINVESTgroup, improving agent context selection via a RAG database when upstream tags were unreliable, and has strong Git/GitOps + automated testing practices from rapid-deadline delivery environments.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and production inference
“AI/LLM engineer who built a production resume-parsing and candidate-matching platform at Quadrant Technologies, combining agentic LangChain workflows, VLM-based document template extraction (~85% accuracy), and a hybrid RAG backend for resume-to-JD search. Notably integrated automated LLM evals and metric-based CI/CD quality gates to catch silent prompt/model regressions, and led a 3-person team across frontend/backend/testing.”
Mid-level Data Scientist specializing in insurance, healthcare, and cloud analytics
“Built a production-style LLM document summarization/generation workflow that mitigates token limits and reduces hallucinations using semantic chunking, FAISS-based embedding retrieval (top-k via cosine similarity), and section-wise generation. Orchestrated the end-to-end pipeline with AWS Step Functions and aligned outputs with sales stakeholders through demos, visuals, and documentation.”
Mid-level AI/ML Engineer specializing in Generative AI and LLM-powered NLP
“LLM/AI engineer who built a production automated document-understanding pipeline on Azure using a grounded RAG layer, designed to reduce manual review time for unstructured financial documents. Demonstrates strong real-world scaling and reliability practices (Service Bus queueing, Kubernetes autoscaling, observability, retries/circuit breakers) plus rigorous evaluation (shadow testing, replaying traffic, multilingual edge-case suites) and stakeholder-friendly, evidence-based explainability.”
Senior Machine Learning Engineer specializing in LLMs, RAG, and agentic AI systems
“LLM/RAG practitioner who has taken a support-ticket triage automation system from prototype to production, building the full pipeline (fine-tuned models, FastAPI inference services, vector storage, monitoring) and delivering measurable impact (~40% reduction in triage time). Demonstrates strong operational troubleshooting of LLM/agentic workflows (observability-driven debugging, fixing agent routing/looping) and supports adoption through tailored demos and sales-aligned technical communication.”
Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and agentic workflows
“Applied AI/ML engineer with hands-on production experience building a RAG-based AI assistant for pharmaceutical maintenance troubleshooting using LangChain + FAISS/Pinecone, including a custom normalization layer to handle inconsistent terminology and duplicate document revisions. Also built Airflow-orchestrated pipelines for document ingestion/embeddings and predictive maintenance workflows (SCADA ETL, drift-based retraining), and partnered closely with production supervisors/quality engineers via Power BI dashboards and real-time alerts.”
Mid-level AI/ML Engineer specializing in Generative AI and RAG systems
“LLM/RAG engineer who has built and shipped production assistants, including a RAG-based teaching assistant (Marvel AI) using LangChain/LlamaIndex/ChromaDB with OpenAI embeddings and Redis vector search, achieving ~30% accuracy gains and ~35% latency reduction. Also deployed FastAPI services on Google Cloud Run with observability and prompt-level monitoring, and partnered with non-technical ops stakeholders to deliver an internal policy-document RAG assistant.”
Mid-level AI Engineer specializing in Generative AI and multimodal RAG systems
“GenAI/LLM engineer who built and productionized a 0-1 application (EMULaiTOR at Lumanity) combining qualitative + quantitative data using Postgres/pgvector RAG and prompt engineering, deployed with Azure backend and AWS-hosted frontend. Demonstrates strong production instincts (latency reduction via region alignment, autoscaling/health checks) and hands-on agent/tool-call debugging, plus experience enabling sales and winning a large pharma client.”
Entry-Level Backend Engineer specializing in analytics automation and cloud data pipelines
“Forward Deployment Engineer focused on application security and production integrations, with hands-on experience hardening API-driven ticketing systems (JWT/RBAC/rate limiting/log redaction) and implementing CI/CD security controls (Bandit SAST, SCA, container hardening). Strong in diagnosing peak-load production issues using logs/metrics/infra signals and driving durable fixes like adaptive throttling and backoff, while aligning engineering, business, and leadership stakeholders on risk and SLA impact.”
Mid-level Full-Stack Software Engineer specializing in Healthcare and Insurance platforms
“Full-stack engineer with healthcare and insurance domain experience who has owned production systems end-to-end (React/Next.js, FastAPI/Node, Postgres, AWS SNS/SQS, Docker, CI/CD) and delivered measurable impact (30% faster data processing). Also productionized an LLM-powered clinical data assistant using RAG + a vector database with guardrails and evaluation loops, cutting analyst lookup time by ~30–40%, and has experience modernizing monoliths to microservices with feature-flagged, low-regression rollouts.”
Mid-level Software Engineer specializing in Generative AI automation and secure platforms
“Backend/security-focused engineer from VeroTX who built an IdP service (Spring Boot + MongoDB on GCP) for an AI workflow platform and drove major latency improvements via caching and query/index optimization. Also shipped an AI loan-processing agent using LangChain/LangGraph, owning the document ingestion + vector database layer and designing a reliable multi-step workflow with retries, monitoring, and human-in-the-loop safeguards.”
Mid-Level Software/AI Engineer specializing in backend systems, data pipelines, and RAG automation
“Backend engineer with experience modernizing high-traffic subscription and payment systems (TCS) by moving to event-driven Spring Boot microservices with Kafka, adding idempotency/state management to eliminate duplicate processing. Built and scaled FastAPI services for AI automation workflows (360DMMC) with versioned contracts, JWT security, and strong observability, and has led live refactors using feature flags, parallel runs, and data reconciliation.”
Junior Software Engineer specializing in AI platforms, distributed systems, and cloud infrastructure
“Software engineer with limited robotics background but deep experience building end-to-end document ingestion and image understanding systems, including a CAD-specific pipeline using a custom model to extract components and bounding boxes for user-facing visualization and Q&A. Also brings strong infrastructure/DevOps skills (Docker, Kubernetes, GitHub Actions, Terraform) with emphasis on reliability, cost optimization, and uptime.”
Mid-Level Software Engineer specializing in backend, cloud, and scalable APIs
“Backend Python engineer who has built an LLM agentic tutoring/assignment helper with a custom pipeline for parsing visually complex textbooks (integrating AlibabaResearch VGT and implementing missing preprocessing from the paper), improving RAG grounding with ~90% cleaner extracted text. Also led major platform scaling work by refactoring monolithic image processing into Celery-based async microservices on AWS (GPU/CUDA + S3), and implemented Kafka streaming for payment webhooks with strict ordering, idempotency, and multi-zone fault tolerance.”