Pre-screened and vetted.
Mid-level Software Engineer specializing in Python backend and LLM/ML systems
“Backend/AI engineer who has shipped production LLM systems end-to-end, including an AI request-routing service (FastAPI + BART MNLI + OpenAI/Gemini) that improved accuracy ~25% after launch via eval-driven prompt/category iteration. Also built an enterprise document intelligence/RAG platform on Azure (Blob/SharePoint/Teams ingestion, OCR/NLP chunking, embeddings in Azure Cognitive Search) with PII guardrails (Presidio), confidence gating, and scalable event-driven pipelines handling millions of documents.”
Mid-level Machine Learning Engineer specializing in computer vision and reinforcement learning
“Early-stage engineer with hands-on embedded prototyping experience (Arduino/Raspberry Pi) who helped build an award-winning smart glasses project enabling phone notifications via Bluetooth. Strong computer vision performance optimization background, including accelerating 120 FPS inference by moving from TensorFlow to PyTorch and deploying through ONNX + TensorRT quantization, plus Docker-based GPU deployment and CI/ML practices.”
Senior Full-Stack Developer specializing in React, Node.js, and AWS
“Backend/data engineer with hands-on production experience across Python/Flask microservices and AWS serverless/data platforms (Lambda, DynamoDB, S3, Glue/PySpark). Demonstrated strong reliability and operations mindset (JWT/RBAC, retries/timeouts/circuit breakers, CloudWatch/SNS alerting) and measurable performance wins (SQL report runtime cut from 10 minutes to 30 seconds). Seeking ~$150k base and cannot travel for onsite meetings for the next 5–6 months due to family medical constraints.”
Mid-level AI Engineer specializing in ML, NLP, and Generative AI
“AI/LLM engineer with production experience building an LLM-powered investment recommendation system using RAG and chatbots, deployed via Docker/CI/CD and scaled on Kubernetes. Demonstrated measurable performance wins (sub-200ms latency) through QLoRA fine-tuning and TensorRT INT8/INT4 quantization, plus strong MLOps/orchestration background (Airflow ETL + scoring, MLflow monitoring) and stakeholder-facing delivery using demos and Tableau dashboards.”
Entry-Level Data Scientist specializing in ML, Azure, and LLM applications
“ML/computer-vision practitioner who shipped a CycleGAN-based bilingual handwriting translation demo (English↔Telugu) for low-resource scripts using unpaired datasets, focusing on preserving handwriting style and real-time deployment via Gradio. Also delivered a medical imaging pipeline by fine-tuning ResNet-50 and ViT-B/16 for pneumonia detection, emphasizing reproducibility, measurable evaluation, and stakeholder-friendly iteration.”
Junior AI Engineer specializing in LLMs, RAG, and MLOps
“At ReferU.AI, designed and deployed an agentic RAG pipeline that automates multi-jurisdiction legal document drafting, emphasizing hallucination reduction through hybrid retrieval, validation agents, guardrails, and iterative regeneration. Experienced with orchestration frameworks (especially CrewAI) and rigorous testing/evaluation practices including human-in-the-loop review, adversarial testing, and production metrics/logging.”
Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps
“Built and deployed a production LLM/RAG intelligent document understanding platform for healthcare clinical documents (notes, discharge summaries, diagnostic reports), integrating spaCy entity extraction, Pinecone vector search, and a Spring Boot API on AWS with monitoring and guardrails. Demonstrates strong MLOps/orchestration (LangChain, Airflow, Kubeflow/Kubernetes) and a metrics-driven evaluation approach, and partnered with a healthcare operations manager to cut manual review time by 80%.”
Mid-level AI/ML Engineer specializing in healthcare ML, MLOps, and LLM/RAG systems
“Healthcare-focused ML/LLM engineer who built a production hybrid RAG workflow to automate prior authorization by retrieving from medical guidelines/historical cases (FAISS) and generating grounded rationales for clinicians. Strong in operationalizing ML with Airflow/Kubeflow/MLflow on SageMaker, optimizing latency (ONNX/quantization/async), and reducing hallucinations via evidence-only prompting; also partnered closely with clinical ops to deploy a readmission prediction tool used in daily rounds.”
Junior Machine Learning Engineer specializing in predictive modeling and GenAI RAG systems
“LLM engineer who built and deployed an emotionally intelligent AAC communication system using an emotion-aware RAG pipeline (Empathetic Dialogues + GoEmotions) and a PEFT-adapted model. Experienced with LangChain/LangGraph and custom Python orchestration, focusing on reliability (guards, schema validation, fallbacks), latency optimization, and rigorous evaluation (automatic metrics + human-in-the-loop), with a reported 18% user satisfaction improvement.”
Junior Software Engineer specializing in AI/ML and full-stack web development
“Built core perception and decision layers for a 3D AI-powered interactive avatar/agent with a robotics-like perception–reasoning–action loop, combining computer vision, NLP, and real-time response. Focused on making multimodal inputs robust (normalization, intent + emotion signal fusion) and improving real-time performance via instrumentation, profiling, and parallelization; also designed distributed, loosely coupled state-based communication and deployed services with Docker.”
Mid-level AI/ML Engineer specializing in production ML, MLOps, and NLP
“Built and deployed a transformer-based clinical document classification system that processes unstructured clinical notes in a HIPAA-compliant healthcare setting, served via FastAPI on AWS and integrated into an Airflow/S3 pipeline. Demonstrates strong end-to-end MLOps skills (data quality remediation, low-latency inference optimization, monitoring with MLflow/CloudWatch) and effective collaboration with clinicians to drive adoption.”
Junior Machine Learning Engineer specializing in multimodal systems and LLMs
“Built and productionized a domain-specific LLM-powered RAG knowledge assistant at JerseyStem for answering questions over large internal document corpora, owning the full stack from FAISS retrieval and LoRA/QLoRA fine-tuning to AWS autoscaling GPU deployment. Drove measurable gains (28% accuracy lift, 25% latency reduction) and improved reliability through hybrid retrieval, grounded decoding, preference-model reranking, and Airflow-orchestrated pipelines (35% faster runtime), while partnering closely with non-technical stakeholders to define success metrics and ensure adoption.”
Director-level IT & Operations leader specializing in cloud infrastructure and cybersecurity
Mid-level AI/ML Engineer specializing in LLM systems, MLOps, and real-time fraud detection
Mid-level AI/ML Engineer specializing in MLOps and healthcare machine learning
Senior Machine Learning Engineer specializing in LLMs, RAG, and agentic AI systems
Mid-level AI/ML Engineer specializing in LLMs, RAG, and cloud AI infrastructure
Principal AI/ML Engineer specializing in LLMs, RAG pipelines, and production MLOps
Mid-level AI/ML Engineer specializing in LLM, RAG, and semantic search systems