Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in LLM training, RAG, and low-latency inference
Senior Machine Learning Engineer specializing in GenAI, NLP, and recommendation systems
Director-level Strategy & Operations leader specializing in transportation and consulting
Junior Software Engineer specializing in distributed systems and AI agents
“Python backend engineer focused on high-throughput document/PDF processing systems, building end-to-end pipelines that extract structured content for downstream NLP use cases. Demonstrates strong practical MLOps-adjacent infrastructure skills: Kubernetes deployments, GitLab CI, GitOps workflows, and an incremental migration to AWS using EC2/Lambda tradeoffs. Deep hands-on optimization experience (selective OCR, layout-aware extraction, parallelism, caching, idempotency, and backpressure/autoscaling).”
Engineering Manager specializing in AI/ML platforms and 0→1 product delivery
“Player-coach engineer/lead on a high-scale research integrity platform ("Lighthouse") that flags fraud/manipulation signals across ~3M academic manuscripts per year. Owns architecture decisions (ADRs), implements across Go/Java/React services, and introduced NLP (SciBERT embeddings + human-in-the-loop) to assess out-of-context citations while also handling production incidents with a data-consistency-first approach.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference
“Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.”
Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety
“AI/LLM engineer with production experience at NVIDIA, where they fine-tuned and deployed a financial-services chatbot and cut latency ~50% using TensorRT + NVIDIA Triton, scaling via Docker/Kubernetes. Also has consulting experience at Accenture delivering a predictive maintenance solution for a logistics network, bridging non-technical stakeholders with actionable dashboards.”
Staff-level Machine Learning Engineer specializing in LLMs and MLOps for Financial Services
“Machine learning/NLP practitioner at J.P. Morgan who led development of a production RAG system and an entity resolution pipeline for complex financial data. Deep hands-on experience with embeddings (Sentence-BERT), vector search (FAISS/pgvector), LLM fine-tuning (LoRA/PEFT), and rigorous evaluation (human-in-the-loop + A/B testing) backed by strong MLOps on AWS (Docker/Kubernetes, MLflow, Prometheus/Datadog).”
Mid-level AI/ML Engineer specializing in LLM alignment, safety, and scalable inference
“Built and productionized an AWS-hosted, Kubernetes-orchestrated RAG assistant that enables natural-language Q&A over internal document repositories with grounded answers and citations. Demonstrates strong applied LLM engineering: hallucination mitigation, hybrid retrieval + re-ranking, and rigorous evaluation via benchmarks and A/B testing, plus real-world scaling of compute-heavy inference with dynamic batching and monitoring.”
Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants
“Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.”
Intern AI/ML Engineer specializing in NLP, LLMs, and semantic search
“Built and deployed a production RAG-based semantic search and summarization system for large legal/technical document sets, owning the full backend (embeddings, vector store, chunking, prompting) and driving a reported 40–60% reduction in manual review time. Experienced with LangChain/LlamaIndex plus Airflow/Temporal-style orchestration, and applies rigorous evaluation/monitoring (A/B tests, drift detection, staged rollouts) to keep agentic systems reliable. Also partnered with a supply-chain manager at TE Connectivity to deliver an AI inventory recommendation tool projected to drive millions in value.”
Mid-level Data Scientist specializing in anomaly detection and production ML
“Interned at Backblaze building production AI systems for incident response and security operations, including an internal LLM-powered incident triage assistant that used Snowflake + RAG over historical tickets/postmortems and delivered results via Slack and a web UI. Emphasizes reliability (PII filtering, grounding, schema validation, fallbacks) and rigorous evaluation/observability (offline replay, partial rollouts, time-to-first-action metrics, Prometheus/Grafana).”
Mid-level Software Engineer specializing in AI/LLM and distributed systems
“Recent internship project at Google Workspace building an LLM-driven Python backend pipeline to extract/enrich NLP features from messy customer web domains and integrate them into a Domain Feature Store for personalization and promotions. Also has hands-on Kubernetes/Docker deployment experience for a Digital Signage SaaS backend with GitHub Actions CI, plus strong streaming-systems knowledge (Kafka exactly-once, schema evolution, Flink scaling) and built an information retrieval system handling 30,000+ cases.”
Mid-Level Backend Engineer specializing in REST APIs and AWS
“Backend engineer who built a new REST eligibility service at Barclays that unified siloed account logic (card/loan/deposit) and integrated with web/mobile, ultimately serving millions of users daily. Also built an end-to-end LLM-based pharmaceutical care-plan generation tool in a rapid Columbia startup competition, emphasizing configurable design, strict validation, persistence, and robust error handling.”
Senior AI/ML Engineer specializing in Generative AI, NLP, and RAG systems
“ML/NLP engineer focused on production-grade data and search/recommendation systems: built an end-to-end pipeline that connects unstructured customer feedback with product data using TF-IDF/BERT, Spark, and AWS (SageMaker/S3), orchestrated with Airflow and monitored for drift. Also has hands-on experience with entity resolution at scale and improving search relevance via BERT embeddings, FAISS vector search, and domain fine-tuning validated with precision@k and A/B testing.”
Mid-level Software Developer specializing in cloud data engineering and MLOps
“Software engineer with strong AWS production experience, including an end-to-end historical backfill system exporting ~10PB of CloudWatch logs into a data lake using Step Functions/Kinesis/Lambda/Firehose/Glue. Emphasizes reliability and operability (DynamoDB checkpointing, monitoring dashboards, CI/CD with canary tests) and has also built customer-facing UI work for the Visa Developer Portal using Angular + Spring Boot, plus React/Redux frontend work.”
Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps
Intern Machine Learning Engineer specializing in AI security and anomaly detection
Intern Software Engineer specializing in data science and network visualization
Mid-level AI/ML Engineer specializing in recommender systems, fraud detection, and LLMs
Entry Software Engineer specializing in AI infrastructure and ML inference systems
Mid-level Full-Stack Developer specializing in cloud-native microservices and FinTech
Mid-level AI/ML Engineer specializing in NLP/LLMs and production ML systems