Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in NLP, RAG, and MLOps
“Built a production LLM/RAG-based “model excellence scoring” system at Uber to automatically evaluate hundreds of ML models, standardizing quality assessment and cutting evaluation time from days to minutes on GCP. Also delivered an NLP document classification solution for insurance claims at Globe Life, partnering closely with compliance/operations and improving routing accuracy from ~85% manual to 93% with the model.”
Mid-level Machine Learning Engineer specializing in Generative AI and MLOps
“LLM/agent engineer who has shipped production RAG chatbots in sustainability-focused domains, including a packaging recommendation assistant that standardized messy user inputs and used Pinecone-backed retrieval over product/regulatory data. Experienced orchestrating end-to-end ML workflows with Airflow and AWS Step Functions/Lambda, emphasizing reliability (property-based testing, circuit breakers, OpenTelemetry) and measurable performance (latency/cost). Partnered closely with non-technical leadership to ship 3 weeks early, driving adoption by 150+ businesses and ~20% reported waste reduction.”
“Built end-to-end LLM/RAG systems for biological data and scientific literature analysis in a drug discovery setting, helping researchers explore disease insights and treatment hypotheses faster. Combines applied GenAI product work with strong production engineering, including monitoring, retrieval optimization, reusable Python services, and scalable deployment on AWS/Kubeflow.”
Mid-level Full-Stack Engineer specializing in AI-driven data platforms
“Full-stack engineer with 5+ years of experience who built real-time data visualization and analytics systems at Uber, spanning React/TypeScript frontends, Node/GraphQL services, Kafka pipelines, and PostgreSQL. Particularly compelling for teams needing a hands-on builder who can turn ambiguous customer needs into scalable products, and who has also applied RAG with LangChain/OpenAI over 1.8M support files to surface actionable insights.”
Junior Full-Stack & Data Engineer specializing in cloud platforms and cybersecurity ML
“Built a hackathon "Patient Summary Assistant" backend focused on healthcare workflows, combining RAG-based summarization with HIPAA-minded privacy controls (NER redaction + encryption). Demonstrated strong infra skills by deploying on Kubernetes with Helm/HPA and GitOps (ArgoCD), plus migrating from OpenAI to an on-prem Llama 3 stack (vLLM, quantization, shadow-mode testing) and adding real-time Kafka ingestion for patient vitals/anomaly alerts.”
Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps
“Built and productionized a RAG-based analytics Q&A assistant for a financial analytics team, enabling natural-language querying across 200+ datasets (SQL tables, PDFs, compliance docs, wikis) and cutting turnaround time by 60%. Deep experience delivering regulated, audit-ready LLM systems on Azure (Azure OpenAI + LangChain) with strict grounding/citations, hybrid retrieval, and AKS-based low-latency deployment, plus strong collaboration with compliance analysts and auditors via iterative Gradio demos.”
Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision
“Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.”
“Built and owned end-to-end production systems for a healthcare platform, including a predictive task recommendation feature (React + FastAPI + ML on AWS ECS) that cut backlog 20% and saved coordinators ~10 hours/week. Also productionized an AI-native RAG system (vector DB + LLM) delivering 40% faster query resolution, and led phased modernization of a monolithic FastAPI service into async microservices using feature flags and canary releases.”
Mid-level AI/ML Engineer specializing in generative AI and intelligent automation
“Backend-focused AI engineer with enterprise experience building startup-style internal products at JPMorgan Chase. He helped create an AI-powered financial research platform for analysts, leading retrieval and multi-agent orchestration work that cut research prep from hours to under 20 minutes while scaling across large volumes of SEC filings and earnings transcripts.”
Junior Software Engineer specializing in full-stack systems and distributed log analytics
“CMU candidate with hands-on experience taking LLM concepts from research prototypes toward production-ready designs (structured outputs, guardrails, failure-scenario evaluation). Also partnered with sales/customer teams at Mazecare to drive adoption with Dontia Alliance (largest dental clinic chain in Singapore) and engaged Singapore government stakeholders, bridging clinical workflow needs with IT security/integration concerns.”
Mid-level Software Engineer specializing in cloud infrastructure and distributed systems
“Cloud infrastructure/product engineer with end-to-end ownership of cloud-native storage/observability products, including taking an internal CMS to Google Cloud Marketplace and scaling to ~40,000 deployments. Strong in Kubernetes-based platforms (Operators, microservices, RabbitMQ) and performance/scalability work (e.g., 200% cluster capacity increase) plus internal tooling that materially improved SRE/QA debugging and release velocity.”
Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps for healthcare and finance
“Built a production LLM-powered RAG agent for healthcare/insurance operations that retrieves and summarizes patient medical documents with grounded citations, scaling to ~4.5M records. Addressed medical shorthand and terminology by fine-tuning ~120 lightweight DistilBERT models by specialty and validating entities against SNOMED/RxNorm, while using SHAP/LIME and human-in-the-loop review to make decisions explainable to stakeholders.”
Junior AI Engineer specializing in fraud detection, credit risk, and LLMs in FinTech
“AI engineer with production experience building a high-accuracy (98%) fraud detection system operating at real-time latency (1–2s) over millions of transactions, using a multi-model pipeline approach to meet performance constraints. Also implemented Airflow-orchestrated workflows (DAGs, retries, alerts) to replace brittle cron scripts and is currently pursuing a master’s project on real-time ASL-to-text conversion.”
Intern Software Engineer specializing in ML/NLP and LLM applications
“Full-stack AI/LLM engineer who has deployed a production LLM backend (Mistral 14B) on GKE to auto-transform datasets and generate runnable ML training pipelines, addressing hallucinations, schema mismatch, latency, and burst scaling with caching/prompt compression and HPA. Also has internship experience (Splunk, BlackOffer) delivering data automation and 10+ Power BI dashboards for non-technical stakeholders with measurable efficiency gains.”
Mid-level Full-Stack Engineer specializing in AI and FinTech platforms
“Full-stack engineer building real-time internal banking operations dashboards (Java/Spring Boot microservices + React/TypeScript) with Kafka-based streaming and post-launch performance optimizations. Also shipped a production internal AI support assistant using RAG (Confluence/PDF/support docs ingestion, embeddings + vector DB retrieval) with guardrails, evaluation loops, and observability to reduce hallucinations and prevent regressions.”
Mid-Level Software Engineer specializing in AI/ML and Cloud-Native Microservices
“Research assistant at Syracuse University who owned a Python/FastAPI analytics backend for user-uploaded large datasets, using S3 streaming uploads and background workers for heavy processing. Has hands-on experience deploying Dockerized Python/Java microservices to AWS EKS with Jenkins-based CI/CD, plus Kafka-based event-driven pipelines and practical migration patterns (dependency mapping, dual-write, reconciliation) to minimize downtime.”
Senior AI/ML Engineer specializing in Generative AI and agentic multi-agent systems
“Built and shipped a production LLM-powered multi-agent RAG system to automate complex internal support workflows, integrating tool execution (SQL/APIs) with validation guardrails to reduce hallucinations. Optimized for real-world latency and cost via model routing, caching, and async parallel tool calls, and enforced reliability with CI-gated golden test sets derived from anonymized production queries.”
Mid-Level Java Full-Stack Developer specializing in cloud-native microservices
“QA/validation-focused engineer with experience at Meta testing an ML+LLM content classification/summarization system, including production-vs-test behavior gaps. Built automated E2E validation and drift monitoring (PSI, KL divergence, embedding cosine similarity) run daily/multiple times per day and gated via CI. Also implemented Jenkins-orchestrated Selenium/API test suites in Docker at Capgemini and partnered with a business analyst to convert business rules into automated AI-driven validation checks.”
Intern AI/ML Engineer specializing in Generative AI and applied machine learning
“New graduate with hands-on LLM work building a RAG pipeline (HNSW, lexical reranking/boosting, ReAct) and optimizing it through ablation to dramatically reduce latency. Also building a modular personal assistant with a custom wake word model, router-driven agent selection, and integrations like Spotify with secrets managed via .env.”
Senior Distributed Systems Architect specializing in backend platforms and FinTech
“Full-stack engineer who built an AI-powered visual product discovery feature end to end across web, mobile, backend, and ML integration. Particularly strong in TypeScript-first monorepo architecture, serverless AWS microservices, and productionizing computer vision/LLM pipelines with monitoring, prompt refinement, and human-in-the-loop quality controls.”
Mid-level AI/Analytics Product & Data Professional specializing in LLM and dashboard automation
“Built and shipped open-source LLM/RAG systems, including a generative AI assistant grounded on ~30,000 scraped university web pages, improving response accuracy ~30% by moving from TF-IDF-only retrieval to a hybrid sentence-transformer approach with fallback controls. Also partnered with non-technical leadership at Securi.ai to deliver real-time predictive analytics dashboards (Elasticsearch + Jira/ServiceNow) that reduced project overhead by 18%.”
Intern Full-Stack/Backend Software Engineer specializing in SaaS migrations and NLP
“AI/ML practitioner who built an Indian Sign Language recognition system (MediaPipe hand keypoints + CNN/RNN) as an accessibility-focused teaching aid, iterating closely with advocacy groups and educators and reaching 92% accuracy. Also has production-scale data migration experience at Saasgenie, using Kubernetes pod parallelization to migrate 1M+ ITSM records with a 5x throughput gain under API rate limits.”