Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in GenAI, computer vision, and real-time ML pipelines
Junior Solution Engineer specializing in energy storage systems and robotics
Senior Full-Stack AI/ML Engineer specializing in personalization, NLP, and GenAI platforms
Mid-level Full-Stack Engineer specializing in AI-powered enterprise applications
Senior Data Scientist specializing in healthcare analytics and scalable ML pipelines
Mid-Level Software Development Engineer specializing in distributed systems and event-driven architectures
“Built and maintained an internal JavaScript/React real-time event monitoring UI used by multiple Goldman Sachs teams (e.g., Private Wealth Management and Bulk Trading Systems). Focused on scaling performance under hundreds of events/sec—using profiling, memoization, batching, and debouncing—and paired it with strong internal documentation and disciplined incident diagnosis via synthetic load testing and logs/metrics.”
Senior Machine Learning Engineer specializing in MLOps and Generative AI
Mid-level AI/ML Software Engineer specializing in Generative AI and NLP
Mid-level AI/ML Engineer specializing in agentic AI and production ML systems
“ML/AI engineer with hands-on experience shipping production computer vision and GenAI systems, including a fabric defect detection platform that combined vision models with agentic LLM workflows to reach 89% human-inspector agreement at 200 ms latency. Also built a RAG-based code QA tool for developers and emphasizes production monitoring, evaluation, caching, and reusable Python service design.”
Mid-level AI Engineer specializing in GenAI and RAG systems
“AI engineer who built a production e-commerce system that analyzes product images alongside sales and demographic data to generate actionable creative recommendations, now used by 20+ clients. Also built orchestrated document/agent pipelines (Airflow, LangGraph) including a compliance drift detector auditing 401 compliance documents, with an emphasis on traceability, logging, and production integration.”
Mid-level Software Engineer specializing in SRE, observability, and LLM-powered automation
Intern Software Engineer specializing in full-stack development and applied AI
“Internship experience building an end-to-end medical AI pipeline that extracts and normalizes messy medical PDFs, fine-tunes BioBERT to classify tumor-related statements (including negation/ambiguity handling), and integrates image-model outputs (MedSAM/GroundingDINO) for tumor localization and classification. Also worked on an LLM/RAG system to draft IPO prospectuses using retrieved regulatory/financial sources (including SEC EDGAR) with structured prompts to reduce hallucinations.”
Junior Software Engineer specializing in AI, LLM systems, and full-stack development
“Product-focused full-stack engineer at startup (Zippy) who shipped a production multi-agent AI system for restaurant operations plus payments workflows. Built end-to-end: RAG grounded on a Notion knowledge base, structured function-calling task routing, FastAPI/JWT multi-tenant backend, and a polished React+TypeScript owner dashboard. Has real production incident experience (duplicate Stripe webhooks) and reports ~94% task-routing accuracy under load.”
Intern LLM/GenAI Engineer specializing in RAG, agentic systems, and low-latency inference
“Interned at Larsen & Toubro where they built and deployed an agentic RAG document question-answering system to reduce time spent searching documents and improve trustworthiness. Implemented ReAct-style multi-step orchestration with LangChain/LlamaIndex plus evidence-bounded generation, grounding/citations, and rigorous evaluation—cutting latency ~40%, hallucinations ~35%, and unsafe outputs ~40% while collaborating closely with non-technical business/ops stakeholders.”
Mid-level AI/ML Engineer specializing in GenAI and predictive modeling
“Built and deployed a GPT-4-powered medical assistant for clinical staff to reduce time spent searching guidelines and EHR information, with a strong emphasis on safety and compliance. Uses strict RAG, confidence thresholds, and fallback behaviors to prevent hallucinations, and runs production-grade workflows orchestrated with LangChain/LangGraph plus Docker/Kubernetes/MLflow and monitoring for reliability and cost.”
Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems
“LLM/ML engineer who has shipped an enterprise RAG-based Q&A system (LangChain/LlamaIndex, FAISS + Azure Cognitive Search, GPT-3.5/4 via OpenAI/Azure OpenAI) to production on Docker + Kubernetes/OpenShift, tackling hallucinations, retrieval quality, latency/cost, and RBAC/IAM security. Also partnered with operations leaders to turn manual reporting into an LLM-powered summarization and forecasting dashboard driven by real KPIs and iterative stakeholder feedback.”
Mid-level AI/ML Engineer specializing in real-time anomaly detection and AI agents
“Built a production real-time anomaly detection platform for high-frequency trading at HSBC, using a streaming stack (Pulsar + Spark Structured Streaming + AWS Lambda) and a transformer-based model combining time-series and numerical signals. Experienced in MLOps and safe deployment (Kubernetes, canary releases, MLflow/Grafana monitoring) and in aligning model performance with risk/compliance expectations through SLA-driven tuning and stakeholder-friendly dashboards.”
Senior Software Engineer specializing in cloud-native microservices and healthcare integrations
“Backend engineer at Cerebrone.ai building cloud-native Flask microservices for an AI-driven automation platform on GCP (Cloud Run/App Engine), including dedicated inference services integrating OpenAI and internal ML pipelines. Demonstrated strong performance and scalability wins across Postgres/SQLAlchemy optimization, multi-tenant (healthcare/HIPAA-grade) data isolation, and high-throughput background processing with Celery/Redis/RabbitMQ, with multiple quantified latency/CPU/throughput improvements.”
Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps
“Built a production RAG-based “unified assistant” that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.”
Mid-level Full-Stack Engineer specializing in cloud-native systems and LLM applications
“Customer-support/engineering background spanning Informatica PowerCenter ETL and IBM demos/workshops, with hands-on experience hardening data workflows for production (error tables/reject links, validation, restart strategies, alerting, performance tuning). Also demonstrates a clear, systems-level approach to diagnosing LLM/agentic workflow issues (prompt/RAG/tooling/memory) using instrumentation and iterative fixes, and has partnered with sales on POCs by defining success metrics and mapping solutions to customer architectures.”