Pre-screened and vetted.
Junior Solution Engineer specializing in energy storage systems and robotics
Senior Full-Stack AI/ML Engineer specializing in personalization, NLP, and GenAI platforms
Senior Data Scientist specializing in healthcare analytics and scalable ML pipelines
Mid-Level Software Development Engineer specializing in distributed systems and event-driven architectures
“Built and maintained an internal JavaScript/React real-time event monitoring UI used by multiple Goldman Sachs teams (e.g., Private Wealth Management and Bulk Trading Systems). Focused on scaling performance under hundreds of events/sec—using profiling, memoization, batching, and debouncing—and paired it with strong internal documentation and disciplined incident diagnosis via synthetic load testing and logs/metrics.”
Senior Machine Learning Engineer specializing in MLOps and Generative AI
Mid-level AI/ML Software Engineer specializing in Generative AI and NLP
Mid-level AI Engineer specializing in GenAI and RAG systems
“AI engineer who built a production e-commerce system that analyzes product images alongside sales and demographic data to generate actionable creative recommendations, now used by 20+ clients. Also built orchestrated document/agent pipelines (Airflow, LangGraph) including a compliance drift detector auditing 401 compliance documents, with an emphasis on traceability, logging, and production integration.”
Mid-level Software Engineer specializing in SRE, observability, and LLM-powered automation
Intern Software Engineer specializing in full-stack development and applied AI
“Internship experience building an end-to-end medical AI pipeline that extracts and normalizes messy medical PDFs, fine-tunes BioBERT to classify tumor-related statements (including negation/ambiguity handling), and integrates image-model outputs (MedSAM/GroundingDINO) for tumor localization and classification. Also worked on an LLM/RAG system to draft IPO prospectuses using retrieved regulatory/financial sources (including SEC EDGAR) with structured prompts to reduce hallucinations.”
Junior Software Engineer specializing in AI, LLM systems, and full-stack development
“Product-focused full-stack engineer at a startup (Zippy) who shipped a production multi-agent AI system for restaurant operations plus payments workflows. Built end-to-end: RAG grounded on a Notion knowledge base, structured function-calling task routing, FastAPI/JWT multi-tenant backend, and a polished React+TypeScript owner dashboard. Has real production incident experience (duplicate Stripe webhooks) and reports ~94% task-routing accuracy under load.”
Intern LLM/GenAI Engineer specializing in RAG, agentic systems, and low-latency inference
“Interned at Larsen & Toubro, building and deploying an agentic RAG document question-answering system to reduce time spent searching documents and improve trustworthiness. Implemented ReAct-style multi-step orchestration with LangChain/LlamaIndex plus evidence-bounded generation, grounding/citations, and rigorous evaluation, cutting latency ~40%, hallucinations ~35%, and unsafe outputs ~40% while collaborating closely with non-technical business/ops stakeholders.”
Mid-level AI/ML Engineer specializing in GenAI and predictive modeling
“Built and deployed a GPT-4-powered medical assistant for clinical staff to reduce time spent searching guidelines and EHR information, with a strong emphasis on safety and compliance. Uses strict RAG, confidence thresholds, and fallback behaviors to prevent hallucinations, and runs production-grade workflows orchestrated with LangChain/LangGraph plus Docker/Kubernetes/MLflow and monitoring for reliability and cost.”
Intern Data Scientist specializing in ML, NLP, and MLOps for healthcare and enterprise AI
“Built a production multi-cloud LLM-driven IT ticket automation system using LangGraph, Azure + Pinecone RAG, and an Ollama-hosted LLM on AWS, with Terraform-managed infra and PostgreSQL audit/state tracking for reliability. Also partnered with UW School of Medicine & Public Health students to deliver a glioma survival risk-ranking model, translating clinical feedback into practical pipeline improvements (imputation, site harmonization) and stakeholder-friendly visualizations.”
Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems
“LLM/ML engineer who has shipped an enterprise RAG-based Q&A system (LangChain/LlamaIndex, FAISS + Azure Cognitive Search, GPT-3.5/4 via OpenAI/Azure OpenAI) to production on Docker + Kubernetes/OpenShift, tackling hallucinations, retrieval quality, latency/cost, and RBAC/IAM security. Also partnered with operations leaders to turn manual reporting into an LLM-powered summarization and forecasting dashboard driven by real KPIs and iterative stakeholder feedback.”
Mid-level Data Scientist specializing in AI/ML, MLOps, and LLM-powered analytics
“Built and deployed a production LLM-powered document Q&A system enabling natural-language querying of large PDFs, focusing on retrieval quality (overlapped chunking) and low-latency performance (optimized embeddings + vector search). Experienced with scaling ML/LLM workflows using async/batch processing, caching, cloud storage, and orchestration via Apache Airflow with robust testing, monitoring, and failure handling.”
Mid-level AI/ML Engineer specializing in real-time anomaly detection and AI agents
“Built a production real-time anomaly detection platform for high-frequency trading at HSBC, using a streaming stack (Pulsar + Spark Structured Streaming + AWS Lambda) and a transformer-based model combining time-series and numerical signals. Experienced in MLOps and safe deployment (Kubernetes, canary releases, MLflow/Grafana monitoring) and in aligning model performance with risk/compliance expectations through SLA-driven tuning and stakeholder-friendly dashboards.”
Senior Software Engineer specializing in cloud-native microservices and healthcare integrations
“Backend engineer at Cerebrone.ai building cloud-native Flask microservices for an AI-driven automation platform on GCP (Cloud Run/App Engine), including dedicated inference services integrating OpenAI and internal ML pipelines. Demonstrated strong performance and scalability wins across Postgres/SQLAlchemy optimization, multi-tenant (healthcare/HIPAA-grade) data isolation, and high-throughput background processing with Celery/Redis/RabbitMQ, with multiple quantified latency/CPU/throughput improvements.”
Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps
“Built a production RAG-based ‘unified assistant’ that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.”
Mid-level Full-Stack Engineer specializing in cloud-native systems and LLM applications
“Customer-support/engineering background spanning Informatica PowerCenter ETL and IBM demos/workshops, with hands-on experience hardening data workflows for production (error tables/reject links, validation, restart strategies, alerting, performance tuning). Also demonstrates a clear, systems-level approach to diagnosing LLM/agentic workflow issues (prompt/RAG/tooling/memory) using instrumentation and iterative fixes, and has partnered with sales on POCs by defining success metrics and mapping solutions to customer architectures.”
Mid-level AI/ML Engineer specializing in healthcare analytics and MLOps
“AI/ML engineer at Cigna Healthcare building a production, HIPAA-compliant LLM-powered clinical insights platform that summarizes unstructured medical notes using a fine-tuned transformer + RAG on AWS. Demonstrates strong end-to-end MLOps and cloud optimization (distillation, Spot/Lambda/Auto Scaling) with quantified outcomes (~28% accuracy lift, ~40% less manual review, ~25% lower ops cost) and strong clinician-facing explainability via SHAP and dashboards.”
Mid-level Generative AI Engineer specializing in LLM systems and RAG
“Generative AI engineer currently at Huntington Bank who built a production-grade RAG system that helps business/operations teams get grounded answers from large volumes of internal enterprise documents. Owns ingestion and the FastAPI backend, tuned hybrid BM25+vector retrieval and chunking for relevance, and evaluates reliability with metrics and observability (LangSmith, CloudWatch, Prometheus/Grafana) while partnering closely with non-technical stakeholders.”
Junior Machine Learning Engineer specializing in computer vision and LLM applications
“Built and led an autonomous driving software effort for Formula Student, owning the full autonomy stack (perception, planning, control) orchestrated in ROS. Implemented stereo depth + YOLO object detection, RRT/RRT* planning, and a robust SLAM pipeline (Kalman filter, submapping) while leveraging Gazebo simulation and modern deployment tooling (Docker/Kubernetes, AWS, GitHub Actions CI/CD).”
Mid-level Robotics Software Engineer specializing in autonomous perception and sensor fusion
“Robotics engineer with Honeywell and Tata Motors experience deploying ROS/ROS2 autonomous mobile robot fleets into live factory environments, integrating sensors, safety PLCs, and on-prem services. Known for solving end-to-end latency and stability issues (including network spikes under load) using gRPC, Docker, and improved diagnostics—cutting diagnosis time from hours to minutes and achieving sub-150 ms control response.”
Mid-level Full-Stack Java Developer specializing in cloud-native microservices
“Full-stack engineer focused on enterprise, cloud-native microservices—building Spring Boot backends and React/Angular front ends with strong security (OAuth/JWT), AWS infrastructure (RDS/S3), and containerized deployments (Docker/Kubernetes). Has delivered data-heavy order/account/transaction platforms and healthcare solutions including EHR integrations for secure patient data exchange, with emphasis on testing, performance tuning, and reliability (load testing).”