Pre-screened and vetted.
Mid-level Generative AI Engineer specializing in LLMs, RAG, and semantic search
Mid-Level Full-Stack Software Engineer specializing in cloud-native FinTech systems
Senior Software Engineer specializing in cloud-native Java microservices
Mid-level Backend Engineer specializing in JavaScript/TypeScript runtimes and OSS developer experience
Junior Robotics Engineer specializing in reinforcement learning and robot manipulation
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and computer vision
Mid-level AI Engineer specializing in production LLM, RAG, and agentic AI systems
Junior AI Engineer specializing in RAG pipelines and agentic AI systems
“Built and shipped production RAG/agentic systems in high-stakes domains (biomedical and legal), including an enterprise biomedical document retrieval platform over ~10k scientific docs and a multilingual African-law assistant at the World Bank. Deep hands-on experience with LangChain/LangGraph/LlamaIndex and evaluation tooling (LLM-as-a-judge, safety/hallucination detection), with measurable gains in retrieval quality and hallucination reduction.”
Junior Data Scientist specializing in ML, LLMs, and RAG applications
“University hackathon finalist (2nd place) who built CareerSpark, a production-style multi-agent career guidance app in 24 hours using a hierarchical debate architecture with a moderator/judge agent. Has startup internship experience at LiveSpheres AI using LangChain for multi-LLM orchestration, and demonstrates a structured approach to testing/evaluation (golden sets, integration sims, latency/accuracy KPIs) plus strong non-technical stakeholder communication.”
Mid-level Data Scientist specializing in Generative AI and multimodal systems
“Recent J&J intern who built a conversational RAG agent and led a shift from a monolithic model to a modular RAG workflow, cutting response time from several days to under a second by tackling data fragmentation, context retention, and embedding/latency optimization. Also worked on a large (7B-parameter) multimodal VQA pipeline for healthcare research and stays current via NeurIPS/ICLR and open-source contributions.”
Mid-level Backend Software Developer specializing in cloud-native microservices
“Backend engineer with American Express experience maintaining an internal Python/Flask rewards simulation microservice used by product analysts and QA. Demonstrated strong performance and scalability work: moved batch simulations to Celery, added Redis caching to cut DynamoDB latency, and tuned Postgres/SQLAlchemy queries with EXPLAIN ANALYZE and composite indexes (bringing API responses under ~200ms by queueing jobs). Also has experience integrating ML via Flask-based model-serving APIs (scikit-learn/LightGBM packaged with joblib) and designing multi-tenant data isolation and tenant-specific configuration systems.”
Junior AI/ML Engineer specializing in RAG systems and cloud-native MLOps
“Built and shipped a production LLM-powered RAG system at Upstart enabling natural-language search across 50k+ scattered internal technical docs. Delivered sub-300ms p95 latency for ~50 active users with strong hallucination safeguards (retrieval-first, thresholds, citations) plus robust testing/monitoring and cost controls (prompt caching cutting API spend ~20%).”
Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms
“Built and shipped a production RAG-based assistant that lets parents ask natural-language questions about their child’s learning progress, using pgvector retrieval (child-id filtered) and Redis caching to hit ~180ms latency. Implemented real-world guardrails and compliance (Llama Guard, COPPA, retrieval thresholds, fallbacks) with 99.5% uptime, and ran human-in-the-loop eval loops that improved satisfaction from 3.8 to 4.2 while serving 60k+ monthly users and reducing costs significantly.”
Mid-level Data Engineer specializing in Lakehouse, Streaming, and ML/LLM data systems
“Built and productionized an enterprise retrieval-augmented generation platform for internal knowledge over large unstructured corpora, emphasizing trust via strict citation/grounding and hybrid retrieval (BM25 + FAISS + cross-encoder re-ranking). Demonstrates strong scaling and cost/latency optimization through incremental indexing/embedding and index partitioning, plus disciplined evaluation/observability practices. Has experience operationalizing pipelines with Airflow/Databricks/GitHub Actions and partnering closely with risk & compliance stakeholders on auditability requirements.”
Mid-level AI/ML Engineer specializing in Generative AI and production ML systems
“Built and deployed a production SecureAIChatBot (RAG-based) for secure internal information retrieval, using embeddings/vector search, GPT models, monitoring, and safety filters. Focused on real-world production challenges like latency and output consistency, applying caching, retrieval scoping, smaller models, and controlled prompting, and used LangChain to orchestrate the end-to-end workflow.”
Mid-level AI/ML Engineer specializing in LLM systems, RAG, and MLOps
“Built a production, real-time clinical documentation system at HCA that converts doctor–patient conversations into structured clinical summaries using speech-to-text, LLM summarization, and RAG. Demonstrated measurable gains from medical-domain fine-tuning (clinical concept recall +18%, ROUGE-L 0.62 to 0.74) while meeting HIPAA constraints via PHI anonymization and encryption, and deployed via Docker/FastAPI with CI/CD and monitoring.”
Junior Robotics & Embedded Software Engineer specializing in Linux-based distributed robotic systems
“Robotics software engineer focused on system-level C++/Linux stacks for multi-robot platforms, owning the communication layer and validation/testing infrastructure. Built Python simulation/replay and fault-injection tooling integrated with Docker + GitLab CI/CD, and debugged real-time localization issues by instrumenting IPC timing and refactoring multi-threaded pipelines for deterministic performance.”