Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps for financial services
“Built and deployed a production Llama 3-based RAG document Q&A system using FAISS, addressing context-window limits through chunking and keeping retrieval accurate by regularly refreshing embeddings. Has hands-on orchestration experience with LangChain and LlamaIndex for multi-step LLM workflows (including memory management) and collaborates with non-technical teams (e.g., marketing) to deliver AI solutions like recommendation systems.”
Mid-level Full-Stack Software Engineer specializing in cloud-native microservices and GenAI
“Full-stack engineer with cloud and GenAI experience who has owned production features end-to-end, including a reporting dashboard optimized from 14s to 5s using query/API refactoring and monitored via AWS CloudWatch. Also productionized an OpenAI-powered chatbot using LangChain with prompt design, guardrails, and evaluation via production logs and user feedback, and has led incremental legacy-to-microservices modernization with parallel run to avoid regressions.”
Senior Full-Stack Developer specializing in cloud-native FinTech and AI platforms
“Full-stack engineer with strong production ownership: built and operated a real-time transaction monitoring/fraud-alerting system using Java Spring Boot, Kafka, Docker, and AWS with CI/CD. Demonstrates metrics-driven operations (latency, stability, consumer lag, true/false positives) and reliability patterns for integrations (idempotency, retries/backoff, DLQs, reconciliation/backfills), plus modern React/TypeScript + Node/Postgres architecture experience.”
Mid-level Software Engineer specializing in LLM, RAG, and cloud AI
“Recent master’s graduate who led a team project building an LLM-based chatbot with RBAC-controlled information disclosure and a focus on reducing hallucinations. Also has hands-on embedded robotics experience (Arduino obstacle-avoiding robot using ultrasonic sensors) and practical DevOps/cloud deployment exposure with Docker, Terraform, Jenkins, and AWS (EKS/ECS/CodePipeline).”
Mid-level Machine Learning Engineer specializing in LLM platforms and robotic perception
“Built and shipped a production multi-agent personal financial assistant at AlphevaAI on AWS ECS, combining FastAPI microservices, Redis/SQS orchestration, and Pinecone-based hybrid RAG (semantic + BM25) to ground financial guidance. Improved routing accuracy with an embedding-based SetFit + logistic regression intent classifier feeding an LLM router, and optimized UX with live streaming plus cost controls via model tiering and caching.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps
“Built a production RAG-based healthcare chatbot to retrieve patient medical documents spread across multiple platforms, reducing manual and error-prone searching. Implemented semantic search with custom embeddings (Hugging Face) and Pinecone, deployed via FastAPI/Docker on AWS SageMaker with MLflow tracking, and optimized fine-tuning cost using LoRA while orchestrating retraining pipelines in Airflow.”
Mid-level Generative AI Engineer specializing in LLM agents and RAG
“GenAI/LLM engineer who built and deployed a production RAG system for enterprise document search and decision support, cutting manual lookup time by 40%+. Experienced with LangChain/LangGraph agent orchestration plus Airflow/Prefect for ingestion and incremental reindexing, with a strong focus on reliability (testing, observability) and stakeholder-driven metrics.”
Mid-level AI Engineer specializing in multi-agent systems and RAG
“Built and shipped a production LangGraph-based multi-agent LLM analytics/decision copilot that answers questions across SQL/BI systems and unstructured docs, emphasizing grounded, tool-verified outputs with citations and confidence gating. Deep hands-on experience with orchestration (LangGraph, CrewAI, OpenAI Assistants, MCP) plus real-world latency/cost optimization (vLLM batching/KV caching, speculative decoding, quantization) and rigorous eval/observability. Partnered closely with business/ops stakeholders to deliver explainable reporting automation, cutting manual reporting time by 50%+.”
Junior Data Scientist specializing in ML, geospatial analytics, and LLM applications
“Built and deployed a production AI “term explainer” agent that adapts explanations to beginner/intermediate/expert users by combining multi-step LLM reasoning with grounded Wikipedia retrieval. Owns end-to-end agent orchestration (smolagents/Python), reliability patterns (fallback across LLM providers, retries, guardrails), and observability/metrics-driven evaluation; also partnered with a non-technical researcher to deliver a plain-language research assistant agent.”
Mid-level AI/ML Engineer specializing in GenAI, MLOps, and anomaly detection
“LLM/MLOps engineer who has shipped a production RAG-based technical documentation assistant (FastAPI) cutting manual review by 45%, with deep hands-on retrieval optimization in Pinecone/LangChain (HNSW, hybrid + multi-query search, caching). Also brings healthcare domain experience—building Airflow-orchestrated EHR pipelines and delivering FDA-auditability-friendly predictive maintenance solutions using SHAP/LIME explainability surfaced in Power BI.”
Mid-level AI Engineer specializing in LLMs, RAG, and data engineering
“AI Engineer Co-Op at Northeastern University who built a production Patient Persona Chat Bot to help nursing students practice clinical interactions, fine-tuning Llama 3 and integrating a LangChain + Pinecone RAG pipeline deployed on Amazon Bedrock. Emphasizes clinical accuracy and reliability with guardrails, retrieval filtering, and continuous evaluation, and also brings strong data engineering/orchestration experience (Airflow, EMR/PySpark, ADF, dbt, Databricks, Snowflake).”
Mid-level Machine Learning Engineer specializing in NLP, recommender systems, and MLOps
“ML/LLM engineer with production experience at General Motors building Transformer-based search and recommendation personalization for a high-traffic vehicle platform. Delivered significant KPI gains (17% conversion lift, 14% bounce-rate reduction) and optimized real-time inference via ONNX Runtime and INT8 quantization while implementing robust MLOps (Airflow/MLflow, monitoring, drift-triggered retraining) and stakeholder-facing explainability/dashboards.”
Mid-level AI/ML Engineer specializing in financial risk, fraud analytics, and forecasting
“Built and productionized an LLM-powered financial intelligence and forecasting platform at Northern Trust using a RAG architecture (LangChain + Hugging Face + FAISS) with end-to-end MLOps (Docker/Kubernetes, Airflow, MLflow). Emphasized regulatory-grade explainability (SHAP/Power BI) and hallucination control (retrieval-only grounding), achieving ~30% forecasting accuracy improvement and ~65% reduction in analyst research time, with sub-second inference and 95% uptime on EKS/AKS.”
Mid-level AI/ML Engineer specializing in production RAG systems and MLOps
“Built and deployed a GPT-4 + Pinecone RAG system that lets users query large internal document collections with grounded, cited answers. Demonstrates strong applied LLM engineering (chunking experiments, hallucination controls, metadata recency boosting) plus production-minded evaluation/monitoring and performance tuning (rate-limit mitigation via pooling/batching). Also effective at translating complex AI concepts to non-technical stakeholders through prototypes and live demos, helping secure client sponsorship.”
Senior LLM Engineer specializing in Generative AI, RAG, and multimodal assistants
“GenAI/NLP engineer with experience building classification and summarization pipelines in PyTorch and deploying multimodal GPT-4-style workflows. Has integrated LLM applications across OpenAI, Azure OpenAI, and Amazon Bedrock, and uses LangChain/LlamaIndex/Semantic Kernel to orchestrate RAG and agent workflows with production-focused evaluation metrics like task success rate and groundedness.”
Mid-level GenAI/ML Engineer specializing in LLM systems and RAG chatbots
“Built and shipped a production agentic LLM analytics platform that lets non-SQL business users query relational databases in plain English via a RAG + LangChain/LangGraph workflow and FastAPI service. Emphasizes safety and reliability with guardrails (validation/access control), testing/evaluation frameworks, and performance optimization (caching, monitoring, Dockerized scalable deployment), reducing dependency on data teams and speeding analytics turnaround.”
Mid-level Data Scientist / AI-ML Engineer specializing in RAG, MLOps, and real-time analytics
“Software/ML engineer who built a production automated job-finding and cold-email personalization system for Fortune 500 outreach, using JobSpy for dynamic scraping, LangChain orchestration, and LLM+vector DB semantic search with grounding/relevance metrics and guardrails. Also delivered a predictive investment analytics platform for financial advisors, communicating results via Tableau dashboards and portfolio KPIs like Sharpe ratio and drawdowns.”
Mid-Level Full-Stack Engineer specializing in cloud-native e-commerce and AI/ML systems
“Full-stack engineer with strong ownership in fast-moving environments: designed and shipped a pre-order/campaign inventory system (NestJS + Strapi + Datadog) that freed 34% warehouse space and reduced stock risk to ~5.7%. Also built rapid, high-impact logistics features (Spot Sales) that drove last-mile cost to ~0 in ~40 days, and has hands-on AWS/Terraform/CI-CD experience including deploying a global RAG system with Pinecone, Datadog, and PagerDuty.”
Mid-level AI/ML Engineer specializing in Generative AI and MLOps
“Built and shipped a production RAG assistant using GPT-4, LangChain, and Pinecone/FAISS to search 50K+ institutional documents, with a strong focus on groundedness and hallucination reduction through retrieval optimization and re-ranking. Pairs this with a metrics-driven evaluation/monitoring approach (BLEU/ROUGE, manual sampling, logging) and workflow automation via Airflow, and has experience translating stakeholder needs into iterative AI prototypes.”
Junior Software Engineer specializing in cloud-native microservices and applied NLP
“Backend engineer who built an AI-driven "Smart Feedback Analyzer" API (Flask → FastAPI) that processes user feedback with NLP (Hugging Face + OpenAI) and returns structured insights. Demonstrates strong production-minded architecture: stateless services, Cloud Run + Docker deployment, Redis/Celery background processing, and Postgres/SQLAlchemy performance tuning (EXPLAIN ANALYZE, indexing, N+1 fixes), plus multi-tenant data isolation via JWT/API-key derived tenant IDs.”
Entry-Level AI/ML Engineer specializing in LLM automation and RAG systems
“AI Automation Engineer at BalancedTrust who single-handedly shipped production LLM features for FinTech compliance: a policy gap-analysis pipeline (SOC 2/GDPR) and a RAG-based regulatory chatbot. Deeply focused on reliability in high-stakes legal/compliance settings, with strong production engineering (edge functions, parallelized batching to cut latency, structured JSON outputs, guardrails, and monitoring) and close collaboration with non-technical compliance experts.”
Senior AI/ML Engineer specializing in financial risk, fraud detection, and GenAI analytics
“AI/ML engineer with experience at Northern Trust and Persistent Systems building production LLM + RAG systems for regulated financial use cases, including liquidity forecasting, anomaly detection, and credit scoring. Emphasizes compliance-first design with explainability (SHAP), traceability (MLflow), and hallucination controls (FAISS + citation-grounded prompting), and has delivered drift-triggered retraining pipelines using Airflow and Kubernetes while translating model outputs into business-ready marketing segments.”
Mid-level AI/Backend Engineer specializing in RAG and data platforms
“Built and shipped a production LLM-powered financial Q&A interface that extracts precise numeric data from PDFs using a hybrid AWS Textract + LLM normalization pipeline, with confidence gating and guardrails to prevent unreliable answers. Experienced with LangChain-based RAG orchestration (chunking, memory, structured outputs) and collaborated closely with PMs/analysts on IRS Form 990 extraction requirements.”
Mid-level AI/ML Engineer specializing in healthcare imaging and GenAI/LLM systems
“Built and deployed a production LLM/RAG clinical document understanding and summarization system for healthcare, focused on reducing manual review time while meeting strict accuracy, latency, and compliance needs. Demonstrates strong MLOps/orchestration depth (Airflow, Kubernetes, Azure ML Pipelines) and a rigorous approach to hallucination mitigation through layered, source-grounded safeguards and stakeholder-driven requirements with physicians/compliance teams.”