Pre-screened and vetted in the NYC Metro.
Mid-level ML Engineer specializing in real-time inference and anomaly detection
“Built DocMind, an end-to-end PDF chat assistant using React/TypeScript, FastAPI, and Postgres/pgvector, showing full-stack ownership plus practical performance tuning and AWS debugging skills. At Social Tech Labs, improved onboarding, shipped lean under ambiguity, and created a reusable low-latency feature serving layer that reduced duplicated infrastructure work across models.”
Mid-level GenAI Engineer specializing in LLM agents and RAG systems
“Built and deployed a production RAG-based LLM assistant that answers day-to-day operational questions from internal PDFs/SOPs, with strong emphasis on data consistency (metadata versioning, confidence thresholds, conflict handling) and low-latency retrieval at scale. Experienced designing and orchestrating multi-agent LLM workflows (retrieval/validation/generation) and pipeline orchestration for ingestion/embedding/vector-store updates, plus iterative delivery with non-technical operations/business stakeholders.”
Mid-level Generative AI Engineer specializing in LLMs and RAG for enterprise and FinTech
Mid-level Machine Learning Engineer specializing in GenAI, RAG, and medical imaging
Mid-level AI/ML Engineer specializing in Generative AI, RAG agents, and multimodal systems
Mid-level Generative AI/ML Engineer specializing in LLMs, RAG, and MLOps
Junior AI/ML Engineer specializing in Python ML, NLP, and model deployment
“Built and productionized a real-time social-media sentiment analysis system used by a marketing team to monitor brand/campaign performance. Experienced in orchestrating LLM workflows with LangChain (validation → prompting → parsing → post-processing), plus monitoring, retraining, and RAG-style retrieval using embeddings/vector stores to keep outputs reliable over time.”
Mid-level Machine Learning Engineer specializing in real-time AI and data platforms
“ML/NLP engineer who has built production systems end-to-end: a real-time recommendation platform (100k+ profiles) using BERTopic-style clustering and a RAG-based news summarization/recommendation stack with ChromaDB. Strong focus on scaling and reliability (GPU batching, Redis caching, Kafka ingestion, Docker/Kubernetes, Prometheus/Grafana) and on maintaining model quality over time via drift monitoring and retraining triggers.”
Mid-level GenAI Engineer specializing in AI agents and FinTech platforms