Pre-screened and vetted.
Mid-level AI Backend Engineer specializing in Python, LLM/RAG, and healthcare/insurance platforms
“AI Backend Engineer in MetLife’s claims technology group who built and deployed a production LLM-based decision support system that helps claim adjusters quickly find relevant policy rules from long PDFs and historical notes. Designed it as multiple production-grade services with retrieval-first guardrails, continuous validation, and Airflow-orchestrated pipelines for ingestion, embeddings, and vector index updates to keep the system reliable as policies and data evolve.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps
“Built a production RAG-based healthcare chatbot to retrieve patient medical documents spread across multiple platforms, reducing manual and error-prone searching. Implemented semantic search with custom embeddings (Hugging Face) and Pinecone, deployed via FastAPI/Docker on AWS SageMaker with MLflow tracking, and optimized fine-tuning cost using LoRA while orchestrating retraining pipelines in Airflow.”
Mid-level AI/ML Engineer specializing in GenAI, MLOps, and anomaly detection
“LLM/MLOps engineer who has shipped a production RAG-based technical documentation assistant (FastAPI) cutting manual review by 45%, with deep hands-on retrieval optimization in Pinecone/LangChain (HNSW, hybrid + multi-query search, caching). Also brings healthcare domain experience—building Airflow-orchestrated EHR pipelines and delivering FDA-auditability-friendly predictive maintenance solutions using SHAP/LIME explainability surfaced in Power BI.”
Mid-level AI/ML Engineer specializing in NLP, RAG, and MLOps for FinTech
“ML/LLM engineer with production experience building a compliant RAG-based virtual assistant at Intuit, optimizing embeddings and FAISS retrieval (including PCA) for low-latency, privacy-controlled search and deploying via AWS SageMaker containers. Also built scalable Airflow + MLflow pipelines using Docker and KubernetesExecutor, cutting training cycles by 37%, and partnered with civil engineers/project managers at Aegis Infra to deliver predictive maintenance for construction equipment.”
Mid-level AI Engineer & Data Scientist specializing in LLMs, RAG, and multimodal systems
“LLM/GenAI engineer who built a production AI-powered credit risk policy summarization and compliance alerting platform at HCL Tech, focused on factual accuracy and auditability for a financial client. Implemented a multi-retriever LangChain RAG architecture with citations-only prompting, fallback agents, and human-in-the-loop legal review—cutting manual review time by 35% and scaling to 12 teams.”
Junior Machine Learning Engineer specializing in NLP, computer vision, and MLOps
“ML/LLM engineer with Meta experience building production AI systems for near real-time user-report classification and summarization under strict latency (<250 ms), safety, cost, and privacy constraints. Has hands-on MLOps/orchestration experience (Airflow, Spark, MLflow, Kubernetes, Docker, GitHub Actions) plus observability (Prometheus/Grafana) and applies rigorous evaluation, staged rollouts, and A/B testing to keep agent workflows reliable in production.”
Junior Full-Stack Software Engineer specializing in React and FinTech
“Full-stack engineer with banking-domain experience (Cognizant/Kotak) building and optimizing high-usage transaction/account APIs on Spring Boot/Node/PostgreSQL in AWS/Docker, including peak-load performance fixes. Also built an end-to-end retail demand-forecasting feature during a master’s program, spanning data pipelines, ensemble models, dashboards, and operational guardrails like validation and fallbacks.”
Junior Machine Learning Engineer specializing in NLP and multimodal transformers
“Built and deployed LLM-powered agentic chatbot and text-to-SQL systems using LangGraph/LangChain (and Bedrock), structuring workflows as DAGs with planning/replanning and validation to improve tool-calling reliability and reduce hallucinations. Operates production feedback loops with online/offline metrics, drift detection, and LangSmith-based evaluation pipelines, and regularly partners with business stakeholders and clinicians using slide decks and visual charts.”
Mid-level AI/ML Engineer specializing in predictive modeling and cloud ML pipelines
“LLM engineer/data engineer who has deployed production RAG systems for internal-document Q&A, building end-to-end ingestion, embedding, vector search, and FastAPI serving while actively reducing hallucinations and latency through rigorous retrieval tuning and caching. Also experienced in orchestrating cloud data pipelines (Airflow, AWS Glue, Azure Data Factory) and partnering with non-technical business teams to deliver AI solutions like automated document review.”
Mid-level AI/ML Engineer specializing in production ML, RAG systems, and MLOps
“Built and shipped a widely adopted, production-grade RAG internal search assistant that unified scattered engineering knowledge, deployed as a FastAPI service on Kubernetes with FAISS + LangChain. Demonstrates deep practical expertise in retrieval tuning (chunking, hybrid search, re-ranking) and in making LLM workflows reliable in production via guardrails, monitoring, and evaluation, plus strong cross-functional delivery with non-technical operations teams.”
Mid-level Software Engineer specializing in AI/ML for FinTech and Healthcare
“Built and deployed an end-to-end fintech product, FinSight, for bank statement analysis and financial Q&A using a production-style RAG architecture. Stands out for combining FastAPI, OpenAI embeddings, FAISS, hybrid SQL/vector retrieval, and practical reliability work like chunking optimization, validation, and low-latency performance tuning.”
Mid-level AI/ML Engineer specializing in applied AI for banking and healthcare
“Built end-to-end AI products across fintech and healthcare, including a real-time loan risk prediction system and a patient feedback insights platform. Stands out for combining full-stack delivery, production ML/MLOps on AWS, and pragmatic human-in-the-loop safeguards; reported a 22% improvement in prediction accuracy.”
Senior AI Engineer specializing in machine learning, IoT, and data platforms
“Backend/cloud engineer who built an AWS serverless IoT system that computes Bluetooth beacon locations from telemetry using heavy scientific Python (NumPy/SciPy/pandas) packaged as Dockerized Lambda, integrated with Java microservices and scheduled batch orchestration. Has deep AWS delivery experience (CI/CD with Code* tools, CloudFormation, cost controls) and has led high-severity incident response including CloudTrail forensics and infrastructure recovery after a compromised-keys crypto-mining attack.”
Junior AI & Data Engineer specializing in LLM systems and analytics platforms
“Backend/ML engineer who built a job-search automation SaaS using a modular Selenium ETL pipeline, rigorous testing/observability, and a cost-optimized two-pass LLM ranking approach. Has led high-integrity data extraction from messy multi-city PDF records (95% integrity) and managed modular production rollouts for a 20+ engineer team, with a strong security focus (deny-by-default, row-level access control) in an AI-assisted grading platform.”
Mid-level AI/ML Engineer specializing in MLOps and cloud-deployed ML systems
“ML/AI engineer who built and productionized an NLP system at PurevisitX, orchestrating end-to-end ML workflows with Airflow (S3 ingestion through auto-retraining) and optimizing for drift and low-latency inference. Also partnered with Citibank risk teams on a fraud detection model, translating results via dashboards and iterating thresholds based on stakeholder feedback.”
Mid-level Machine Learning Engineer specializing in LLMs, NLP, and MLOps
“Built a production LLM-RAG system at McKesson to let internal healthcare operations teams query large volumes of unstructured operational documents via natural language with source-backed answers, designed with HIPAA/FHIR compliance in mind. Demonstrated strong production engineering across hallucination mitigation, retrieval quality tuning, and latency/scalability optimization, using LangChain/LangGraph and Airflow plus rigorous evaluation/monitoring practices.”
Mid-level Machine Learning Engineer specializing in real-time pipelines and NLP/GenAI
“ML/MLOps practitioner from Discover Financial who built and deployed a real-time AI fraud detection platform (LSTM + VAE) on AWS SageMaker with Docker/FastAPI and Jenkins-driven CI/CD. Demonstrated measurable impact (30% accuracy lift, 25% fewer false alerts) and deep expertise in class-imbalance mitigation, drift monitoring, and orchestration (Airflow/Kubeflow), plus strong stakeholder adoption via Power BI dashboards for fraud/compliance teams.”
Mid-level GenAI/ML Engineer specializing in LLM systems and RAG chatbots
“Built and shipped a production agentic LLM analytics platform that lets non-SQL business users query relational databases in plain English via a RAG + LangChain/LangGraph workflow and FastAPI service. Emphasizes safety and reliability with guardrails (validation/access control), testing/evaluation frameworks, and performance optimization (caching, monitoring, Dockerized scalable deployment), reducing dependency on data teams and speeding analytics turnaround.”
Intern AI/ML Engineer specializing in NLP, computer vision, and reinforcement learning
“Built an Arduino-based obstacle-avoiding robot using sonar/laser sensors and improved performance from 0.60 to 0.87 accuracy through sensor-fusion thresholding and iterative tuning. In an internship, optimized a legal-document NLP pipeline by switching to a distilled/quantized transformer and offloading inference to a GPU-backed Flask service, cutting inference time by 40%+ without added infrastructure spend.”
Mid-level AI/ML Engineer specializing in Generative AI and MLOps
“Built and shipped a production RAG assistant using GPT-4, LangChain, and Pinecone/FAISS to search 50K+ institutional documents, with a strong focus on groundedness and hallucination reduction through retrieval optimization and re-ranking. Pairs this with a metrics-driven evaluation/monitoring approach (BLEU/ROUGE, manual sampling, logging) and workflow automation via Airflow, and has experience translating stakeholder needs into iterative AI prototypes.”
Mid-level Machine Learning Engineer specializing in IoT, edge AI, and enterprise ML
“Built and productionized an LLM/RAG question-answering service over technical documentation, focusing on retrieval quality (reranking + IR metrics), latency, and scaling. Experienced orchestrating end-to-end ETL/ML workflows with Airflow/Prefect/AWS Step Functions and improving reliability via parallelism, retries, and shadow testing. Also delivered an explainable healthcare risk-flagging classifier with a stakeholder-friendly dashboard for a non-technical program manager.”
Senior Data Scientist specializing in ML, NLP, and production AI systems
“Machine learning/NLP engineer with deep Azure stack experience (Data Factory, Databricks/Spark, Delta Lake, Azure OpenAI, Azure AI Search) who built end-to-end production systems for semantic clustering, entity resolution, and hybrid search. Demonstrated measurable gains from embedding fine-tuning (~15% in retrieval precision, ~10–12% in nDCG@10) and designed scalable, quality-checked pipelines with MLOps best practices.”
Senior LLM Engineer specializing in Generative AI, RAG, and multimodal assistants
“GenAI/NLP engineer with experience building classification and summarization pipelines in PyTorch and deploying multimodal GPT-4-style workflows. Has integrated LLM applications across OpenAI, Azure OpenAI, and Amazon Bedrock, and uses LangChain/LlamaIndex/Semantic Kernel to orchestrate RAG and agent workflows with production-focused evaluation metrics like task success rate and groundedness.”
Mid-level AI/ML Engineer specializing in production RAG systems and MLOps
“Built and deployed a GPT-4 + Pinecone RAG system that lets users query large internal document collections with grounded, cited answers. Demonstrates strong applied LLM engineering (chunking experiments, hallucination controls, metadata recency boosting) plus production-minded evaluation/monitoring and performance tuning (rate-limit mitigation via pooling/batching). Also effective at translating complex AI concepts to non-technical stakeholders through prototypes and live demos, helping secure client sponsorship.”