Pre-screened and vetted.
“Built an automated ML/NLP document classification system for unstructured legal documents, combining classical models (TF-IDF + logistic regression/random forest) with entity resolution via fuzzy matching validated by precision/recall. Also implemented semantic similarity search using sentence embeddings stored in FAISS and improved matching by fine-tuning a transformer on domain-specific data and tuning similarity thresholds for fewer false positives.”
Intern Machine Learning Engineer specializing in NLP, RAG, and time-series forecasting