NLP

CSE256 Assignment 3: Language Modeling

In this report, we discuss various ways of building probabilistic language models, specifically N-gram models. Using the given corpora from three different domains, we first evaluate, in Section 2, the reference unigram implementation provided in the starter code.

Yi Rong

Last updated on May 4, 2022 · NLP, Machine Learning

CSE256 Assignment 1: Text Classification

In this report, we discuss various approaches to data pre-processing and feature engineering for a text classification task. We start in Section 2 with an overview of the classification task, the model used, and the given baseline implementation.

Yi Rong

Last updated on May 4, 2022 · NLP, Machine Learning