Consider the task of predicting the sentiment of a text document as either {positive, negative, neutral}. We want to use a neural network to learn a model for this task, given a training dataset of the form { (documenti, labeli) } i in [1..N]. Each document di contains Ni words (wi,1, ..., wi, Ni), where the words wi,j belong to the vocabulary V = { w1, ..., w|V| }. Describe how the documents are encoded as vectors of size |V| for each of the following two methods:
Bag of Words
Tf-Idf weights