Dear TAs,
We noticed that the text provided as the dataset for project 2 is coherent only over a couple of sentences at a time: there is a question, a coherent answer, and then an abrupt transition to another topic.
With this in mind, for the RNN vs LSTM vs GRU part we were wondering how to build the training set X and the labels T (in the notation used in the notebook) for the recurrent networks. Should X consist of all the sequences obtained from a sliding window over the entire text (ignoring the changes of topic), with the word following each window as its label T? Or should we instead apply the same procedure separately to each question-and-answer pair?
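To make the two options concrete, here is a minimal sketch of what we have in mind. It is not taken from the notebook: the function names, the window size of 3, and the toy question/answer pairs are all made up for illustration, and we assume word-level tokens with the pairs already separated.

```python
def sliding_windows(tokens, window):
    """Return (X, T): X[i] is `window` consecutive tokens, T[i] is the token that follows."""
    X, T = [], []
    for i in range(len(tokens) - window):
        X.append(tokens[i:i + window])
        T.append(tokens[i + window])
    return X, T


def windows_over_full_text(pairs, window):
    # Option 1: concatenate all pairs and slide over the whole text,
    # ignoring the changes of topic.
    tokens = [tok for pair in pairs for tok in pair]
    return sliding_windows(tokens, window)


def windows_per_pair(pairs, window):
    # Option 2: slide only inside each question/answer pair,
    # so no window spans a change of topic.
    X, T = [], []
    for pair in pairs:
        x, t = sliding_windows(pair, window)
        X.extend(x)
        T.extend(t)
    return X, T


# Made-up toy data (an assumption): one tokenized question/answer pair per list.
pairs = [
    "what is a cat ? a small animal".split(),
    "where is rome ? in italy".split(),
]
X_full, T_full = windows_over_full_text(pairs, window=3)
X_pair, T_pair = windows_per_pair(pairs, window=3)
```

On this toy data the first option yields 11 samples and the second 8. The three extra samples are exactly those whose window or label crosses the boundary between the two pairs, for example the window "a small animal" with label "where".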
Best,
Luca