Hi!
I have difficulties understanding exactly the nature of the input and output of the model? X[:-1,:-1] and T[:-1,1:] respectively.
What represent X and T exactly ? Should T be a list of words ? and X a list of a vector of word preceding T or just the preceding word.
Then why the matrices X[:-1,:-1] and T[:-1,1:] ? And why cutting the last element of the dataset ?
Some code are given (this is great but...) and i tried to adapt mine to it but it is not well explained how should be things.
Thank you for your reply.