Hello,
We have a question concerning the chatbot miniproject.
When creating the three different models in part 2, our output is a 3-dimensional tensor of shape (number of sentences, maxlen, vocabulary size). It is indeed a probability distribution over words, but there is one for each word position in the sentence. Is this a correct solution, or should we only output the probability distribution for the last word of a sentence? We were also thinking of passing this output to another neural network to get proper transition probabilities.
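To make the question concrete, here is a minimal NumPy sketch of the two options. The sizes and the random scores are made up for illustration and merely stand in for our models' output; taking index -1 as the "last word" also assumes that sentences are not padded at the end.

```python
import numpy as np

# Hypothetical sizes, chosen only for illustration.
n_sentences, maxlen, vocab_size = 4, 6, 10

# Random scores standing in for the model's pre-softmax output.
rng = np.random.default_rng(0)
logits = rng.normal(size=(n_sentences, maxlen, vocab_size))

# Softmax over the vocabulary axis (shifted by the max for numerical stability).
shifted = logits - logits.max(axis=-1, keepdims=True)
probs = np.exp(shifted)
probs /= probs.sum(axis=-1, keepdims=True)

# Option 1 (what we currently output): one distribution per word position.
per_position = probs                 # shape (n_sentences, maxlen, vocab_size)

# Option 2: keep only the distribution at the last position of each sentence.
# Assumes no end padding, so index -1 really is the last word.
last_only = probs[:, -1, :]          # shape (n_sentences, vocab_size)

print(per_position.shape, last_only.shape)
```

In both cases each slice along the last axis sums to 1, so the only difference is whether we keep a distribution for every position or just for the final one.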
Thanks a lot,
Best regards,
Robin Leurent and Alexis Mermet.