Hi, in the below question, I don't understand how we arrive at the correct answers. In particular I don't follow why t5 is necessary. My understanding for HMM's is we only depend on the last k tags, and since k = 1, t3 is the only relevant tag
Considering an order-1 HMM model to tag a word sequence with tags t𝑘, select all of the following expressions which are equal to 𝑃(t4| t2 t3 t5 t6):