It seems like the attention() method used to compute the attention mask in seq2seq_model.py (from TensorFlow's sequence-to-sequence example) is not called during decoding.
Does anyone know how to resolve this? A similar question was raised here: Visualizing attention activation in Tensorflow, but it's not clear to me how to actually get the attention-weight matrix out during decoding.
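For context, here is a rough NumPy sketch of the matrix I'm trying to extract. My (possibly wrong) understanding is that attention() computes a softmax over scores between the current decoder state and all encoder states at each step, and stacking those per-step weight vectors over the decoding steps gives the alignment matrix I want to visualize. The function and variable names below are just illustrative, not from the TensorFlow code:

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over the last axis."""
    e = np.exp(x - np.max(x, axis=-1, keepdims=True))
    return e / np.sum(e, axis=-1, keepdims=True)

def attention_weights(decoder_state, encoder_states):
    """Dot-product scores of one decoder state against all encoder
    states, normalized into a probability distribution.
    (Illustrative only; the real attention() uses its own scoring.)"""
    scores = encoder_states @ decoder_state  # shape: (T_enc,)
    return softmax(scores)                   # shape: (T_enc,)

# Toy sizes: 5 encoder steps, 3 decoder steps, hidden size 4.
rng = np.random.default_rng(0)
encoder_states = rng.normal(size=(5, 4))

# During decoding, collect the weights at every output step; stacking
# them gives the (T_dec, T_enc) matrix I'd like to plot.
alignment = []
for step in range(3):
    decoder_state = rng.normal(size=(4,))
    alignment.append(attention_weights(decoder_state, encoder_states))
alignment = np.stack(alignment)
print(alignment.shape)  # (3, 5): one row of attention weights per output token
```

My guess is that the decoder would need to be modified to return these per-step weights as an extra output so they can be fetched at decode time, but I'm not sure how to do that cleanly with the example code.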
Thanks!