
https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/keras/layers/dense_attention.py says: "This class is suitable for Dense or CNN networks, and not for RNN networks." Anyone know why?

https://www.tensorflow.org/api_docs/python/tf/keras/layers/Attention doesn't mention the above.

Thanks.

user4918159
  • This is the self-attention mechanism from Transformer models (which is computationally very different from attention in CNNs/RNNs). More info: https://arxiv.org/pdf/1706.03762.pdf – thushv89 Aug 24 '20 at 03:38
  • Here's a full explanation and demonstration: https://stackoverflow.com/a/61775631/10375049 – Marco Cerliani Aug 24 '20 at 07:15
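For reference, the TensorFlow docs describe `tf.keras.layers.Attention` as Luong-style dot-product attention: scores are dot products between query and key vectors, softmaxed over the keys, and used to take a weighted sum of the values. Below is a minimal NumPy sketch of that computation (the function name and shapes are illustrative, not from the question or the TF source):

```python
import numpy as np

def dot_product_attention(query, key, value):
    """Illustrative sketch of Luong-style dot-product attention.

    query: (num_queries, dim), key/value: (num_keys, dim).
    """
    # scores[i, j] = dot product of query i with key j
    scores = query @ key.T
    # softmax over the key axis (numerically stabilized)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    # each output row is a weighted sum of the value vectors
    return weights @ value

q = np.random.rand(2, 4)
kv = np.random.rand(3, 4)
out = dot_product_attention(q, kv, kv)
print(out.shape)  # (2, 4)
```

Note that nothing in this computation is recurrent: every query attends to every key in one matrix multiply, which is the kind of feed-forward pattern the docstring's "Dense or CNN" remark seems to be pointing at.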

0 Answers