Highest Voted 'gru' Questions

2

votes

1 answer

How to interpret get_weights for Keras GRU?

I am unable to interpret the results of get_weights from a GRU layer. Here's my code - #Modified from - https://machinelearningmastery.com/understanding-simple-recurrent-neural-networks-in-keras/ from pandas import read_csv import numpy as np from…

asked Jun 30 '22 at 02:14

desert_ranger

1,096
3
13
26

1

vote

0 answers

keras seq2seq with GRU instead of LSTM

I am trying to modify the code in https://keras.io/examples/nlp/lstm_seq2seq/ so it uses GRU instead of LSTM. I have managed to get it to train properly and have constructed the encoder-decoder model for inference using code from Implementing…

python tensorflow keras lstm gru

asked Aug 22 '23 at 13:56

Duncan Leung

142
1
11

1

vote

1 answer

How can I improve my recurrent neural network?

I want to implement a recurrent neural network for natural language inference. I'm new to this topic and this is a task from a module at my university, so I was given some code beforehand, which I tried to adapt for this task. The problem I have is…

python machine-learning nlp recurrent-neural-network gru

asked Jul 10 '23 at 09:26

Felix Wernlein

19
2

1

vote

1 answer

How should I reshape my data to feed into pytorch GRU network?

I've been having problems getting my data to fit the dimensions required by the PyTorch GRU. My input is a 256-long float vector, in batches of 64, so the size of a batch tensor is [64, 256]. According to the PyTorch documentation, GRU takes input of size…

python deep-learning pytorch gru

asked Jun 09 '23 at 07:36

Hubert Rybka

13
3

1

vote

1 answer

What will be the effect of using a 'break' statement within a 'for' loop in the torch forward method? -- torch graph

I want to develop a GRU-based model for variable-length input data. So I think I should use a while loop in forward and break out of it once all of the sequences have been processed. Will it affect the torch graph? Does this disturb the network…

pytorch gradient recurrent-neural-network break gru

asked May 15 '23 at 09:14

Alireza AR

11
1

1

vote

0 answers

Roberta with GRU is not training

I'm trying to fine-tune RoBERTa and integrate external knowledge via a BiGRU block. But the model is not learning (the train loss is around 0.8 and is not decreasing). There is no problem with the data, I tried some other RoBERTa-based models on the…

optimization nlp bert-language-model roberta-language-model gru

asked Apr 22 '23 at 14:55

atlas

11
1

1

vote

0 answers

Implementations of the `GRU` cell are different from the descriptions

I need to deploy a GRU cell for inference on certain hardware. As I just found, the definitions available on the Internet from multiple sources, for example https://en.wikipedia.org/wiki/Gated_recurrent_unit, do not agree with the cell implementations on both…

tensorflow pytorch recurrent-neural-network gru

asked Feb 27 '23 at 15:43

Alexey Birukov

1,565
15
22

1

vote

0 answers

training and validation losses decreasing slowly

I have implemented a 2D CNN model followed by a GRU layer: class CNN2D(nn.Module): def __init__(self, img_x=88, img_y=88, fc_hidden1=512, fc_hidden2=512, drop_p=0.3, CNN_embed_dim=512, num_classes=9): super(CNN2D, self).__init__() …

deep-learning pytorch computer-vision conv-neural-network gru

asked Feb 18 '23 at 06:35

sarah

11
3

1

vote

0 answers

MLP and LSTM in time series

Hope you are doing well. I am writing a research paper on wind energy forecasting using deep learning, in which I used three neural networks: RNN, LSTM, and MLP. The results were good, but what I found somewhat strange is the superiority of…

time-series lstm forecasting mlp gru

asked Sep 30 '22 at 17:22

Shamil

11
2

1

vote

0 answers

PyTorch pack_padded_sequence is extremely slow

I am building a GRU-based architecture. Before, I was just padding the batches of sequences and passing them to the GRU. Obviously, that was introducing some small error in the results because it's not quite the 100% correct thing to do (the GRU…

machine-learning pytorch gru

asked May 01 '22 at 03:50

hologram

533
2
5
21

1

vote

0 answers

Keras GRU input shape

I have built a custom generator that outputs X data with shape (100, 2, 2048) and Y labels belonging to 16 classes, to be passed to a GRU model for video classification. 100 is the sequence length, 2 is for 2 simultaneous camera views, each with 2048…

keras deep-learning neural-network recurrent-neural-network gru

asked Apr 26 '22 at 01:11

Hamzah Bawah

11
1
4

0

votes

0 answers

Function prediction of GRU

I have a question about some code I found on the internet, and I hope you can help me. When I create a function to predict the values that are in x_test, I get values other than those obtained from: lstm_predictions =…

pytorch gru

asked Jul 27 '23 at 21:02

Junior

1
1

0

votes

0 answers

Custom GRU implementation performing very slow

I am working on customizing the GRU layer to suit my specific requirements. To achieve this, I am implementing a custom GRU layer following the architecture and implementation of the GRU layer in Keras. However, I noticed that when I experiment…

tensorflow keras deep-learning recurrent-neural-network gru

asked Jul 26 '23 at 10:22

Bisnu Sarkar

1
1

0

votes

0 answers

My GRU Model performance is not working properly

I am trying to create a GRU model to predict energy consumption. My dataset has 700k rows, which were resampled from per-second to hourly readings, giving a final dataset of 720 rows. Upon trying to create the model, I haven't figured out what's…

model prediction energy consumption gru

asked Jul 23 '23 at 12:05

Engr. Ryan Francisco

1

0

votes

1 answer

An example usage for tf.keras.layers.GaussianDropout in TensorFlow2 for deep GRU network

There are not many examples of using tf.keras.layers.GaussianDropout in TensorFlow 2. I am converting my code from TensorFlow 1.15 to TensorFlow 2 and am having some difficulty understanding the new way of coding in TF2. So, can anyone please…

python-3.x tensorflow2.0 gaussian dropout gru

asked Jun 14 '23 at 15:06

MK 5012

29
1
9

Questions tagged [gru]