Top "Recurrent-neural-network" questions

A recurrent neural network (RNN) is a class of artificial neural network where connections between units form a directed cycle.

What is the intuition of using tanh in LSTM

In LSTM Network (Understanding LSTMs), Why input gate and output gate use tanh? what is the intuition behind this? it …

machine-learning deep-learning lstm recurrent-neural-network activation-function
Tensorflow TypeError: Fetch argument None has invalid type <type 'NoneType'>?

I'm building a RNN loosely based on the TensorFlow tutorial. The relevant parts of my model are as follows: input_…

python artificial-intelligence tensorflow typeerror recurrent-neural-network
How to get the output shape of a layer in Keras?

I have the following code in Keras (Basically I am modifying this code for my use) and I get this …

python keras lstm recurrent-neural-network
why do we "pack" the sequences in pytorch?

I was trying to replicate How to use packing for variable-length sequence inputs for rnn but I guess I first …

deep-learning pytorch recurrent-neural-network tensor zero-padding
What's the difference between "hidden" and "output" in PyTorch LSTM?

I'm having trouble understanding the documentation for PyTorch's LSTM module (and also RNN and GRU, which are similar). Regarding the …

deep-learning pytorch lstm recurrent-neural-network tensor
Pytorch - RuntimeError: Trying to backward through the graph a second time, but the buffers have already been freed

I keep running into this error: RuntimeError: Trying to backward through the graph a second time, but the buffers have …

python deep-learning pytorch recurrent-neural-network autograd
logits and labels must be broadcastable error in Tensorflow RNN

I am new to Tensorflow and deep leaning. I am trying to see how the loss decreases over 10 epochs in …

python tensorflow machine-learning deep-learning recurrent-neural-network
How to use return_sequences option and TimeDistributed layer in Keras?

I have a dialog corpus like below. And I want to implement a LSTM model which predicts a system action. …

deep-learning keras lstm recurrent-neural-network
Tensorflow: Attempting to use uninitialized value beta1_power

I got the following error when I try to run the code at the end of the post. But it …

python machine-learning tensorflow lstm recurrent-neural-network
Soft attention vs. hard attention

In this blog post, The Unreasonable Effectiveness of Recurrent Neural Networks, Andrej Karpathy mentions future directions for neural networks based …

machine-learning neural-network recurrent-neural-network