
I am interested in building reinforcement learning models with the simplicity of the Keras API. Unfortunately, I am unable to extract the gradient of the output (not error) with respect to the weights. I found the following code that performs a similar function (Saliency maps of neural networks (using Keras))

import theano
import theano.tensor as T

# Jacobian of the (flattened) model output with respect to the model input
get_output = theano.function([model.layers[0].input], model.layers[-1].output, allow_input_downcast=True)
fx = theano.function([model.layers[0].input], T.jacobian(model.layers[-1].output.flatten(), model.layers[0].input), allow_input_downcast=True)
grad = fx([trainingData])

Any ideas on how to calculate the gradient of the model output with respect to the weights for each layer would be appreciated.

Matt S
  • Have you made any progress? I am getting the following error using a similar saliency function: https://github.com/fchollet/keras/issues/1777#issuecomment-250040309 – ssierral Sep 28 '16 at 01:02
  • I have not had any success with Keras. However, I have been able to do this using tensorflow. – Matt S Oct 07 '16 at 21:38
  • https://github.com/yanpanlau/DDPG-Keras-Torcs CriticNetwork.py uses the tensorflow backend to calculate gradients while using Keras for actually building the net architecture – Matt S Oct 14 '16 at 21:17
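
A rough sketch of the pattern that last comment points at (Keras builds the network, the TensorFlow backend computes the gradients); the layer sizes and variable names are illustrative assumptions, not taken from CriticNetwork.py:

import numpy as np
import tensorflow as tf
from keras.models import Sequential
from keras.layers import Dense

# Build the architecture with Keras as usual.
model = Sequential()
model.add(Dense(16, input_dim=8, activation='relu'))
model.add(Dense(1, activation='linear'))

# Ask TensorFlow directly for the gradients of the output w.r.t. the weights.
output_grads = tf.gradients(model.output, model.trainable_weights)

sess = tf.InteractiveSession()
sess.run(tf.global_variables_initializer())
grad_values = sess.run(output_grads, feed_dict={model.input: np.random.random((1, 8))})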

2 Answers


To get the gradients of the model output with respect to the weights using Keras, you have to use the Keras backend module. I created this simple example to illustrate exactly what to do:

from keras.models import Sequential
from keras.layers import Dense
from keras import backend as k


# A small binary-classification MLP to illustrate the steps below.
model = Sequential()
model.add(Dense(12, input_dim=8, init='uniform', activation='relu'))
model.add(Dense(8, init='uniform', activation='relu'))
model.add(Dense(1, init='uniform', activation='sigmoid'))
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

To calculate the gradients, we first need to find the output tensor. For the output of the model (what my initial question asked about), we simply call model.output. We can also find the gradient of the output of any other layer by calling model.layers[index].output.

outputTensor = model.output #Or model.layers[index].output

Then we need to choose the variables with respect to which we take the gradient.

listOfVariableTensors = model.trainable_weights
# or variableTensors = model.trainable_weights[0] for a single weight tensor

We can now calculate the gradients. It is as easy as the following:

gradients = k.gradients(outputTensor, listOfVariableTensors)

To actually evaluate the gradients for a given input, we need to use a bit of TensorFlow.

import numpy as np
import tensorflow as tf

trainingExample = np.random.random((1, 8))
sess = tf.InteractiveSession()
sess.run(tf.initialize_all_variables())  # tf.global_variables_initializer() in newer TF 1.x releases
evaluated_gradients = sess.run(gradients, feed_dict={model.input: trainingExample})

And that's it!
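
Since the original question asked for the gradient with respect to the weights of each layer: k.gradients returns one gradient tensor per entry of model.trainable_weights, in the same order, so the evaluated list can be paired with the weight tensors layer by layer. A small follow-up sketch, reusing the variables defined above (the loop and printing are only illustrative):

# Pair each evaluated gradient with the weight tensor it was taken against.
evaluated_gradients = sess.run(gradients, feed_dict={model.input: trainingExample})
for weight_tensor, grad_value in zip(listOfVariableTensors, evaluated_gradients):
    print(weight_tensor.name, grad_value.shape)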

Salvatore
Matt S
  • I've run this code (with theano as backend) and the following error is raised: "TypeError: cost must be a scalar.". I wonder, can this be achieved with a backend-agnostic approach? – bones.felipe Mar 11 '17 at 22:24
  • Matt S, how do the gradients get calculated without specifying the labels in sess.run? – Aleksandar Jovanovic Jan 13 '18 at 12:16
  • I am taking gradient w.r.t input. If you want gradient w.r.t loss then you need to define the loss function, replace outputTensor in k.gradients with loss_fn, and then pass the labels to the feed dict. – Matt S Jan 16 '18 at 19:17 (a sketch of this appears after these comments)
  • I believe you meant 'gradient w.r.t. output.' – sahdeV Mar 08 '18 at 10:38
  • @MattS hi, Matt. Just saw your answer and could you please explain how to pass the labels to the feed dict? Thank you so much! – beepretty Jul 22 '18 at 01:00
  • The problem with this solution is that it doesn't solve the problem of how to get those gradients out of Keras at training time. Sure, for some random toy input I can just do what you wrote above, but if I want the gradients that were computed in an actual training step performed by Keras' fit() function, how do I get those? They are not part of the fetch list that is passed to sess.run() somewhere deep down in the depths of the Keras code, so I can't have those unless I spend a month of understanding and rewriting the Keras training engine :/ – Alex Jan 27 '19 at 22:48
  • @Alex, they're inside the optimizer. Some inspiration: https://stackoverflow.com/questions/51140950/how-to-obtain-the-gradients-in-keras – Daniel Möller Mar 07 '19 at 00:56
  • Reading the code for the `SGD` optimizer also brings some ideas. – Daniel Möller Mar 07 '19 at 01:19
  • For `gradients = k.gradients(outputTensor, listOfVariableTensors)`, it works only when the outputTensor is a scalar tensor. K.gradients() expects the first parameter to be the loss (usually a scalar tensor). (I know for the original question, it asks the gradient w.r.t the output) – kz28 Mar 18 '20 at 14:33
  • Very nice answer! Thank you, this helped me a lot. I want to emphasize that the `gradients` vector consists of the gradients of all layers up to the requested layer, so the gradient of the output of the model can be obtained via `gradients[-1]`. Moreover, you can take the gradient of this `gradients` vector; in that case, the last item in the result would be `None` – sajed zarrinpour Apr 04 '20 at 13:44
  • @Alex I am facing this problem too, particularly trying to log gradient information to TensorBoard when using Keras. Did you find a solution? I have seen that the `write_grads` parameter of `keras.callbacks.TensorBoard` is deprecated but could not find a valid alternative. – roschach Jun 21 '20 at 15:00
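
To illustrate what Matt S describes in the comments above (taking the gradient of a loss rather than of the raw output, with the labels supplied through the feed dict), here is a rough sketch; the placeholder shape and the names labels_ph and trainingLabel are assumptions made for the example, and it reuses model, k, np, tf, sess and trainingExample from the accepted answer:

# Gradient of a scalar loss w.r.t. the weights, with the labels fed in at run time.
labels_ph = tf.placeholder(tf.float32, shape=(None, 1))        # placeholder for y_true (assumed shape)
loss = k.mean(k.binary_crossentropy(labels_ph, model.output))  # scalar loss tensor
loss_gradients = k.gradients(loss, model.trainable_weights)

trainingLabel = np.array([[1.0]])                              # dummy label for the single example
evaluated_loss_gradients = sess.run(
    loss_gradients,
    feed_dict={model.input: trainingExample, labels_ph: trainingLabel})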

The answer below uses the binary cross-entropy loss; feel free to swap in your own loss function.

import numpy as np
import tensorflow as tf
import keras
from keras import backend as k

# `labels` and `training_data1` are assumed to be numpy arrays matching the
# model's output and input shapes, respectively.
outputTensor = model.output
listOfVariableTensors = model.trainable_weights
bce = keras.losses.BinaryCrossentropy()
loss = bce(labels, outputTensor)  # y_true first, y_pred second
gradients = k.gradients(loss, listOfVariableTensors)

sess = tf.InteractiveSession()
sess.run(tf.global_variables_initializer())
evaluated_gradients = sess.run(gradients, feed_dict={model.input: training_data1})
print(evaluated_gradients)
Scratte