My question is similar to this one: How to update model parameters with accumulated gradients?
I have a large network and a very small batch size. To work around this, I want to accumulate gradients over multiple forward/backward passes and then update the parameters once using the mean gradient.
However, my network contains batch normalization (BN) layers, which compute their statistics per micro-batch. How should I handle them when accumulating gradients? A sketch of the accumulation loop I have in mind is below.
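For concreteness, this is the kind of loop I mean. It is a minimal sketch, assuming PyTorch (no framework is specified above) with a toy model standing in for the real network; all names and sizes here are illustrative only. Note how the BN layer still only ever sees one tiny micro-batch per forward pass.

```python
import torch
import torch.nn as nn

# Toy stand-ins for the real (much larger) network and data; illustrative only.
model = nn.Sequential(nn.Linear(10, 32), nn.BatchNorm1d(32), nn.ReLU(), nn.Linear(32, 2))
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
criterion = nn.CrossEntropyLoss()

accum_steps = 8        # number of micro-batches whose gradients are averaged per update
micro_batch_size = 4   # the "very small" batch size that fits in memory

optimizer.zero_grad()
for step in range(accum_steps * 10):
    x = torch.randn(micro_batch_size, 10)
    y = torch.randint(0, 2, (micro_batch_size,))
    loss = criterion(model(x), y)          # BN statistics come from this micro-batch only
    (loss / accum_steps).backward()        # scale so the summed grads equal the mean gradient
    if (step + 1) % accum_steps == 0:
        optimizer.step()                   # one parameter update with the accumulated mean gradient
        optimizer.zero_grad()
```

The parameter update behaves as if the batch were `accum_steps * micro_batch_size` samples, but the BN running mean/variance (and the normalization itself) are still computed from only `micro_batch_size` samples at a time, which is what I am unsure how to handle.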