Each kernel in a convolution layer produces its own feature map, so applying multiple kernels generates multiple outputs. For example, if 5 kernels are applied to an image of dimension WxDx1 (1 channel), then 5 convolutions are applied to the data, which generates an output with 5 channels: WxDx1 becomes W'xD'x5, where W' and D' are smaller than W and D (assuming no padding).
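A minimal sketch of this shape change, assuming PyTorch (the framework and image dimensions here are my own illustration, not part of the question):

```python
import torch
import torch.nn as nn

W, D = 28, 28                      # hypothetical image dimensions
conv = nn.Conv2d(in_channels=1,    # grayscale input: W x D x 1
                 out_channels=5,   # 5 kernels -> 5 feature maps
                 kernel_size=3)    # 3x3 kernels, no padding

x = torch.randn(1, 1, W, D)        # a batch containing one image
y = conv(x)
print(y.shape)                     # torch.Size([1, 5, 26, 26]): W' x D' x 5
```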
Is it the fact that each kernel is initialised to different values that prevents each kernel from learning the same parameters? If not, what prevents each kernel from learning the same parameters?
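To illustrate the premise of the question, a small sketch (again assuming PyTorch) showing that each of the 5 kernels starts from different random values by default:

```python
import torch.nn as nn

conv = nn.Conv2d(in_channels=1, out_channels=5, kernel_size=3)
w = conv.weight        # shape (5, 1, 3, 3): one 3x3 kernel per output channel
print(w[0])            # initial values of kernel 0...
print(w[1])            # ...differ from kernel 1 (default initialisation is random)
```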
If the image is RGB instead of grayscale, so its dimension is WxDx3 instead of WxDx1, does this impact how the kernels learn patterns?
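For concreteness, a sketch (PyTorch assumed, as above) of how the kernel shape changes for RGB input: each kernel now spans all 3 input channels, while the number of output feature maps stays the same.

```python
import torch
import torch.nn as nn

conv_gray = nn.Conv2d(in_channels=1, out_channels=5, kernel_size=3)
conv_rgb  = nn.Conv2d(in_channels=3, out_channels=5, kernel_size=3)

print(conv_gray.weight.shape)      # torch.Size([5, 1, 3, 3])
print(conv_rgb.weight.shape)       # torch.Size([5, 3, 3, 3]): each kernel is 3x3x3

x = torch.randn(1, 3, 28, 28)      # one RGB image
print(conv_rgb(x).shape)           # torch.Size([1, 5, 26, 26]): still 5 feature maps
```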