I am a beginner and learning deep learning with baby steps. I have a question about designing the nets. I see in the papers, there are layers with different inputs/outputs and I do not know how to calculate/design before implementation.
For instance, in this paper, there are some numbers beside the schematic layers output (see the following figure). How these filter size and other parameters are being specified for a network with a specific image size as input.
or in another paper, they have the following design:
and they have mentioned, For a 256x256
input image,
the total sub-sampling factor of the network is 4
, resulting in
a 64x64xL array
, where L is the number of class labels. How this 64x64
size is being obtained?
How can I learn to design the net and calculates inputs/outputs of layers?
Thank you for any help