Convolutional layer:
1) Filters would have a width and height smaller than the input (e.g. 3x3) to apply the filter to the input volume.
2) Number of filters equals the number of channels in the output volume.
3) Stride is usually 1 to preserve spatial information.
Pooling layer:
1) Pooling filter width and height is usually 2x2.
2) Stride is usually the same as the filter size (2x2) to halve the spatial dimensions at each downsampling.
3)
Related topics: