1. Which parameter is usually adapted to the image size so that a convolutional layer can process images of varying sizes? Briefly explain.

    1. kernel size
    2. number of channels
    3. stride
  2. A.:

    The stride. Using stride=(k,m) amounts to dividing the image sizes by k and l, respectively. By careful choice of the stride according to input sizes, one can achieve an output of fixed size for this layer.