1. What is the main effect to be expected on the weights of a deep network when we use:

    1. L2 regularization?
    2. L1 regularization?

  2. A.:
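    1. With L2 regularization, we expect weights that are small in absolute value: the penalty shrinks every weight toward zero, but rarely makes any of them exactly zero.
    2. With L1 regularization, we expect sparse weights, that is, many of them exactly equal to zero, while the remaining ones can stay relatively large.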
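
The difference can be seen in a minimal sketch that applies only the penalty term to a few weights and ignores the data loss entirely (a simplifying assumption; in real training the loss gradient pushes back against the penalty). The names `l2_step`, `l1_step`, `lr`, and `lam` are illustrative, not taken from any library, and the L1 update uses the proximal (soft-thresholding) form, which is one common way to handle the non-differentiable point at zero.

```python
def l2_step(w, lr, lam):
    # Gradient step on (lam / 2) * w**2: the weight is scaled by a
    # constant factor, so it shrinks but never reaches exactly zero.
    return w * (1.0 - lr * lam)


def l1_step(w, lr, lam):
    # Proximal (soft-thresholding) step on lam * |w|: a constant amount
    # is subtracted from |w|, and the result is clipped at zero.
    shrunk = max(abs(w) - lr * lam, 0.0)
    return shrunk if w >= 0 else -shrunk


# Hypothetical starting weights and hyperparameters, chosen for illustration.
weights = [2.0, -0.5, 0.05, -0.01]
lr, lam, steps = 0.1, 0.5, 20

w_l2 = list(weights)
w_l1 = list(weights)
for _ in range(steps):
    w_l2 = [l2_step(w, lr, lam) for w in w_l2]
    w_l1 = [l1_step(w, lr, lam) for w in w_l1]

print("L2:", w_l2)  # every weight is smaller in magnitude, none is zero
print("L1:", w_l1)  # the three smallest weights are exactly zero
```

Under L2 the shrinkage is proportional to the weight, so small weights shrink slowly and stay nonzero. Under L1 the shrinkage is the same fixed amount for every weight, so small weights hit zero after a few steps while the large one survives.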