1. What is the difference between minimizing a function f mathematicaly, and minimizing f for deep learning purposes?

    A.:

    Minimizing f mathematically means finding the smallest value of f in its range (global minimum). Minimizing f in the context of deep learning usually means decreasing f a lot, not necessarily reaching a minimum, either local or global.