Comment by esafak
Overparameterized neural networks don't have that problem because there are no local maxima; there are many roads to Rome.
Overparameterized neural networks don't have that problem because there are no local maxima; there are many roads to Rome.