Gradient descent aligns the layers of deep linear networks
Presenter
February 21, 2019
Keywords:
- deep network
- linear network
- maximum margin
- optimization