Media Summary: Training Neural Networks Part 1 activation functions, weight initialization, gradient flow, batch normalization babysitting the ...
Cs231n Winter 2016 Lecture 3 Linear Classification 2 Optimization - Detailed Analysis & Overview
Training Neural Networks Part 1 activation functions, weight initialization, gradient flow, batch normalization babysitting the ...