Fitting batch norm into a neural network (part 3) [6:50 min]