ML Knowledge
Does the issue of vanishing gradient occur closer to the beginning or end of a neural network?
Interview question asked to Data Scientists interviewing at Eventbrite, Pinterest, NetApp and others: Does the issue of vanishing gradient occur closer to the beginning or end of a neural network?.