ML KnowledgeDoes the vanishing gradient problem manifest nearer to the first or last layer of a neural network?Was asked at