vanishing gradient problem

The vanishing gradient problem refers to the issue in artificial neural networks where the gradient (a measure of how the network's parameters should be adjusted during training) becomes extremely small as it propagates backwards from the output layer to the earlier layers. This leads to very slow or no learning in those earlier layers, making it difficult for the network to effectively learn and update its parameters.

Requires login.