Investigate how to use gradient clipping
Some code is already available in the current version to use gradient clipping, but it is not used during the training. We should investigate how we can use it and if it can improve the training.
Some code is already available in the current version to use gradient clipping, but it is not used during the training. We should investigate how we can use it and if it can improve the training.