Use CTC probability matrix to compute confidence scores when decoding with a language model
Confidence scores do not correlate with error rates when a language model is used.
The current implementation relies on torchaudio's ctc_decoder.
Instead, we should score the sequence from the initial CTC probability matrix: for each token in the language model's decoded output, take its CTC probability at the corresponding timestep, then average these probabilities.
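A minimal sketch of the proposed scoring, not the final implementation. It assumes the decoder output gives one token id and one timestep per decoded token (torchaudio's CTCHypothesis exposes `tokens` and `timesteps` fields that could feed this), and that the matrix holds probabilities, so log-probabilities would need to be exponentiated first. The function name `ctc_confidence` and the choice to return 0.0 for an empty hypothesis are hypothetical.

```python
from typing import Sequence


def ctc_confidence(
    probs: Sequence[Sequence[float]],
    tokens: Sequence[int],
    timesteps: Sequence[int],
) -> float:
    """Average the CTC probability of each decoded token at its timestep.

    probs:     CTC probability matrix, shape (num_timesteps, vocab_size).
    tokens:    token ids chosen by the LM decoder.
    timesteps: timestep index for each decoded token (same length as tokens).
    """
    if len(tokens) != len(timesteps):
        raise ValueError("tokens and timesteps must have the same length")
    if not tokens:
        # Assumption: an empty hypothesis gets zero confidence.
        return 0.0
    # Look up the acoustic-model probability for each (timestep, token) pair.
    picked = [probs[t][tok] for tok, t in zip(tokens, timesteps)]
    return sum(picked) / len(picked)


# Toy example: 3 timesteps, vocabulary of 3 tokens (index 0 as the blank).
probs = [
    [0.1, 0.8, 0.1],
    [0.7, 0.2, 0.1],
    [0.2, 0.2, 0.6],
]
# The decoder picked token 1 at timestep 0 and token 2 at timestep 2.
score = ctc_confidence(probs, tokens=[1, 2], timesteps=[0, 2])
print(score)
```

Because the score only reads the CTC matrix, it does not depend on the language model weight or the beam score, which is the point of the change.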
Ref: