Skip to content

Use CTC probability matrix to compute confidence scores when decoding with a language model

Confidence scores do not correlate with error rates when a language model is used.

The current implementation relies on torchaudio's ctc_decoder. Instead, we should score the sequence based on the initial CTC probability matrix, by averaging the probabilities for each timestep and token decoded by the language model.

Ref: