Display 5 worst predictions at the end of evaluation
We could use difflib to display the 5 worst predictions on stdout. You could also try using edlib to align the transcriptions.
Easier after #231 (closed)
We can sort the predictions by WER and process the 5 (hardcoded) worst ones. The WER is already computed on each batch (see the "wer" key of batch_metrics).
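A minimal sketch of the idea, using only the standard library. It assumes the per-sample WER, ground truth and prediction are available as tuples; the function names and the tuple layout are hypothetical and do not reflect the project's actual code or the real structure of batch_metrics.

```python
import difflib

N_WORST = 5  # hardcoded, as proposed in this issue


def worst_predictions(samples, n=N_WORST):
    """Return the n samples with the highest WER.

    `samples` is assumed to be an iterable of
    (wer, ground_truth, prediction) tuples (hypothetical layout).
    """
    return sorted(samples, key=lambda s: s[0], reverse=True)[:n]


def word_diff(ground_truth, prediction):
    """Word-level diff between ground truth and prediction.

    Lines start with "  " (same word), "- " (only in ground truth)
    or "+ " (only in prediction).
    """
    diff = difflib.ndiff(ground_truth.split(), prediction.split())
    # Drop the "? " hint lines that ndiff adds for near-matches.
    return [line for line in diff if not line.startswith("? ")]


def print_worst_predictions(samples, n=N_WORST):
    """Print the n worst predictions with their diff on stdout."""
    for wer, truth, pred in worst_predictions(samples, n):
        print(f"WER: {wer:.2%}")
        print("\n".join(word_diff(truth, pred)))
        print("-" * 20)
```

Calling print_worst_predictions once at the end of evaluation, on the samples collected across batches, would be enough. Swapping difflib for edlib would only change word_diff.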
Edited by Yoann Schneider