KaldiFormat extraction
Depends #26 (closed)
Implement a new class KaldiDataGenerator(DataGenerator)
that generates the dataset in kaldi format.
Add a run(self)
method on DataGenerator
that will be called on the atr-data-generator extract
command call.
def run(self):
transcriptions: List[Transcription] = self.get_line_transcriptions()
self.export(transcriptions)
def export(self, transcriptions):
raise NotImplementedError
The KaldiDataGenerator
will override the export
method with whatever code is needed to do that. IMO it's only one file export per transcription but I could be wrong.