Skip to content
Snippets Groups Projects
Commit 7540dd66 authored by Solene Tarride's avatar Solene Tarride
Browse files

Remove duplicate unknown token in tokens.txt

parent 14c391e5
No related branches found
No related tags found
No related merge requests found
......@@ -367,7 +367,6 @@ class ArkindexExtractor:
self.mapping.encode[token]
) if token in self.mapping.encode else self.language_tokens.append(token)
self.language_tokens.append(self.mapping.ctc.encoded)
self.language_tokens.append(self.unknown_token)
# Build LM corpus
train_corpus = [
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment