New mode to split prediction by starting tokens
We need a new way of splitting the text to create objects through attention maps. This time, the split will be done on NER starting tokens. This new Level could be called entity and exported under that name in the JSON files.
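To make the intended split concrete, here is a minimal sketch, not the actual DAN code. The function name `split_on_entity_tokens` and the start tokens used in the example are hypothetical; the real token list would come from the model's NER configuration. It cuts the predicted text at every NER starting token and keeps the character offset of each chunk, so that each chunk could later be matched with its attention weights.

```python
import re
from typing import List, Tuple


def split_on_entity_tokens(text: str, start_tokens: List[str]) -> List[Tuple[str, int]]:
    """Split text into chunks that each begin at an NER starting token.

    Returns (chunk, start_offset) pairs. Hypothetical helper for illustration.
    """
    if not start_tokens:
        # Nothing to split on: the whole text is a single chunk.
        return [(text, 0)] if text else []

    # Try longer tokens first so a multi-character token wins over its prefix.
    ordered = sorted(start_tokens, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(token) for token in ordered))
    starts = [match.start() for match in pattern.finditer(text)]

    # Text before the first starting token belongs to no entity;
    # keep it as its own leading chunk.
    if not starts or starts[0] != 0:
        starts.insert(0, 0)

    ends = starts[1:] + [len(text)]
    return [(text[s:e], s) for s, e in zip(starts, ends) if e > s]


# Example with made-up starting tokens (assumption, not DAN's vocabulary).
chunks = split_on_entity_tokens("ⓃJohn ⓈDoe Ⓓ1920", ["Ⓝ", "Ⓢ", "Ⓓ"])
```

Each chunk keeps its starting token, so the entity type can still be read from the first character when the object is exported.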
You'll need to update:
- https://gitlab.teklia.com/atr/dan/-/blob/main/dan/ocr/predict/attention.py?ref_type=heads#L47
- https://gitlab.teklia.com/atr/dan/-/blob/main/dan/ocr/predict/attention.py?ref_type=heads#L74
- and expose the parameter
You can test by making a prediction with attention_map=True, viewing the resulting GIF, and inspecting the output JSON.
Edited by Yoann Schneider