Filter entities by name when extracting data from Arkindex
Sometimes we don't want to extract all entities. It's already partially done (https://gitlab.com/teklia/atr/dan/-/blob/main/dan/datasets/extract/extract.py#L110) but right now we raise an Error in this case. We could add a flag (--allow-unknown
) to ignore this error if we want to ignore some entities (we only need to not specify any tokens for them).
Edited by Yoann Schneider