Merge branch 'poc-entities-xml' into 'main'
Refactor entities extraction with lxml See merge request !316
Showing
- dan/datasets/extract/__init__.py 0 additions, 2 deletionsdan/datasets/extract/__init__.py
- dan/datasets/extract/arkindex.py 21 additions, 65 deletionsdan/datasets/extract/arkindex.py
- dan/datasets/extract/db.py 11 additions, 4 deletionsdan/datasets/extract/db.py
- dan/datasets/extract/exceptions.py 0 additions, 15 deletionsdan/datasets/extract/exceptions.py
- dan/datasets/extract/utils.py 169 additions, 15 deletionsdan/datasets/extract/utils.py
- docs/css/ner.css 42 additions, 0 deletionsdocs/css/ner.css
- docs/usage/datasets/extract.md 56 additions, 27 deletionsdocs/usage/datasets/extract.md
- mkdocs.yml 3 additions, 0 deletionsmkdocs.yml
- requirements.txt 1 addition, 0 deletionsrequirements.txt
- tests/conftest.py 65 additions, 0 deletionstests/conftest.py
- tests/data/entities.yml 4 additions, 0 deletionstests/data/entities.yml
- tests/data/tokens/end_tokens.yml 15 additions, 3 deletionstests/data/tokens/end_tokens.yml
- tests/data/tokens/no_end_tokens.yml 15 additions, 3 deletionstests/data/tokens/no_end_tokens.yml
- tests/test_db.py 11 additions, 12 deletionstests/test_db.py
- tests/test_extract.py 110 additions, 191 deletionstests/test_extract.py
Loading
Please register or sign in to comment