Skip to content
Snippets Groups Projects
README.md 2.89 KiB
Newer Older
Denis Coquenet's avatar
Denis Coquenet committed
# DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition
[![Python >= 3.10](https://img.shields.io/badge/Python-%3E%3D3.10-blue.svg)](https://www.python.org/downloads/release/python-3100/)

Yoann Schneider's avatar
Yoann Schneider committed
For more details about this package, make sure to see the documentation available at <https://atr.pages.teklia.com/dan/>.
For development and tests purpose it may be useful to install the project as a editable package with pip.

- Use a virtualenv (e.g. with virtualenvwrapper `mkvirtualenv -a . dan`)
- Install `dan` as a package (e.g. `pip install -e .`)
Code syntax is analyzed before submitting the code.\
To run the linter tools suite you may use pre-commit.
```shell
pip install pre-commit
pre-commit run -a
```

### Run tests
Tests are executed with `tox` using [pytest](https://pytest.org).
To install `tox`,
```shell
pip install tox
tox
```

To reload the test virtual environment you can use `tox -r`

Run a single test module: `tox -- <test_path>`
Run a single test: `tox -- <test_path>::<test_function>`

The tests use a large file stored via [Git-LFS](https://docs.gitlab.com/ee/topics/git/lfs/). Make sure to run `git-lfs pull` before running them.

### Update documentation

Please keep the documentation updated when modifying or adding features.
It's pretty easy to do:
```shell
pip install -r doc-requirements.txt
mkdocs serve
```

You can then write in Markdown in the relevant `docs/*.md` files, and see live output on <http://localhost:8000>.
## Inference
Mélodie Boillet's avatar
Mélodie Boillet committed

To apply DAN to an image, one needs to first add a few imports and to load an image. Note that the image should be in RGB.
Mélodie Boillet's avatar
Mélodie Boillet committed
```python
import cv2
from dan.ocr.predict.prediction import DAN
Mélodie Boillet's avatar
Mélodie Boillet committed

image = cv2.cvtColor(cv2.imread(IMAGE_PATH), cv2.COLOR_BGR2RGB)
```

Then one can initialize and load the trained model with the parameters used during training.
Mélodie Boillet's avatar
Mélodie Boillet committed
```python
model_path = "model.pt"
params_path = "parameters.yml"
charset_path = "charset.pkl"
model = DAN("cpu")
Mélodie Boillet's avatar
Mélodie Boillet committed
model.load(model_path, params_path, charset_path, mode="eval")
```

To run the inference on a GPU, one can replace `cpu` by the name of the GPU. In the end, one can run the prediction:
Mélodie Boillet's avatar
Mélodie Boillet committed
```python
text, confidence_scores = model.predict(image, confidences=True)
```
## Training

This package provides three subcommands. To get more information about any subcommand, use the `--help` option.

Yoann Schneider's avatar
Yoann Schneider committed
See the [dedicated section](https://atr.pages.teklia.com/dan/get_started/training/) on the official DAN documentation.
### Data extraction from Arkindex
Yoann Schneider's avatar
Yoann Schneider committed
See the [dedicated section](https://atr.pages.teklia.com/dan/usage/datasets/extract/) on the official DAN documentation.
### Model training

Yoann Schneider's avatar
Yoann Schneider committed
See the [dedicated section](https://atr.pages.teklia.com/dan/usage/train/) on the official DAN documentation.
Yoann Schneider's avatar
Yoann Schneider committed
See the [dedicated section](https://atr.pages.teklia.com/dan/usage/predict/) on the official DAN documentation.