# ATR Data generator

Create datasets from [Arkindex](https://demo.arkindex.org), a platform developed by [Teklia](https://teklia.com) to train **Automatic Text Recognition** pipelines.

After installing this Python package, you will gain access to the `atr-data-generator` command. To learn more about it and its subcommands, run `atr-data-generator --help`.

Both subcommands use a YAML configuration file, provided via the `--config` parameter. More details about the structure of this configuration file are available in the respective section. Every run will export both a `config.yaml` file and a `param.json` file that can be used to reproduce the data generation.

See the [Development](dev.md) section to learn how to contribute to this project.