Skip to content

Export Datasets in corpora exports

Datasets should be exported in corpora exports so that we can manipulate them in https://gitlab.teklia.com/arkindex/export

We'll need a new version of the export, even though it is backward compatible (only new tables are added).

A table dataset is added with columns:

  • ID
  • name
  • state
  • sets (strings joined by comma)

A table dataset_element is added with columns:

  • ID
  • element_id
  • dataset_id
  • set

The load_export django command should also support these new tables, simply creating a new dataset & assigning elements in each relevant set.

Edited by Bastien Abadie