Format datasets and add datasets documentation for doc-ufcn
This is not an issue per se, but a recommendation.
- How about adding a script to format a dataset of images + polygons into a formatted `doc-ufcn` dataset?
- How about adding a script that takes a ground truth in PAGE XML and/or ALTO XML, comprising images + XML files, extracts the lines, cuts the lines from the image files and formats a `doc-ufcn` dataset?
- Documentation on `doc-ufcn` dataset formatting is unfortunately missing; in particular, nothing is clear about `classes_colors` of `classes_names`.
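
To make the request more concrete, here is a minimal sketch of what such a script could do for PAGE XML: read the line polygons, then paint them into a label mask with one colour per class. It is an illustration under assumptions, not the actual `doc-ufcn` format, which is exactly what is undocumented: the values of `CLASSES_NAMES` and `CLASSES_COLORS`, the idea that a label is an RGB mask, and all function names are hypothetical. Only the Python standard library is used, so no image file is read or written.

```python
import xml.etree.ElementTree as ET

# Hypothetical class configuration; the real values depend on your doc-ufcn setup.
CLASSES_NAMES = ["background", "text_line"]
CLASSES_COLORS = [(0, 0, 0), (0, 0, 255)]


def local_name(tag):
    """Drop the '{namespace}' prefix so any PAGE schema version matches."""
    return tag.rsplit("}", 1)[-1]


def page_line_polygons(page_xml):
    """Return one [(x, y), ...] polygon per TextLine in a PAGE XML string."""
    polygons = []
    for line in ET.fromstring(page_xml).iter():
        if local_name(line.tag) != "TextLine":
            continue
        # Only look at direct children, so Word-level Coords are ignored.
        for child in line:
            if local_name(child.tag) == "Coords" and child.get("points"):
                polygons.append([
                    tuple(int(float(v)) for v in pair.split(","))
                    for pair in child.get("points").split()
                ])
                break
    return polygons


def point_in_polygon(x, y, polygon):
    """Even-odd rule test for the point (x, y)."""
    inside = False
    for (x1, y1), (x2, y2) in zip(polygon, polygon[1:] + polygon[:1]):
        if (y1 > y) != (y2 > y) and x < x1 + (y - y1) * (x2 - x1) / (y2 - y1):
            inside = not inside
    return inside


def rasterize(polygons, width, height, class_name="text_line"):
    """Paint polygons onto a height x width grid of RGB tuples."""
    color = CLASSES_COLORS[CLASSES_NAMES.index(class_name)]
    mask = [[CLASSES_COLORS[0]] * width for _ in range(height)]
    for polygon in polygons:
        for y in range(height):
            for x in range(width):
                # Sample each pixel at its centre.
                if point_in_polygon(x + 0.5, y + 0.5, polygon):
                    mask[y][x] = color
    return mask
```

A real script would more likely draw the polygons with an imaging library such as Pillow and save the mask next to the source image; the per-pixel loop above is only meant to show the logic.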
Personally, I wrote some scripts that do that (raw images + polygons to `doc-ufcn`, ALTO XML and PAGE XML to `doc-ufcn`), even though I'm not sure I did it right.
Just a thought.
Edited by Teodor Bors