Format datasets and add datasets documentation for doc-ufcn
Is not an issue per se but a recommendation.
-
How about adding a script to format a dataset of
images+polygonsto a formatteddoc-ufcndataset? -
How about adding a script to format a ground truth in
PAGE xmland/orALTO xmlcomprisingimages+xml files, extract the lines, cut the lines from the image files and format adoc-ufcndataset? -
doc-ufcndataset formatting documentation, unfortunatelly, is missing, especially nothing is clear aboutclasses_colorsofclasses_names.
Personally I wrote some scripts that do that (raw images + polygons to doc-ufcn, ALTO XML and PAGE XML to doc-ufcn) even though I'm not sure if I did it right.
Just a thought.
Edited by Teodor Bors