DAN-P4: Improve data loading and preprocessing

See https://redmine.teklia.com/issues/4196

Unstarted Issues (open and unassigned)

Ongoing Issues (open and assigned)

Completed Issues (closed)

Fix call to seed_worker
#132 P1
Fix valid batch size to 1
#129 P1
Set default value to None for --image_max_width during prediction
#128 P1
Add a prediction test for process_image function
#126 P1
Debug/remove PiecewiseAffine augmentation transform
#125 P1
Load image using torch + use training pre-processing function during prediction
#122 P2
Merge DatasetManager / GenericDataset / OCRDatasetManager / OCRDataset classes
#120 P2
Pre-process the images immediately after loading them.
#114 P2
Use default mean and std values?
#111 P2
Compute mean and std only if training from scratch
#110
Indicate python version compatibility
#107 P3 Quick Win
Remove the remove_linebreaks parameter from training configuration
#106 Quick Win
Remove DPIAdjusting transform
#105 Quick Win
Use a single padding method
#104 P2
Directly read images using torch
#102 P2
Utils pairwise function can be replaced by itertools pariwise
#101 Quick Win
Simplify mean and std computation
#100 P2
Remove randint, rand, rand_uniform and round_floats from utils.py
#99 Quick Win
Use torchvision functions / transforms for data augmentation
#98 P2
Remove normalize parameter from training configuration
#97 Quick Win
Remove padding value and padding token parameters from training configuration
#96 Quick Win
Remove add_eot and add_sot parameters from training configuration
#95 Quick Win
Remove width_divisor and height_divisor parameters from training configuration
#94 Quick Win
Extract data in sub-resolution
#93 P2
Move OCR utils functions to utils.py
#92 Quick Win
Remove deepcopy from OCR code
#58 P2