Load a language model and decode with it during evaluation
Compare changes
+4 −3
@@ -21,7 +21,8 @@ from torch.nn.parallel import DistributedDataParallel as DDP
@@ -74,11 +75,11 @@ class GenericTrainingManager: