Evaluation: Specify sets to evaluate
We mostly want to evaluate on validation and test sets. Rarely on train
set.
The list of sets to evaluate should be parametrized behind a --sets
parameter to the evaluate command.
Defaults should use all sets.
Edited by Yoann Schneider