1.18.67

fhieber released this 21 Dec 10:55

[1.18.67]

Added

Added fertility as a further type of attention coverage.
Added an option for training to keep the initializations of the model via --keep-initializations. When set, the trainer will avoid deleting the params file for the first checkpoint, no matter what --keep-last-params is set to.

[1.18.66]

Fixed

Fixed the set of argument names that are allowed to differ when resuming training.

[1.18.65]

Changed

More informative error message about inconsistent --shared-vocab setting.

[1.18.64]

Added

Added translation sampling via --sample [N]. When set, the decoder samples the next word from the target distribution at each timestep. An optional value of N restricts sampling to the top N vocabulary items for each hypothesis at each timestep (the default is 0, meaning to sample from the entire vocabulary).
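
To illustrate the idea behind top-N sampling, here is a minimal standalone sketch in plain Python. It is not Sockeye's implementation: the function name `sample_next_token` and its parameters are hypothetical, and it operates on a single probability list rather than on batched decoder outputs.

```python
import random


def sample_next_token(probs, top_n=0, rng=random):
    """Sample one vocabulary index from `probs` (hypothetical helper).

    probs: list of probabilities, one per vocabulary item.
    top_n: if > 0, sample only among the top_n most probable items;
           0 means sample from the entire vocabulary.
    rng:   the `random` module or a `random.Random` instance.
    """
    candidates = list(range(len(probs)))
    if top_n > 0:
        # Keep only the indices of the top_n most probable items.
        candidates = sorted(candidates, key=lambda i: probs[i], reverse=True)[:top_n]
    weights = [probs[i] for i in candidates]
    # random.choices treats the weights as relative, so the truncated
    # distribution does not need to be renormalized by hand.
    return rng.choices(candidates, weights=weights, k=1)[0]
```

With `top_n=1` the sketch always returns the most probable index, while `top_n=0` can return any index with nonzero probability.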

[1.18.63]

Changed

The checkpoint decoder and nvidia-smi subprocess are now launched from a forkserver, allowing for a better separation between processes.
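
For readers unfamiliar with the forkserver start method, the following sketch shows the general pattern using Python's standard `multiprocessing` module. It is an assumption-laden illustration, not Sockeye's code: the helper names `get_isolated_context` and `launch_isolated` are hypothetical, and the forkserver start method is only available on Unix-like platforms.

```python
import multiprocessing as mp


def get_isolated_context():
    """Return a multiprocessing context that uses the forkserver start method."""
    # Children are forked from a small, clean server process rather than
    # from the (potentially large, multi-threaded) parent process.
    return mp.get_context("forkserver")


def launch_isolated(target, args=()):
    """Start `target(*args)` in a child process created via the forkserver."""
    ctx = get_isolated_context()
    proc = ctx.Process(target=target, args=args)
    proc.start()
    return proc
```

In a script, `launch_isolated` would typically be called under an `if __name__ == "__main__":` guard with a module-level function as the target, followed by `proc.join()`.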

[1.18.62]

Added

Added an option to create TranslatorInputs directly from a dict.
