Skip to content

1.18.67

Compare
Choose a tag to compare
@fhieber fhieber released this 21 Dec 10:55

[1.18.67]

Added

  • Added fertility as a further type of attention coverage.
  • Added an option for training to keep the initializations of the model via --keep-initializations. When set, the trainer will avoid deleting the params file for the first checkpoint, no matter what --keep-last-params is set to.

[1.18.66]

Fixed

  • Fix to argument names that are allowed to differ for resuming training.

[1.18.65]

Changed

  • More informative error message about inconsistent --shared-vocab setting.

[1.18.64]

Added

  • Adding translation sampling via --sample [N]. This causes the decoder to sample each next step from the target distribution probabilities at each
    timestep. An optional value of N causes the decoder to sample only from the top N vocabulary items for each hypothesis at each timestep (the
    default is 0, meaning to sample from the entire vocabulary).

[1.18.63]

Changed

  • The checkpoint decoder and nvidia-smi subprocess are now launched from a forkserver, allowing for a better separation between processes.

[1.18.62]

Added

  • Add option to make TranslatorInputs directly from a dict.