semi/ptm training is excessively, probably unnecessarily slow #42

Open
dhdaines opened this issue Aug 5, 2022 · 0 comments

dhdaines commented Aug 5, 2022

In theory, semi-continuous and phonetically tied mixture (PTM) models are supposed to be fast! But training them is incredibly slow, especially the initial flat start. This is most likely due to redundant or inefficient computation in the training code.

Training a 128-Gaussian PTM model on 100 hours of data on 16 CPUs takes approximately 4 hours, whereas training a 4000-senone, 16-Gaussian continuous model with LDA and MLLT takes only 1 hour 25 minutes (without LDA and MLLT it's under an hour).
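
To put those numbers side by side, here is a rough back-of-envelope sketch. It assumes (this is not stated above) that the PTM model keeps one 128-Gaussian codebook per context-independent phone and that the phone set has roughly 40 phones; under those assumptions the PTM model has an order of magnitude fewer Gaussians than the continuous one, yet trains almost three times slower.

```python
# Back-of-envelope comparison of the two training runs described above.
# Assumptions (not stated in the issue): one 128-Gaussian codebook per
# context-independent phone, and a phone set of ~40 phones.

ptm_gaussians = 40 * 128          # ~5,120 densities to evaluate/re-estimate
continuous_gaussians = 4000 * 16  # 64,000 densities

print(f"PTM Gaussians:          {ptm_gaussians:,}")
print(f"Continuous Gaussians:   {continuous_gaussians:,}")
print(f"Ratio (continuous/PTM): {continuous_gaussians / ptm_gaussians:.1f}x")

# Observed wall-clock training times on the same 100 h / 16 CPU setup:
ptm_hours = 4.0
continuous_hours = 1 + 25 / 60
print(f"Observed slowdown (PTM/continuous): {ptm_hours / continuous_hours:.1f}x")
```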

And of course the accuracy of said PTM model is quite atrocious.

One might argue that these models are thoroughly obsolete, but they are arguably the only remaining reason to use CMU Sphinx, since they produce very small models.
