Skip to content

Revert "Optionally use flash-attn's CE loss for metrics (#3394)" #6059

Revert "Optionally use flash-attn's CE loss for metrics (#3394)"

Revert "Optionally use flash-attn's CE loss for metrics (#3394)" #6059