Skip to content

Support multi-query attention for encoder-decoder models #2270

Support multi-query attention for encoder-decoder models

Support multi-query attention for encoder-decoder models #2270