What are the respective roles of ProcessKV and FlattenKV during prefill computation? #2063
Unanswered
sitabulaixizawaluduo asked this question in Q&A
Replies: 1 comment 3 replies
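Is it correct to understand that ProcessKV stores the K and V computed by qkv_proj into the cache blocks, and that FlattenKV converts the block-based KV cache into one contiguous KV cache that takes part in the subsequent attention computation?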
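To make the question concrete, here is a minimal Python sketch of the understanding described above. It is only an illustration of the idea, not the library's actual kernels: the names `process_kv`, `flatten_kv`, `BLOCK_SIZE`, and `block_table` are hypothetical, and real implementations work on GPU tensors rather than Python lists.

```python
# Hypothetical illustration of a paged KV cache; not the real ProcessKV/FlattenKV code.
BLOCK_SIZE = 4  # tokens per cache block (assumed value)


def process_kv(blocks, block_table, start, new_kv):
    """Scatter per-token (k, v) entries into the block-based cache.

    blocks:      physical cache blocks, each a list of BLOCK_SIZE slots
    block_table: maps a sequence's logical block index to a physical block id
    start:       position of the first new token within the sequence
    new_kv:      per-token (k, v) entries, e.g. the output of the QKV projection
    """
    for i, kv in enumerate(new_kv):
        pos = start + i
        block_id = block_table[pos // BLOCK_SIZE]
        blocks[block_id][pos % BLOCK_SIZE] = kv


def flatten_kv(blocks, block_table, seq_len):
    """Gather one sequence's (k, v) entries from its blocks into a contiguous list."""
    return [
        blocks[block_table[pos // BLOCK_SIZE]][pos % BLOCK_SIZE]
        for pos in range(seq_len)
    ]


# Usage: a 6-token sequence stored in two non-adjacent physical blocks.
blocks = [[None] * BLOCK_SIZE for _ in range(4)]
block_table = [2, 0]
new_kv = [(f"k{i}", f"v{i}") for i in range(6)]

process_kv(blocks, block_table, 0, new_kv)
flat = flatten_kv(blocks, block_table, 6)
```

In this sketch, `flat` holds the six entries in sequence order even though they live in physical blocks 2 and 0, which is the "block cache to contiguous cache" step the question asks about.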