[CINN][Infer Symbolic Shape BUAA] Add flashmask_attention op #68385
Conversation
@@ -1543,6 +1543,84 @@ bool FlashAttnOpInferSymbolicShape(
// return true;
// }

bool FlashmaskAttentionOpInferSymbolicShape(
Compare this with FlashAttnOpInferSymbolicShape and check whether it can be reused directly.
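If the two inferences do turn out to match, a minimal delegation sketch (hypothetical; it assumes flashmask_attention shares flash_attn's input layout and output set, which this review has not confirmed):

```cpp
// Hypothetical reuse, assuming flashmask_attention's q/k/v inputs and its
// out/softmax/softmax_lse/seed_offset outputs mirror those of flash_attn.
bool FlashmaskAttentionOpInferSymbolicShape(
    pir::Operation *op, pir::InferSymbolicShapeContext *infer_context) {
  return FlashAttnOpInferSymbolicShape(op, infer_context);
}
```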
  PADDLE_ENFORCE_EQ(
      infer_context->IsEqual(startend_row_indices[3], symbol::DimExpr{1}) ||
          infer_context->IsEqual(startend_row_indices[3],
                                 symbol::DimExpr{2}) ||
          infer_context->IsEqual(startend_row_indices[3], symbol::DimExpr{4}),
      true,
      common::errors::InvalidArgument(
          "flashmask_attention startend_row_indices "
          "mask_bounds must be in [1, 2, 4]"));
Please remove this enforce for now; equality checks inside control flow are not yet supported at compile time. If you do want to write it, you first need to check whether startend_row_indices[3] is an int type.
Deleted.
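Should the check be re-added later, a rough sketch of the guard the reviewer describes — assuming symbol::DimExpr offers isa<>/dyn_cast<> for concrete int64_t dims, which is an assumption about the PIR symbol API:

```cpp
// Hypothetical guard: only enforce the bound when the dim is a concrete
// integer, since symbolic equality cannot yet be decided at compile time.
if (startend_row_indices[3].isa<std::int64_t>()) {
  const std::int64_t bounds = startend_row_indices[3].dyn_cast<std::int64_t>();
  PADDLE_ENFORCE_EQ(bounds == 1 || bounds == 2 || bounds == 4,
                    true,
                    common::errors::InvalidArgument(
                        "flashmask_attention startend_row_indices "
                        "mask_bounds must be in [1, 2, 4], but got %lld.",
                        bounds));
}
```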
  auto batch_size_expr = q.shape()[0];
  auto num_heads_expr = q.shape()[2];
  auto seqlen_q_rounded_expr = round_multiple(q.shape()[1]);
  auto seqlen_k_rounded_expr = round_multiple(k.shape()[1]);

  if (op->result(1)) {
    std::vector<symbol::DimExpr> softmax_shape{batch_size_expr,
                                               num_heads_expr,
                                               seqlen_q_rounded_expr,
                                               seqlen_k_rounded_expr};
    infer_context->SetShapeOrDataForValue(
        op->result(1), symbol::TensorShapeOrDataDimExprs(softmax_shape));
  }
  if (op->result(2)) {
    std::vector<symbol::DimExpr> softmax_lse_shape{
        batch_size_expr, num_heads_expr, seqlen_q_rounded_expr};
    infer_context->SetShapeOrDataForValue(
        op->result(2), symbol::TensorShapeOrDataDimExprs(softmax_lse_shape));
  }
  if (op->result(3)) {
    std::vector<symbol::DimExpr> seed_offset_shape{symbol::DimExpr{2}};
    infer_context->SetShapeOrDataForValue(
        op->result(3), symbol::TensorShapeOrDataDimExprs(seed_offset_shape));
  }
  return true;
This inference logic has no corresponding basis in the kernel, the infermeta, or FlashAttnOpInferSymbolicShape.
It is similar to Liu Xudong's implementation.
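For context, the round_multiple helper this block relies on presumably mirrors the one used by FlashAttnOpInferSymbolicShape to pad the softmax sequence lengths; a sketch, where the factor 128 is an assumption taken from the flash-attention kernel convention rather than from this PR:

```cpp
// Hypothetical helper: round a sequence-length DimExpr up to the next
// multiple of 128, matching how flash-attention pads its softmax shape.
auto round_multiple = [](const symbol::DimExpr &x) {
  symbol::DimExpr m{128};
  return (x + m - symbol::DimExpr{1}) / m * m;  // ceil(x / m) * m
};
```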
Sorry to inform you that 266fe33's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.
LGTM
PR Category
CINN
PR Types
Improvements
Description
Add the symbolic shape inference interface for the flashmask_attention op.
Unit tests found:
/test/legacy_test/test_flashmask.py
/test/legacy_test/test_flash_attention.py
Only unit tests exist.
Pcard-67164