
[Transforms] Add constant_tensors_folding pass #74

Open — wants to merge 40 commits into base: main
Conversation

niuxiaog
Contributor

@niuxiaog niuxiaog commented May 15, 2024

This PR implements the constant_tensors_folding pass (RFC: PR #183) from issue #56 and issue #146:

  • The input MLIR entry function is split into two functions: entry() and runtime_fold(). runtime_fold() contains the operations whose input and output tensors are all constant. The new entry() contains the remaining operations, which depend on variable values and on the folded constant tensors.
  • When needed, compile_time_fold() can be enabled to fold constant tensors at compile time.
  • A constant tensor may be fully folded by several sequential operations. If full folding would increase the data size dramatically (e.g., through a BroadcastOp), partial folding is chosen instead.
  • A constant BroadcastOp is swapped with the constant operations after it, so that those operations can be included in partial folding. Currently there are strong constraints on these ops to ensure correctness.
  • Information necessary for runtime execution is added to the MLIR module as GlobalOps.
  • During the pass, the buffers for storing folded tensors are allocated using the APIs provided by the constant cache manager, which will be implemented in another PR: [Runtime] Constant cache manager and runtime pipeline #342.
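
As a sketch of the split described above (with hypothetical op names and shapes — not the exact IR the pass emits), a module might be rewritten roughly like this:

```mlir
// Before: a single entry; %w is constant, %x is a runtime input.
func.func @entry(%x: tensor<64x32xf32>) -> tensor<64x32xf32> {
  %w = arith.constant dense<1.0> : tensor<32x32xf32>
  // Constant-only computation: foldable ahead of time.
  %folded = "some.const_only_op"(%w) : (tensor<32x32xf32>) -> tensor<32x32xf32>
  // Depends on the variable input %x: must stay in entry().
  %y = "some.op"(%x, %folded) : (tensor<64x32xf32>, tensor<32x32xf32>) -> tensor<64x32xf32>
  return %y : tensor<64x32xf32>
}

// After: the constant subgraph moves into runtime_fold(), whose result is
// cached; entry() consumes the folded tensor instead of recomputing it.
func.func @runtime_fold() -> tensor<32x32xf32> {
  %w = arith.constant dense<1.0> : tensor<32x32xf32>
  %folded = "some.const_only_op"(%w) : (tensor<32x32xf32>) -> tensor<32x32xf32>
  return %folded : tensor<32x32xf32>
}
func.func @entry(%x: tensor<64x32xf32>, %folded: tensor<32x32xf32>) -> tensor<64x32xf32> {
  %y = "some.op"(%x, %folded) : (tensor<64x32xf32>, tensor<32x32xf32>) -> tensor<64x32xf32>
  return %y : tensor<64x32xf32>
}
```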

@niuxiaog niuxiaog force-pushed the xgniu/constant_weights_folding branch from acf3ae8 to 94f2813 Compare June 3, 2024 06:39
@niuxiaog niuxiaog force-pushed the xgniu/constant_weights_folding branch from a0ddebe to d7663a5 Compare June 4, 2024 03:00
@niuxiaog niuxiaog force-pushed the xgniu/constant_weights_folding branch from 387523a to d8d2d79 Compare August 20, 2024 06:20
@niuxiaog niuxiaog requested review from zhczhong, Menooker, AndreyPavlenko, ciyongch and ZhennanQin and removed request for zhczhong September 14, 2024 03:42
@niuxiaog niuxiaog changed the title [Transforms] Add constant_weights_folding pass [Transforms] Add constant_tensors_folding pass Sep 14, 2024
void ConstantSubgraphAnalysis::runOnOperation() {
  Operation *op = getOperation();
  auto &func =
      op->getRegions().front().getBlocks().front().getOperations().front();
Contributor
Is there any shortcut for this kind of operation?

@@ -53,6 +53,8 @@ void populateTensorPasses(mlir::OpPassManager &pm) {
// todo: padding propagation pass
// todo: layout propagation pass
// todo: tensor constant propagation pass
pm.addPass(createConstantSubgraphAnalysisPass());
pm.addPass(createConstantTensorFoldingPass());
Contributor
Shall we combine these two passes into one, and provide an option to do the analysis only if needed?


I feel the same. Maybe we should put the analysis into the pass, unless the const-subgraph-analysis is needed by more than one pass.

Contributor Author

OK, I will put them into one.

@niuxiaog niuxiaog linked an issue Sep 24, 2024 that may be closed by this pull request
@niuxiaog
Contributor Author

Though the PR is ready_to_review, we need PR #342 and benchgc's support to fully enable this feature. For OpenVINO integration, this feature can be enabled by modifying a file.

Successfully merging this pull request may close these issues.

const weight packing support