WIP - Switching to data-tiling fusion. #21441

hanhanW · 2025-07-21T23:08:26Z

No description provided.

The compiler's static shape inference is aggressive enough that it can produce partially dynamic shapes during lowering. This makes data-tiling fusion struggle: the tensors are expected to have dynamic shapes, but some dimensions are inferred to be static in the Stream AnnotateDispatchAssumptions pass. The result is a `tensor.cast -> set_encoding -> tensor.cast` sequence in a dispatch, whereas we expect the bindings to have encoded tensor types. E.g.,

Input IR:

```mlir
%0 = iree_tensor_ext.dispatch_load ... tensor<?x?xi8>
%1 = set_encoding %0 : tensor<?x?xi8> -> tensor<?x?xi8, #encoding>
iree_tensor_ext.dispatch_store %1, ... tensor<?x?xi8, #encoding> -> ... tensor<?x?xi8, #encoding>
```

After annotation:

```mlir
%0 = iree_tensor_ext.dispatch_load ... tensor<4x?xi8>
%cast = tensor.cast %0 : tensor<4x?xi8> to tensor<?x?xi8>
%1 = set_encoding %cast : tensor<?x?xi8> -> tensor<?x?xi8, #encoding>
%cast_0 = tensor.cast %1 : tensor<?x?xi8, #encoding> to tensor<4x5xi8>
iree_tensor_ext.dispatch_store %cast_0, ... tensor<4x5xi8> -> ... tensor<?x?xi8, #encoding>
```

It is hard to materialize the encodings when a cast op is present. Given that the original goal is to test dynamic shapes, modifying the input program is the easier fix.

The issue is observed from #21441.

Signed-off-by: hanhanW <hanhan0912@gmail.com>
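As a rough illustration of what "modifying the input program" could look like, the sketch below feeds a test with `flow.tensor.dynamic_constant` (the op named in the title of #21461). This is an assumption about the shape of the fix, not the actual test change: the function name, element values, and the 4x5 shape are hypothetical, and the op syntax is written from memory of the Flow dialect, so check it against the dialect reference.

```mlir
// Hypothetical test input; the name, values, and shape are illustrative only.
func.func @dynamic_input_example() -> tensor<?x?xi8> {
  // The constant data is statically shaped (4x5), but the result type is
  // fully dynamic. The intent is that the compiler treats the value like a
  // dynamically shaped user input, so later passes should not refine
  // individual dimensions back to static sizes.
  %0 = flow.tensor.dynamic_constant dense<1> : tensor<4x5xi8> -> tensor<?x?xi8>
  return %0 : tensor<?x?xi8>
}
```

With an input like this, the dispatch containing `set_encoding` would be expected to keep `tensor<?x?xi8>` on both sides, avoiding the `tensor.cast` pair shown above.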

Signed-off-by: hanhanW <hanhan0912@gmail.com>

- Rename it to `materialize_encoding_vmvx.mlir` to follow the naming convention.
- Delete the legacy `tensor.extract_slice` op from tests, because UnSetEncodingOp has slicing semantics.

Signed-off-by: hanhanW <hanhan0912@gmail.com>

Signed-off-by: hanhanW <hanhan0912@gmail.com>

It also replaces uses of the `TileAndFuse` pass with the `TileRootAndFuseProducerConsumer` pass, which may affect other dispatches that use DoubleTilingExpert, e.g., dispatches of generic ops.

Signed-off-by: hanhanW <hanhan0912@gmail.com>

This was referenced Jul 21, 2025

[DT] Disable early materialization by default. #20323

Closed

[NFC] Switch dynamic inputs to flow.tensor.dynamic_constant. #21461

Merged

hanhanW force-pushed the dt-fusion-default branch from a2380b6 to 012f3a0 Compare July 23, 2025 18:02

Cherry-pick "Restrict linalg.pack to not have extra padding sizes."

ee9c0e3

Signed-off-by: hanhanW <hanhan0912@gmail.com>

hanhanW force-pushed the dt-fusion-default branch from 197faec to 01363d8 Compare July 24, 2025 18:27

hanhanW added 8 commits July 24, 2025 13:53

[GPU][NFC] Delete unused legacy LLVMGPUTensorPad pass.

b5fbec6

Signed-off-by: hanhanW <hanhan0912@gmail.com>

Cherry-pick "[mlir][linalg] Remove artificial padding check.".

387c374

Signed-off-by: hanhanW <hanhan0912@gmail.com>

[CPU] Switch CPUDoubleTilingExpert pipeline to use IREE::CPU::Lowerin…

7891078

…gConfigAttr. Signed-off-by: hanhanW <hanhan0912@gmail.com>

WIP - Switching to data-tiling fusion.

2b3c1f4

Signed-off-by: hanhanW <hanhan0912@gmail.com>

Cherry-pick: [Codegen] Convert matmul that takes readonly init to a c…

090fd21

…anonical form. Signed-off-by: hanhanW <hanhan0912@gmail.com>

Cherry-pick: [CPU] Add CombineLayoutTransformation passes after distr…

efb04d1

…ibution passes. Signed-off-by: hanhanW <hanhan0912@gmail.com>

hanhanW force-pushed the dt-fusion-default branch from 01363d8 to efb04d1 Compare July 24, 2025 23:23
