AMDGPU: Move some code out of macro for defining regclass decoder #155755

arsenm · 2025-08-28T05:41:31Z

Use a template function for the implementation, and use the macro
to define a constant function pointer with the expected name. Not
sure if there's a cleaner way to do this. A version using variadic
templates to forward the arguments worked out to less code, but it
added a noticeable ~10 seconds to the compilation time of this file.

This will help avoid another copy-paste version of this function
in a future change.
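
For readers without an LLVM checkout, the pattern can be sketched in isolation. This is a minimal sketch, not the real code: `MCInst`, `MCDisassembler`, `DecodeStatus`, and the `Mock` namespace are hypothetical stand-ins for the LLVM types, and the mock implementation just records its arguments instead of calling `addOperand`/`createRegOperand`.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-ins for the LLVM types; not the real API.
enum DecodeStatus { Fail, Success };
struct MCInst {
  unsigned RegClassID = 0;
  unsigned RegIndex = 0;
};
struct MCDisassembler {};

namespace Mock {
// Assumed register class IDs, playing the role of AMDGPU::*RegClassID.
enum : unsigned { VGPR_32RegClassID = 1, AGPR_32RegClassID = 2 };
} // namespace Mock

// Shared implementation, parameterized on the register class ID.
template <unsigned RegClassID>
static DecodeStatus decodeRegisterClassImpl(MCInst &Inst, unsigned Imm,
                                            uint64_t /*Addr*/,
                                            const MCDisassembler * /*Decoder*/) {
  assert(Imm < (1u << 8) && "8-bit encoding");
  Inst.RegClassID = RegClassID; // The real code adds a register operand here.
  Inst.RegIndex = Imm;
  return Success;
}

// Function pointer type taken from one instantiation, so the long
// argument list is written only once.
using RegClassDecoder = decltype(&decodeRegisterClassImpl<0>);

// The macro only names a constant function pointer to an instantiation.
#define DECODE_OPERAND_REG_8(RegClass)                                         \
  static constexpr RegClassDecoder Decode##RegClass##RegisterClass =           \
      decodeRegisterClassImpl<Mock::RegClass##RegClassID>;

DECODE_OPERAND_REG_8(VGPR_32)
DECODE_OPERAND_REG_8(AGPR_32)
```

Callers use `DecodeVGPR_32RegisterClass(Inst, Imm, Addr, Decoder)` exactly as if it were a function defined by the macro.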

llvmbot · 2025-08-28T05:42:04Z

@llvm/pr-subscribers-backend-amdgpu

Author: Matt Arsenault (arsenm)

Full diff: https://github.com/llvm/llvm-project/pull/155755.diff

1 Files Affected:

(modified) llvm/lib/Target/AMDGPU/Disassembler/AMDGPUDisassembler.cpp (+15-10)

diff --git a/llvm/lib/Target/AMDGPU/Disassembler/AMDGPUDisassembler.cpp b/llvm/lib/Target/AMDGPU/Disassembler/AMDGPUDisassembler.cpp
index 6a2beeed41dfd..8651ddc89dce2 100644
--- a/llvm/lib/Target/AMDGPU/Disassembler/AMDGPUDisassembler.cpp
+++ b/llvm/lib/Target/AMDGPU/Disassembler/AMDGPUDisassembler.cpp
@@ -146,17 +146,22 @@ static DecodeStatus decodeDpp8FI(MCInst &Inst, unsigned Val, uint64_t Addr,
     return addOperand(Inst, DAsm->DecoderName(Imm));                           \
   }
 
-// Decoder for registers, decode directly using RegClassID. Imm(8-bit) is
-// number of register. Used by VGPR only and AGPR only operands.
+// Decoder for registers, decode directly using RegClassID. Imm(8-bit) is number
+// of register. Used by VGPR only and AGPR only operands.
+template <unsigned RegClassID>
+static DecodeStatus decodeRegisterClassImpl(MCInst &Inst, unsigned Imm,
+                                            uint64_t /*Addr*/,
+                                            const MCDisassembler *Decoder) {
+  assert(Imm < (1 << 8) && "8-bit encoding");
+  auto DAsm = static_cast<const AMDGPUDisassembler *>(Decoder);
+  return addOperand(Inst, DAsm->createRegOperand(RegClassID, Imm));
+}
+
+using RegClassDecoder = decltype(&decodeRegisterClassImpl<0>);
+
 #define DECODE_OPERAND_REG_8(RegClass)                                         \
-  static DecodeStatus Decode##RegClass##RegisterClass(                         \
-      MCInst &Inst, unsigned Imm, uint64_t /*Addr*/,                           \
-      const MCDisassembler *Decoder) {                                         \
-    assert(Imm < (1 << 8) && "8-bit encoding");                                \
-    auto DAsm = static_cast<const AMDGPUDisassembler *>(Decoder);              \
-    return addOperand(                                                         \
-        Inst, DAsm->createRegOperand(AMDGPU::RegClass##RegClassID, Imm));      \
-  }
+  static const constexpr RegClassDecoder Decode##RegClass##RegisterClass =     \
+      decodeRegisterClassImpl<AMDGPU::RegClass##RegClassID>;
 
 #define DECODE_SrcOp(Name, EncSize, OpWidth, EncImm)                           \
   static DecodeStatus Name(MCInst &Inst, unsigned Imm, uint64_t /*Addr*/,      \

Pierre-vh · 2025-08-28T12:05:39Z

llvm/lib/Target/AMDGPU/Disassembler/AMDGPUDisassembler.cpp

+  static const constexpr RegClassDecoder Decode##RegClass##RegisterClass =     \
+      decodeRegisterClassImpl<AMDGPU::RegClass##RegClassID>;


Suggested change

-  static const constexpr RegClassDecoder Decode##RegClass##RegisterClass =     \
-      decodeRegisterClassImpl<AMDGPU::RegClass##RegClassID>;
+  static const constexpr RegClassDecoder Decode##RegClass##RegisterClass = [](/* params go here */) { return decodeRegisterClassImpl(AMDGPU::RegClass##RegClassID, /* forward rest here */); };

What about a constexpr lambda and no template?
You could use std::bind too, but I don't know if that's constexpr everywhere.

That might as well be a macro at that point. The reason I did it this way was to avoid repeating the long argument list 3 times.
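
To make the trade-off concrete, here is a rough sketch of the non-template lambda variant under discussion. It assumes hypothetical mock types (`MCInst`, `MCDisassembler`, `DecodeStatus`, `Mock`) in place of the LLVM ones, and relies on C++17, where a captureless lambda converts to a function pointer in a constant expression. The argument list ends up written in the implementation, in the pointer type, and again in the lambda.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-ins for the LLVM types; not the real API.
enum DecodeStatus { Fail, Success };
struct MCInst {
  unsigned RegClassID = 0;
  unsigned RegIndex = 0;
};
struct MCDisassembler {};

namespace Mock {
enum : unsigned { VGPR_32RegClassID = 1 }; // Assumed ID for illustration.
} // namespace Mock

// Non-template implementation: the register class ID is a runtime argument.
static DecodeStatus decodeRegisterClassImpl(unsigned RegClassID, MCInst &Inst,
                                            unsigned Imm, uint64_t /*Addr*/,
                                            const MCDisassembler * /*Decoder*/) {
  assert(Imm < (1u << 8) && "8-bit encoding");
  Inst.RegClassID = RegClassID;
  Inst.RegIndex = Imm;
  return Success;
}

// Argument list, second copy: the explicit function pointer type.
using RegClassDecoder = DecodeStatus (*)(MCInst &, unsigned, uint64_t,
                                         const MCDisassembler *);

// Argument list, third copy: the lambda parameters inside the macro.
#define DECODE_OPERAND_REG_8(RegClass)                                         \
  static constexpr RegClassDecoder Decode##RegClass##RegisterClass =           \
      [](MCInst &Inst, unsigned Imm, uint64_t Addr,                            \
         const MCDisassembler *Decoder) {                                      \
        return decodeRegisterClassImpl(Mock::RegClass##RegClassID, Inst, Imm,  \
                                       Addr, Decoder);                         \
      };

DECODE_OPERAND_REG_8(VGPR_32)
```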

Suggested change

-  static const constexpr RegClassDecoder Decode##RegClass##RegisterClass =     \
-      decodeRegisterClassImpl<AMDGPU::RegClass##RegClassID>;
+  static const constexpr RegClassDecoder Decode##RegClass##RegisterClass = [](auto... args) { return decodeRegisterClassImpl(AMDGPU::RegClass##RegClassID, args...); };

?

Though with that one, I think you need to make the RegClassDecoder signature more explicit, or even type it out manually, so unfortunately you still need to type the args twice.

#define DECODE_OPERAND_REG_8(RegClass)                                         \
  static const constexpr auto Decode##RegClass##RegisterClass =                \
      [](auto... args) {                                                       \
        return decodeRegisterClassImpl<AMDGPU::RegClass##RegClassID>(args...); \
      };

Works, but it seems to cost about 5 seconds of build time
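
A standalone sketch of the generic-lambda variant, for comparison. It again uses hypothetical mock types rather than the LLVM ones and assumes C++17 so that the closure can be a `constexpr` variable. One deliberate difference from the macro above: the sketch takes the pack as forwarding references (`auto &&...`) and uses `std::forward`, because a by-value pack (`auto...`) would copy the mock `MCInst` and the write inside the implementation would land on the copy.

```cpp
#include <cassert>
#include <cstdint>
#include <utility>

// Hypothetical stand-ins for the LLVM types; not the real API.
enum DecodeStatus { Fail, Success };
struct MCInst {
  unsigned RegClassID = 0;
  unsigned RegIndex = 0;
};
struct MCDisassembler {};

namespace Mock {
enum : unsigned { VGPR_32RegClassID = 1 }; // Assumed ID for illustration.
} // namespace Mock

// Shared implementation, parameterized on the register class ID.
template <unsigned RegClassID>
static DecodeStatus decodeRegisterClassImpl(MCInst &Inst, unsigned Imm,
                                            uint64_t /*Addr*/,
                                            const MCDisassembler * /*Decoder*/) {
  assert(Imm < (1u << 8) && "8-bit encoding");
  Inst.RegClassID = RegClassID;
  Inst.RegIndex = Imm;
  return Success;
}

// Each decoder is a closure object with a templated call operator; the
// argument list is never spelled out, but every call site instantiates it.
#define DECODE_OPERAND_REG_8(RegClass)                                         \
  static constexpr auto Decode##RegClass##RegisterClass =                      \
      [](auto &&...Args) {                                                     \
        return decodeRegisterClassImpl<Mock::RegClass##RegClassID>(            \
            std::forward<decltype(Args)>(Args)...);                            \
      };

DECODE_OPERAND_REG_8(VGPR_32)
```

Because each decoder here is a distinct closure type rather than a plain function pointer, the compiler has more to instantiate per register class, which may be related to the build-time cost reported above; the sketch does not measure that.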

arsenm · 2025-09-02T00:22:43Z

Ping. Due to the compile-time issue, I think this should avoid the template. I'm thinking of opening a Clang bug for it.

arsenm added the backend:AMDGPU label Aug 28, 2025 — with Graphite App

arsenm requested review from jayfoad, jwanggit86, kosarev, Pierre-vh, rampitec and Sisyph August 28, 2025 05:42

arsenm marked this pull request as ready for review August 28, 2025 05:42

Pierre-vh reviewed Aug 28, 2025

arsenm added 3 commits August 28, 2025 22:07

Use lambda to forward arguments

5ccb1a6

This seems to cost 4-5 seconds in build time in the file

Revert "Use lambda to forward arguments"

e6cd6ef

This reverts commit 79d03ff80ed3ac9d8f072955c2bae5d7567ff56a.

arsenm force-pushed the users/arsenm/amdgpu/disassembler-reduce-code-in-macro branch from 0f88c29 to e6cd6ef Compare August 28, 2025 13:08
