[CUTLASS][WIP] Gate rowwise matmul CUTLASS kernels by compute capability #152642

eqy · 2025-05-01T23:25:19Z

Does this abate some compile-time warning spam?

cc @ptrblck @msaroufim @jerryzh168 @yanbing-j @vkuzo @albanD @kadeng @penguinwu

pytorch-bot · 2025-05-01T23:25:22Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152642

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

CI workflows being skipped on PR

❌ 4 New Failures

As of commit c474c5b with merge base b6c5886 ():

NEW FAILURES - The following jobs have failed:

Lint / Link checks / Lint URLs / linux-job (gh)
RuntimeError: Command docker exec -t cece38ddaabc5bbc7d6505ad9b91ab5a60314c7bb2f97abb29f645779e8c1929 /exec failed with exit code 1
pull / linux-focal-cuda12.6-py3.10-gcc11-sm89 / test (default, 2, 5, ephemeral.linux.g6.4xlarge.experimental.nvidia.gpu) (gh)
test_matmul_cuda.py::TestFP8MatmulCUDA::test_float8_rowwise_scaling_sanity_use_fast_accum_False_cuda
pull / linux-focal-cuda12.6-py3.10-gcc11-sm89 / test (default, 3, 5, ephemeral.linux.g6.4xlarge.experimental.nvidia.gpu) (gh)
inductor/test_fp8.py::TestFP8Lowering::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False
pull / linux-jammy-rocm-py3.10 / build (gh)
Final attempt failed. Child_process exited with error code 1

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Aidyn-A · 2025-06-10T15:48:49Z

aten/src/ATen/native/cuda/RowwiseScaledMM.cu

@@ -498,6 +502,7 @@ void f8f8bf16_rowwise_impl_sm89(
    at::Tensor w_scale,
    std::optional<at::Tensor> bias,
    at::Tensor out) {
+#if (defined(__CUDA_ARCH__)) && (__CUDA_ARCH__ == 890)


The test failures are really strange. As if this condition was evaluated as false.

github-actions · 2025-08-09T16:40:01Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

check in

c95e87b

eqy requested a review from syed-ahmed as a code owner May 1, 2025 23:25

pytorch-bot bot added the release notes: cuda release notes category label May 1, 2025

albanD self-requested a review May 2, 2025 14:01

albanD added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label May 2, 2025

Update RowwiseScaledMM.cu

c474c5b

Aidyn-A reviewed Jun 10, 2025

View reviewed changes

github-actions bot added the Stale label Aug 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CUTLASS][WIP] Gate rowwise matmul CUTLASS kernels by compute capability #152642

[CUTLASS][WIP] Gate rowwise matmul CUTLASS kernels by compute capability #152642

Uh oh!

eqy commented May 1, 2025 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented May 1, 2025 •

edited

Loading

Uh oh!

Aidyn-A Jun 10, 2025

Uh oh!

github-actions bot commented Aug 9, 2025

Uh oh!

Uh oh!

[CUTLASS][WIP] Gate rowwise matmul CUTLASS kernels by compute capability #152642

Are you sure you want to change the base?

[CUTLASS][WIP] Gate rowwise matmul CUTLASS kernels by compute capability #152642

Uh oh!

Conversation

eqy commented May 1, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented May 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152642

❗ 1 Active SEVs

❌ 4 New Failures

Uh oh!

Aidyn-A Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Aug 9, 2025

Uh oh!

Uh oh!

eqy commented May 1, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented May 1, 2025 •

edited

Loading