Align memory_format for conv2d/3d in Float8Tensor with hp Tensor #3352
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3352
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit fe4167f with merge base ff0e461.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Force-pushed from a25ffda to 8bbb423.
Force-pushed from 8bbb423 to 16dba36.
    # output should use channels_last format as long as any of the
    # input or weight is channels_last
    if is_input_channels_last or is_weight_channels_last:
        output = output.to(memory_format=torch.channels_last_3d)
I think this is the right thing to do semantics-wise, but note that this will incur a copy if the output isn't already in channels_last. Ideally, the kernel itself would output into channels_last directly to avoid the copy.
Edit: oh, I think you're already aware of this :)
Yes, the fbgemm kernel should already output a tensor in this format, so this becomes a no-op when either the input or the weight is in channels_last format.
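For context, a minimal sketch of the memory_format propagation discussed above (not the PR's exact code; the wrapper function and how the two booleans are computed are illustrative assumptions): the output is converted to channels_last_3d whenever either operand already uses that layout, and the conversion is a no-op when the kernel already produced that layout.

    import torch

    # Illustrative sketch only: mirror the high-precision conv3d behavior, where the
    # output picks up channels_last_3d if either the input or the weight uses it.
    def align_conv3d_output_memory_format(input, weight, output):
        is_input_channels_last = input.is_contiguous(memory_format=torch.channels_last_3d)
        is_weight_channels_last = weight.is_contiguous(memory_format=torch.channels_last_3d)
        if is_input_channels_last or is_weight_channels_last:
            # .to() does not copy when output is already channels_last_3d
            output = output.to(memory_format=torch.channels_last_3d)
        return output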
    act_qdata = act_qdata.contiguous()
    weight_qdata = weight_qdata.contiguous()
I don't think these contiguous() calls are the right thing to do here. Note that this will clobber channels_last for the activation and weight; calling contiguous(memory_format=torch.channels_last) would be more correct.
Edit: from offline discussion, we can't forget the permute()! We want a contiguous() (N, D, H, W, C_in) tensor, which is equivalent to a properly-permuted, contiguous(channels_last) (N, C_in, D, H, W) tensor.
Refactored this to first do contiguous(memory_format=torch.channels_last_3d) and then do the permute, to make it easier to follow.
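To illustrate the equivalence mentioned in the comment above, a small sketch (the tensor shapes and names are made up, not from the PR): an (N, C, D, H, W) tensor made contiguous in channels_last_3d and then permuted to (N, D, H, W, C) is plain-contiguous, i.e. it has the same memory layout as a contiguous NDHWC tensor.

    import torch

    # Hypothetical shapes for illustration: (N, C, D, H, W)
    x = torch.randn(2, 8, 4, 4, 4)

    # Make the underlying storage channels_last_3d (C becomes the fastest-moving dim) ...
    x_cl = x.contiguous(memory_format=torch.channels_last_3d)

    # ... then permute the logical dims to (N, D, H, W, C); no copy happens here.
    x_ndhwc = x_cl.permute(0, 2, 3, 4, 1)

    # The permuted view is plain-contiguous, matching the contiguous
    # (N, D, H, W, C_in) layout described in the comment above.
    assert x_ndhwc.is_contiguous()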
Force-pushed from 58bf5d1 to 642e3d0.
Force-pushed from 642e3d0 to fe4167f (Align memory_format for conv2d and conv3d in Float8Tensor with high precision Tensors, pytorch#3352).
Summary:
As the title says, we want to make sure the output of `F.conv3d(input, weight, ...)` and `F.conv3d(input, fp8_weight, ...)` have the same memory_format.

Test Plan:
python test/quantization/quantize_/workflows/float8/test_float8_tensor.py -k test_fp8_conv_variants
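For reference, a hedged sketch of the kind of memory_format parity check the test plan exercises (the real assertions live in test_fp8_conv_variants; the helper below and its arguments are hypothetical, and fp8_weight is assumed to be a torchao Float8Tensor wrapping the same weight, with its construction not shown):

    import torch
    import torch.nn.functional as F

    # Hypothetical helper, not the actual test code.
    def check_conv3d_memory_format_parity(input, weight, fp8_weight):
        out_hp = F.conv3d(input, weight)
        out_fp8 = F.conv3d(input, fp8_weight)
        # Both outputs should report the same channels_last_3d-ness.
        assert out_hp.is_contiguous(memory_format=torch.channels_last_3d) == \
               out_fp8.is_contiguous(memory_format=torch.channels_last_3d)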