Add deprecation warnings for various inference configs #3294

vkuzo · 2025-11-05T11:18:04Z

Summary:

Adds warnings that the following configs will be moved to prototype in a
future release:

Int8DynamicActivationInt4WeightConfig
Int4DynamicActivationInt4WeightConfig
GemliteUIntXWeightOnlyConfig
Float8StaticActivationFloat8WeightConfig
UIntXWeightOnlyConfig
FPXWeightOnlyConfig

See #2752 for more context

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

[ghstack-poisoned]

vkuzo · 2025-11-05T11:18:06Z

Stack from ghstack (oldest at bottom):

-> Add deprecation warnings for various inference configs #3294

Summary: Adds warnings that the following configs will be moved to prototype in a future release: * `Int8DynamicActivationInt4WeightConfig` * `Int4DynamicActivationInt4WeightConfig` * `GemliteUIntXWeightOnlyConfig` * `Float8StaticActivationFloat8WeightConfig` * `UIntXWeightOnlyConfig` * `FPXWeightOnlyConfig` See #2752 for more context Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 6eec9d6 ghstack-comment-id: 3490630687 Pull-Request: #3294

pytorch-bot · 2025-11-05T11:18:09Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3294

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit e83eb83 with merge base 9266734 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[ghstack-poisoned]

Summary: Adds warnings that the following configs will be moved to prototype in a future release: * `Int8DynamicActivationInt4WeightConfig` * `Int4DynamicActivationInt4WeightConfig` * `GemliteUIntXWeightOnlyConfig` * `Float8StaticActivationFloat8WeightConfig` * `UIntXWeightOnlyConfig` * `FPXWeightOnlyConfig` See #2752 for more context Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 548fca9 ghstack-comment-id: 3490630687 Pull-Request: #3294

andrewor14 · 2025-11-06T15:20:11Z

torchao/quantization/quant_api.py

            "torchao.quantization.Int8DynamicActivationInt4WeightConfig"
        )
+        warnings.warn(
+            "`Int8DynamicActivationInt4WeightConfig` will be moving to prototype in a future release of torchao. Please see https://github.com/pytorch/ao/issues/2752 for more details."


I feel this config can just be removed in the future in favor of Int8DynamicActivationIntXWeightConfig. Also maybe add the replacement in the warning?

I agree long term, I want to prioritize moving these out of main folder more urgently to clean up the main folder, and deleting outright is less urgent. If someone wants to own deleting outright on a tight timeline, sounds good to me!

andrewor14 · 2025-11-06T15:23:21Z

test/quantization/test_quant_api.py

+                # Each call should have at least one warning.
+                # Some of them can have two warnings - one for deprecation,
+                # one for moving to prototype
+                self.assertTrue(len(_warnings) > 0)


I think we want to assert len(_warnings) == 1 here, since we moved the warnings context manager outside the loop? E.g. if it's 2 then that means we've logged a warning each time we call the API, which could be very noisy

I can fix this, we can assert for length 1 to 2. The 2nd warning is the one being added in this PR.

andrewor14 · 2025-11-06T15:24:31Z

torchao/quantization/quant_api.py

            "torchao.quantization.Int4DynamicActivationInt4WeightConfig"
        )
+        warnings.warn(
+            "`Int4DynamicActivationInt4WeightConfig` will be moving to prototype in a future release of torchao. Please see https://github.com/pytorch/ao/issues/2752 for more details."


Do we know if anyone's actually using these configs? Is it moving to prototype just in case someone is still using them?

Moving to prototype is to clean up main folder faster, and we can delete from prototype at a later time. Note that there are some internal use cases using some of these configs.

[ghstack-poisoned]

Summary: Adds warnings that the following configs will be moved to prototype in a future release: * `Int8DynamicActivationInt4WeightConfig` * `Int4DynamicActivationInt4WeightConfig` * `GemliteUIntXWeightOnlyConfig` * `Float8StaticActivationFloat8WeightConfig` * `UIntXWeightOnlyConfig` * `FPXWeightOnlyConfig` See #2752 for more context Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 2568e85 ghstack-comment-id: 3490630687 Pull-Request: #3294

* Update [ghstack-poisoned] * Update [ghstack-poisoned] * Update [ghstack-poisoned]

Update

768107f

[ghstack-poisoned]

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 5, 2025

vkuzo added the topic: deprecation Use this tag if this PR deprecates a feature label Nov 5, 2025

vkuzo requested review from andrewor14, jainapurva and jerryzh168 November 5, 2025 11:18

Update

32d6d8a

[ghstack-poisoned]

andrewor14 reviewed Nov 6, 2025

View reviewed changes

andrewor14 approved these changes Nov 6, 2025

View reviewed changes

Update

e83eb83

[ghstack-poisoned]

vkuzo merged commit 1fbc364 into main Nov 7, 2025
50 checks passed

namgyu-youn pushed a commit to namgyu-youn/ao that referenced this pull request Nov 21, 2025

Add deprecation warnings for various inference configs (pytorch#3294)

6e59b6a

* Update [ghstack-poisoned] * Update [ghstack-poisoned] * Update [ghstack-poisoned]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add deprecation warnings for various inference configs #3294

Add deprecation warnings for various inference configs #3294

vkuzo commented Nov 5, 2025

Uh oh!

vkuzo commented Nov 5, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Nov 5, 2025 •

edited

Loading

Uh oh!

andrewor14 Nov 6, 2025

Uh oh!

vkuzo Nov 6, 2025

Uh oh!

andrewor14 Nov 6, 2025

Uh oh!

vkuzo Nov 6, 2025

Uh oh!

andrewor14 Nov 6, 2025

Uh oh!

vkuzo Nov 6, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add deprecation warnings for various inference configs #3294

Add deprecation warnings for various inference configs #3294

Conversation

vkuzo commented Nov 5, 2025

Uh oh!

vkuzo commented Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3294

✅ No Failures

Uh oh!

andrewor14 Nov 6, 2025

Choose a reason for hiding this comment

Uh oh!

vkuzo Nov 6, 2025

Choose a reason for hiding this comment

Uh oh!

andrewor14 Nov 6, 2025

Choose a reason for hiding this comment

Uh oh!

vkuzo Nov 6, 2025

Choose a reason for hiding this comment

Uh oh!

andrewor14 Nov 6, 2025

Choose a reason for hiding this comment

Uh oh!

vkuzo Nov 6, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

vkuzo commented Nov 5, 2025 •

edited

Loading

pytorch-bot bot commented Nov 5, 2025 •

edited

Loading