[2/x]: fix numerics integration test and test delayed vs dynamic #291

vkuzo · 2024-06-28T22:33:14Z

Stack from ghstack (oldest at bottom):

Summary:

1. the SAM test wasn't easy to use because it had real weights and hence
   required real data for useful testing, which is not convenient for
   an integration test. Switched to LLaMa FFN with random weights, and
   made all the thresholds tight to actually check numerics are close.
2. extended the numerics test to check all combinations of delayed vs
   dynamic scaling
3. to be able to do (2), extended the module swap utility to configure
   delayed vs dynamic on a model level, for now without an option to
   customize further
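
A rough sketch of what "all combinations of delayed vs dynamic" could look like when enumerated for a test. This is not the code from this PR: `ScalingType`, `SwapConfig`, and `all_swap_configs` are hypothetical names, and the assumption that the choice is made separately for the input, weight, and grad_output tensors is an illustration, not something the summary states.

```python
import itertools
from dataclasses import dataclass
from enum import Enum


class ScalingType(Enum):
    # Hypothetical enum: the two scaling modes named in the summary.
    DELAYED = "delayed"
    DYNAMIC = "dynamic"


@dataclass(frozen=True)
class SwapConfig:
    # Hypothetical model-level config; the three fields are an assumption
    # about which tensors get an independent scaling choice.
    scaling_type_input: ScalingType
    scaling_type_weight: ScalingType
    scaling_type_grad_output: ScalingType


def all_swap_configs():
    """Return every delayed/dynamic combination across the three fields."""
    return [
        SwapConfig(*combo)
        for combo in itertools.product(ScalingType, repeat=3)
    ]


if __name__ == "__main__":
    # A numerics test could loop over these and run one comparison per config.
    for cfg in all_swap_configs():
        print(cfg)
```

With two modes and three fields this yields eight configurations, so a single parametrized test can cover the whole grid.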

Test Plan:

pytest test/test_numerics_integration.py -s -x
./test/test_everything.sh

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: D59305796

ghstack-source-id: 954ce82
Pull Request resolved: #291

drisspg · 2024-06-28T23:03:20Z

test/test_everything.sh

@@ -5,8 +5,8 @@ set -e
 IS_ROCM=$(rocm-smi --version || true)

 pytest test/test_base.py
-pytest test/test_sam.py


love it, this test has been annoying

drisspg · 2024-06-28T23:09:11Z

test/test_numerics_integration.py

+        model_fp8_out.sum().backward()
+
+        out_sqnr = compute_error(model_ref_out, model_fp8_out)
+        assert out_sqnr > 20.0


nit: maybe a message
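
One way to act on this nit, as a sketch only. `sqnr_db` is a hypothetical pure-Python stand-in for `compute_error` (assumed here to report a signal-to-quantization-noise ratio in dB), `check_sqnr` is likewise a made-up helper, and the sample values in the usage are invented for illustration.

```python
import math


def sqnr_db(ref, actual):
    """Hypothetical stand-in for compute_error: SQNR in dB over flat lists."""
    signal = math.sqrt(sum(r * r for r in ref))
    noise = math.sqrt(sum((r - a) ** 2 for r, a in zip(ref, actual)))
    if noise == 0.0:
        return math.inf  # identical outputs: no quantization noise at all
    if signal == 0.0:
        return -math.inf  # all-zero reference with a nonzero error
    return 20.0 * math.log10(signal / noise)


def check_sqnr(ref, actual, threshold_db=20.0, name="output"):
    """Assert the SQNR clears the threshold, with a message that says by how much it missed."""
    sqnr = sqnr_db(ref, actual)
    assert sqnr > threshold_db, (
        f"{name} SQNR {sqnr:.2f} dB is not above threshold {threshold_db} dB"
    )
    return sqnr


if __name__ == "__main__":
    # Made-up values standing in for reference and fp8 model outputs.
    print(check_sqnr([1.0, -2.0, 3.0], [1.01, -1.99, 3.02]))
```

When the assertion fails, the message reports the measured value and the threshold, so a CI log shows how far off the numerics were without a rerun.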

drisspg

great clean up, love it!

vkuzo · 2024-07-02T23:58:09Z

@vkuzo has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-07-03T14:34:06Z

This pull request has been merged in 1e71def.

vkuzo mentioned this pull request Jun 28, 2024

[1/x]: Make Float8Linear support dynamic scaling #290

Closed

facebook-github-bot added the CLA Signed label Jun 28, 2024

drisspg reviewed Jun 28, 2024

drisspg approved these changes Jun 28, 2024

facebook-github-bot closed this in 1e71def Jul 3, 2024

facebook-github-bot added the Merged label Jul 3, 2024

vkuzo mentioned this pull request Jul 16, 2024

Investigate Sam test tolerances #242

Closed