Move memory from utils to training #1456
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/1456
Note: links to docs will display an error until the docs builds have been completed.
✅ No failures as of commit 3cdfe03 with merge base e959321. (This comment was automatically generated by Dr. CI and updates every 15 minutes.)
torchtune/utils/__init__.py
Outdated
"DEFAULT_TRACE_OPTS", | ||
"DummyProfiler", | ||
"PROFILER_KEY", | ||
"setup_torch_profiler", |
Do you mind double-checking these? I don't understand what is being added here or why.
Ah, good catch. These should be removed.
There are conflicts. I can approve after that.
docs/source/api_ref_training.rst
Outdated
.. _ac_label:

Memory Management
This is a duplicate of the section immediately following
One comment; other than that, it looks good.
Comment below.
log_memory_stats,
OptimizerInBackwardWrapper,
register_optim_in_bwd_hooks,
set_activation_checkpointing,
Some of the deleted imports are still in __all__.
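As an aside, this class of bug (names left in __all__ after their imports were deleted) is easy to catch mechanically. A minimal sketch, not part of the PR, assuming you run it against the module in question:

```python
# Sketch: verify that every name a module exports via __all__ actually
# resolves to an attribute on the module. Any leftover entries from
# deleted imports will show up in `stale`.
import torchtune.utils as utils

stale = [name for name in utils.__all__ if not hasattr(utils, name)]
assert not stale, f"__all__ lists names with no matching import: {stale}"
```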
Context
What is the purpose of this PR?
Addresses #1437.
Changelog
What are the changes made in this PR?
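The diff itself isn't reproduced here; as a sketch of the user-facing change implied by the PR title and the review comments above, assuming the memory utilities move from torchtune.utils to torchtune.training:

```python
# Before this PR (old import path, per the PR title):
# from torchtune.utils import (
#     OptimizerInBackwardWrapper,
#     log_memory_stats,
#     register_optim_in_bwd_hooks,
#     set_activation_checkpointing,
# )

# After this PR, the same names are imported from torchtune.training:
from torchtune.training import (
    OptimizerInBackwardWrapper,
    log_memory_stats,
    register_optim_in_bwd_hooks,
    set_activation_checkpointing,
)
```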
Test plan
Please make sure to do each of the following if applicable to your PR. (If you're not sure about any one of these, just ask and we will happily help. We also have a contributing page for some guidance on contributing.)
- pre-commit install
- pytest tests
- pytest tests -m integration_test
UX
If your function changed a public API, please add a dummy example of what the user experience will look like when calling it.
Example of docstring: https://github.com/pytorch/torchtune/blob/6a7951f1cdd0b56a9746ef5935106989415f50e3/torchtune/modules/vision_transformer.py#L285
Example in our docs: https://pytorch.org/torchtune/main/tutorials/qat_finetune.html#applying-qat-to-llama3-models
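No dummy example is included in the PR body. A minimal sketch of what calling one of the moved functions might look like under the new namespace; the model here is a stand-in, and the auto_wrap_policy parameter is assumed from torchtune's activation-checkpointing helper rather than confirmed by this PR:

```python
import torch
from torchtune import training

# Stand-in model; any nn.Module works for illustration.
model = torch.nn.Sequential(
    torch.nn.Linear(128, 128),
    torch.nn.TransformerEncoderLayer(d_model=128, nhead=4),
)

# Wrap matching submodules with activation checkpointing via the new path.
training.set_activation_checkpointing(
    model, auto_wrap_policy={torch.nn.TransformerEncoderLayer}
)
```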