don't error out in empty_cache under mempool context #158152

ngimel · 2025-07-11T21:33:06Z

Instead of erroring out when `empty_cache` is called during graph capture or under a mempool context, we now silently do nothing. This was already the behavior for mempools; cudagraphs used to error out, but it's fine to just ignore the call.
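
To illustrate the behavior described above, here is a minimal toy model in plain Python. It is not PyTorch code: `ToyCachingAllocator`, `pool_scope`, and `active_scopes` are hypothetical names invented for this sketch, and the real allocator's checks may look quite different. The only point is the contract: `empty_cache` becomes a no-op while a capture or mempool scope is open, and works normally otherwise.

```python
from contextlib import contextmanager


class ToyCachingAllocator:
    """Hypothetical stand-in for a caching allocator (not PyTorch's)."""

    def __init__(self):
        self.cached_blocks = []  # freed blocks kept around for reuse
        self.active_scopes = 0   # > 0 while a capture/mempool scope is open

    def free(self, block):
        # Freed blocks go back into the cache instead of to the driver.
        self.cached_blocks.append(block)

    @contextmanager
    def pool_scope(self):
        # Models "during graph capture or under a mempool context".
        self.active_scopes += 1
        try:
            yield
        finally:
            self.active_scopes -= 1

    def empty_cache(self):
        if self.active_scopes > 0:
            # Behavior described in this PR: silently ignore the call
            # rather than raising.
            return
        self.cached_blocks.clear()


allocator = ToyCachingAllocator()
allocator.free("block-a")

with allocator.pool_scope():
    allocator.empty_cache()                 # ignored inside the scope
    inside = len(allocator.cached_blocks)

allocator.empty_cache()                     # takes effect outside the scope
outside = len(allocator.cached_blocks)
print(inside, outside)
```

In this model the cached block survives the call made inside the scope and is released by the call made after the scope exits.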

pytorch-bot · 2025-07-11T21:33:11Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/158152

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 0be0650 with merge base 1f1f229:

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

pull / cuda12.8-py3.10-gcc9-sm75 / test (pr_time_benchmarks, 1, 1, linux.g4dn.metal.nvidia.gpu, unstable) (gh) (#153987)
MISSING REGRESSION TEST

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ngimel · 2025-07-12T00:33:29Z

@pytorchbot merge -i

pytorchmergebot · 2025-07-12T00:35:20Z

Merge started

Your change will be merged while ignoring the following 5 checks: pull / linux-jammy-py3.9-clang12 / build, pull / linux-jammy-cuda12.8-py3.10-gcc11-sm89 / build, pull / cuda12.8-py3.10-gcc9-sm75 / test (pr_time_benchmarks, 1, 1, linux.g4dn.metal.nvidia.gpu, unstable), Lint / lintrunner-noclang / linux-job, Lint / toc / linux-job

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

huydhn · 2025-07-12T04:38:44Z

@pytorchbot --help

pytorch-bot · 2025-07-12T04:38:46Z

PyTorchBot Help

usage: @pytorchbot [-h] {merge,revert,rebase,label,drci,cherry-pick} ...

In order to invoke the bot on your PR, include a line that starts with
@pytorchbot anywhere in a comment. That line will form the command; no
multi-line commands are allowed. Some commands may be used on issues as specified below.

Example:
    Some extra context, blah blah, wow this PR looks awesome

    @pytorchbot merge

optional arguments:
  -h, --help            Show this help message and exit.

command:
  {merge,revert,rebase,label,drci,cherry-pick}
    merge               Merge a PR
    revert              Revert a PR
    rebase              Rebase a PR
    label               Add label to a PR
    drci                Update Dr. CI
    cherry-pick         Cherry pick a PR onto a release branch

Merge

usage: @pytorchbot merge [-f MESSAGE | -i] [-ic] [-r [{viable/strict,main}]]

Merge an accepted PR, subject to the rules in .github/merge_rules.json.
By default, this will wait for all required checks (lint, pull) to succeed before merging.

optional arguments:
  -f MESSAGE, --force MESSAGE
                        Merge without checking anything. This requires a reason for auditing purposes, for example:
                        @pytorchbot merge -f 'Minor update to fix lint. Expecting all PR tests to pass'
                        
                        Please use `-f` as last resort, prefer `--ignore-current` to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.
  -i, --ignore-current  Merge while ignoring the currently failing jobs.  Behaves like -f if there are no pending jobs.
  -ic                   Old flag for --ignore-current. Deprecated in favor of -i.
  -r [{viable/strict,main}], --rebase [{viable/strict,main}]
                        Rebase the PR to re run checks before merging.  Accepts viable/strict or main as branch options and will default to viable/strict if not specified.

Revert

usage: @pytorchbot revert -m MESSAGE -c
                          {nosignal,ignoredsignal,landrace,weird,ghfirst}

Revert a merged PR. This requires that you are a Meta employee.

Example:
  @pytorchbot revert -m="This is breaking tests on trunk. hud.pytorch.org/" -c=nosignal

optional arguments:
  -m MESSAGE, --message MESSAGE
                        The reason you are reverting, will be put in the commit message. Must be longer than 3 words.
  -c {nosignal,ignoredsignal,landrace,weird,ghfirst}, --classification {nosignal,ignoredsignal,landrace,weird,ghfirst}
                        A machine-friendly classification of the revert reason.

Rebase

usage: @pytorchbot rebase [-s | -b BRANCH]

Rebase a PR. Rebasing defaults to the stable viable/strict branch of pytorch.
Repeat contributors may use this command to rebase their PR.

optional arguments:
  -s, --stable          [DEPRECATED] Rebase onto viable/strict
  -b BRANCH, --branch BRANCH
                        Branch you would like to rebase to

Label

usage: @pytorchbot label labels [labels ...]

Adds label to a PR or Issue [Can be used on Issues]

positional arguments:
  labels  Labels to add to given Pull Request or Issue [Can be used on Issues]

Dr CI

usage: @pytorchbot drci 

Update Dr. CI. Updates the Dr. CI comment on the PR in case it's gotten out of sync with actual CI results.

cherry-pick

usage: @pytorchbot cherry-pick --onto ONTO [--fixes FIXES] -c
                               {regression,critical,fixnewfeature,docs,release}

Cherry pick a pull request onto a release branch for inclusion in a release

optional arguments:
  --onto ONTO, --into ONTO
                        Branch you would like to cherry pick onto (Example: release/2.1)
  --fixes FIXES         Link to the issue that your PR fixes (Example: https://github.com/pytorch/pytorch/issues/110666)
  -c {regression,critical,fixnewfeature,docs,release}, --classification {regression,critical,fixnewfeature,docs,release}
                        A machine-friendly classification of the cherry-pick reason.

huydhn · 2025-07-12T04:39:37Z

@pytorchbot cherry-pick --onto release/2.8 --fixes vllm-project/vllm#20358 -c regression


pytorchbot · 2025-07-12T04:44:23Z

Cherry picking #158152

The cherry pick PR is at #158180 and it is linked with issue vllm-project/vllm#20358. The following tracker issues are updated:

[v.2.8.0] Release Tracker #156745 (comment)

Details for Dev Infra team

Raised by workflow job

don't error out in empty_cache under mempool context (#158152)

Now instead of erroring out on `empty_cache` call during graph capture or under mempool context, we will just silently do nothing. This used to be the behavior for mempools, cudagraphs used to error out, but it's fine to just ignore the call.

Pull Request resolved: #158152
Approved by: https://github.com/zou3519, https://github.com/eqy

(cherry picked from commit 9056279)

Co-authored-by: Natalia Gimelshein <ngimel@meta.com>

don't error out in empty_cache under mempool context

0be0650

ngimel requested review from eqy and syed-ahmed as code owners July 11, 2025 21:33

ngimel added the release notes: cuda release notes category label Jul 11, 2025

zou3519 approved these changes Jul 11, 2025


eqy approved these changes Jul 12, 2025


pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jul 12, 2025

pytorchmergebot added the merging label Jul 12, 2025

pytorchmergebot added the Merged label Jul 12, 2025

pytorchmergebot closed this in 9056279 Jul 12, 2025

pytorchmergebot removed the merging label Jul 12, 2025

pytorchbot mentioned this pull request Jul 12, 2025

[v.2.8.0] Release Tracker #156745

Open

ngimel mentioned this pull request Jul 13, 2025

torch.cuda.use_mem_pool is not thread safe #152861

Closed
