Skip to content

local_proc_mesh with sim channels #475

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

thomasywang
Copy link
Contributor

Summary:
We want to instantiate local_proc_meshes that use ChannelTransport::SIm instead of ChannelTranport::Local so that the simulator can intercept and control the delivery of messages

Rollback Plan:

Differential Revision: D77941640

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jul 9, 2025
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D77941640

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D77941640

thomasywang added a commit to thomasywang/monarch-1 that referenced this pull request Jul 9, 2025
Summary:
Pull Request resolved: pytorch-labs#475

We want to instantiate local_proc_meshes that use ChannelTransport::SIm instead of ChannelTranport::Local so that the simulator can intercept and control the delivery of messages

To preserve to `allocate()` interface so that we can reuse existing test generation macros we will create a wrapper class for this around `LocalAlloc`

Differential Revision: D77941640
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D77941640

thomasywang added a commit to thomasywang/monarch-1 that referenced this pull request Jul 9, 2025
Summary:
Pull Request resolved: pytorch-labs#475

We want to instantiate local_proc_meshes that use ChannelTransport::SIm instead of ChannelTranport::Local so that the simulator can intercept and control the delivery of messages

To preserve to `allocate()` interface so that we can reuse existing test generation macros we will create a wrapper class for this around `LocalAlloc`

Differential Revision: D77941640
thomasywang and others added 3 commits July 10, 2025 09:53
Summary: Operational messages were used in order to signal to the simulator that it should perform certain actions like growing or shrinking the mesh. This was needed since the python and rust were running in separate processes, and messages were needed to communicate between the two, but now everything is on the same process so we can do this in memory.

Differential Revision: D77941643
Summary: The proxy was previously used since the simulator and the python were run in separate processes. Since both are now run in the same process we no longer need a proxy

Differential Revision: D77941641
Summary:
Pull Request resolved: pytorch-labs#475

We want to instantiate local_proc_meshes that use ChannelTransport::SIm instead of ChannelTranport::Local so that the simulator can intercept and control the delivery of messages

To preserve to `allocate()` interface so that we can reuse existing test generation macros we will create a wrapper class for this around `LocalAlloc`

Differential Revision: D77941640
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D77941640

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Meta Open Source bot. fb-exported
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants