You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When running lora qwen3 model with FSDP on 8 GPUs, we met the following issue that the device index has not been assigned to the self.device, so when FSDP uses the self.device to init the FSDP model, there is no device index info. Below is the warning message:
home/sdp/miniforge3/envs/zejun_ccl/lib/python3.10/site-packages/torch/distributed/fsdp/_init_utils.py:831: UserWarning: FSDP got the argument device_idxpu on rank 1, which does not have an explicit index. FSDP will use the current device 0. If this is incorrect, please explicitly calltorch.xpu.set_device()before FSDP initialization or pass in the explicit device index as thedevice_id argument.
Uh oh!
There was an error while loading. Please reload this page.
Reminder
System Info
Hi, @hiyouga
When running lora qwen3 model with FSDP on 8 GPUs, we met the following issue that the device index has not been assigned to the self.device, so when FSDP uses the self.device to init the FSDP model, there is no device index info. Below is the warning message:
home/sdp/miniforge3/envs/zejun_ccl/lib/python3.10/site-packages/torch/distributed/fsdp/_init_utils.py:831: UserWarning: FSDP got the argument
device_idxpu on rank 1, which does not have an explicit index. FSDP will use the current device 0. If this is incorrect, please explicitly call
torch.xpu.set_device()before FSDP initialization or pass in the explicit device index as the
device_idargument.
Reproduction
Running command:
examples/accelerate/fsdp_config_4c8t.yaml
Others
No response
The text was updated successfully, but these errors were encountered: