Skip to content

Conversation

fegin
Copy link
Contributor

@fegin fegin commented May 24, 2024

[ghstack-poisoned]
Copy link

pytorch-bot bot commented May 24, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/127069

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 1126ab5 with merge base c921c5c (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Copy link
Contributor

@wz337 wz337 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thanks for adding the test.

@fegin
Copy link
Contributor Author

fegin commented May 24, 2024

@pytorchbot merge

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

fegin added a commit that referenced this pull request May 24, 2024
Summary:
Distributed state_dict should not error out because the `model.state_dict()` will trigger FSDP to initialize.

Pull Request resolved: #127069
Approved by: https://github.com/wz337
pytorchmergebot pushed a commit that referenced this pull request May 27, 2024
Summary:
Distributed state_dict should not error out because the `model.state_dict()` will trigger FSDP to initialize.

Pull Request resolved: #127069
Approved by: https://github.com/wz337
atalman added a commit that referenced this pull request May 27, 2024
…127130)

* [DSD] Add a test to verify FSDP lazy initialization case (#127069)

Summary:
Distributed state_dict should not error out because the `model.state_dict()` will trigger FSDP to initialize.

Pull Request resolved: #127069
Approved by: https://github.com/wz337

* Add missing import get_optimizer_state_dict

---------

Co-authored-by: Andrey Talman <atalman@fb.com>
titaiwangms pushed a commit to titaiwangms/pytorch that referenced this pull request May 28, 2024
)

Summary:
Distributed state_dict should not error out because the `model.state_dict()` will trigger FSDP to initialize.

Pull Request resolved: pytorch#127069
Approved by: https://github.com/wz337
bigfootjon pushed a commit that referenced this pull request May 28, 2024
Summary:
Distributed state_dict should not error out because the `model.state_dict()` will trigger FSDP to initialize.

Pull Request resolved: #127069
Approved by: https://github.com/wz337

(cherry picked from commit b0871f9)
@github-actions github-actions bot deleted the gh/fegin/243/head branch June 25, 2024 01:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ciflow/trunk Trigger trunk jobs on your pull request Merged oncall: distributed Add this issue/PR to distributed oncall triage queue topic: not user facing topic category
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants