Fixed device binding for active learning loop#1692
Conversation
|
/blossom-ci |
Greptile SummaryThis PR fixes a bug where
Important Files Changed
Reviews (1): Last reviewed commit: "Fixed device binding" | Re-trigger Greptile |
| finally: | ||
| if not was_initialized: | ||
| DistributedManager._shared_state.clear() |
There was a problem hiding this comment.
The teardown accesses the internal Borg-pattern dict directly with
.clear() instead of using DistributedManager.cleanup(), the official API. cleanup() also handles process-group teardown when _distributed is True, so the manual .clear() could skip that step in an environment where distributed training is actually active (e.g. when this test runs as part of a multi-process suite).
| finally: | |
| if not was_initialized: | |
| DistributedManager._shared_state.clear() | |
| finally: | |
| if not was_initialized: | |
| DistributedManager.cleanup() |
Signed-off-by: Mehdi Ataei <ataei8@gmail.com>
9785f5b to
8ed3be0
Compare
PhysicsNeMo Pull Request
Fixed device binding.
DistributedManager.devicebinds the descriptor, not the torch.device.Description
Checklist
Dependencies
Review Process
All PRs are reviewed by the PhysicsNeMo team before merging.
Depending on which files are changed, GitHub may automatically assign a maintainer for review.
We are also testing AI-based code review tools (e.g., Greptile), which may add automated comments with a confidence score.
This score reflects the AI’s assessment of merge readiness and is not a qualitative judgment of your work, nor is
it an indication that the PR will be accepted / rejected.
AI-generated feedback should be reviewed critically for usefulness.
You are not required to respond to every AI comment, but they are intended to help both authors and reviewers.
Please react to Greptile comments with 👍 or 👎 to provide feedback on their accuracy.