feat(pt_expt): implement DeepSpin model in pt_expt backend #5293

wanghan-iapcm wants to merge 9 commits into deepmodeling:master from
Conversation
…and load-from-file support

- Add `coord_corr_for_virial` support to the dpmodel and pt_expt spin model, matching the pt backend's virial correction for virtual-atom displacement. The correction flows through `process_spin_input` -> `call_common` -> `call_common_lower` -> `forward_common_atomic` -> `fit_output_to_model_output`.
- Add a torch.export test (`TestSpinEnerModelExportable`) verifying that `make_fx` tracing and `torch.export.export` work for the spin energy model.
- Fix `get_spin_model` in dpmodel to deepcopy the input data dict, preventing in-place mutation of `type_map`/`sel`/`exclude_types` (the pt backend already did this).
- Un-skip `test_load_stat_from_file` for the spin model; the load-from-file mechanism already worked correctly once the data-mutation bug was fixed.
- Add virial output comparison to the spin consistency tests (`TestSpinEner` and `TestSpinEnerLower`) across the pt and pt_expt backends.
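For context, the role of a coordinate correction in a force-based virial can be sketched as follows. This is a simplified NumPy illustration, not the actual deepmd implementation; the sign convention, tensor layout, and function name are assumptions made for the sketch:

```python
import numpy as np

def virial_from_forces(coord, force, coord_corr=None):
    """Per-frame virial as sum_i r_i (outer) f_i.

    coord, force: (nat, 3). coord_corr: optional (nat, 3) shift
    subtracted from the positions before the outer product, e.g. to
    undo the displacement applied to virtual (spin) atoms.
    Sign convention here is illustrative only.
    """
    r = coord if coord_corr is None else coord - coord_corr
    # einsum builds the (3, 3) outer-product sum over atoms
    return np.einsum("ai,aj->ij", r, force)

rng = np.random.default_rng(0)
coord = rng.normal(size=(4, 3))
force = rng.normal(size=(4, 3))
corr = rng.normal(size=(4, 3))
w0 = virial_from_forces(coord, force)
w1 = virial_from_forces(coord, force, coord_corr=corr)
# the correction changes the virial by exactly sum_i corr_i (outer) f_i
delta = np.einsum("ai,aj->ij", corr, force)
assert np.allclose(w0 - w1, delta)
```

Because the correction enters only through the positions used in the outer product, it can be threaded down the call chain as an extra tensor without touching the force computation itself.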
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 0abbf38d24
📝 Walkthrough

Threads coordinate-correction tensors (`coord_corr_for_virial` / `extended_coord_corr`) through the DP, JAX, and PT_EXPT model call paths for virial computations; migrates DP spin processing to array-API `xp` usage; adds a PT_EXPT `SpinEnergyModel` with an exportable lower forward; and expands cross-backend tests and stat-sampling support.
Sequence diagram:

```mermaid
sequenceDiagram
    participant Caller as Caller
    participant CM as CM / public model
    participant Backbone as BackboneModel
    participant Atomic as AtomicModel
    participant Transform as fit_output_to_model_output
    Note over CM,Backbone: coord_corr_for_virial supplied at top-level
    Caller->>CM: call_common(..., coord_corr_for_virial)
    CM->>Backbone: model_call_from_call_lower(..., coord_corr_for_virial)
    Backbone->>Atomic: forward_common_atomic(..., extended_coord_corr)
    Atomic-->>Backbone: atomic outputs (+ extended_virial if requested)
    Backbone->>Transform: fit_output_to_model_output(outputs, extended_coord_corr)
    Transform-->>Caller: final outputs (energy/forces/virial)
```
Estimated code review effort: 🎯 5 (Critical) | ⏱️ ~120 minutes

🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed (1 warning)
Actionable comments posted: 2
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
deepmd/dpmodel/model/make_model.py (1)
374-399: ⚠️ Potential issue | 🔴 Critical
`extended_coord_corr` is dropped before output transformation.

`extended_coord_corr` reaches `forward_common_atomic` (Line 383) but is never consumed, so the virial coordinate correction is effectively not applied on this path.

Proposed fix:

```diff
 def forward_common_atomic(
     self,
     extended_coord: Array,
     extended_atype: Array,
     nlist: Array,
     mapping: Array | None = None,
     fparam: Array | None = None,
     aparam: Array | None = None,
     do_atomic_virial: bool = False,
     extended_coord_corr: Array | None = None,
 ) -> dict[str, Array]:
     atomic_ret = self.atomic_model.forward_common_atomic(
         extended_coord,
         extended_atype,
         nlist,
         mapping=mapping,
         fparam=fparam,
         aparam=aparam,
     )
     return fit_output_to_model_output(
         atomic_ret,
         self.atomic_output_def(),
         extended_coord,
+        extended_coord_corr=extended_coord_corr,
         do_atomic_virial=do_atomic_virial,
         mask=atomic_ret["mask"] if "mask" in atomic_ret else None,
     )
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@deepmd/dpmodel/model/make_model.py` around lines 374 - 399, The function forward_common_atomic drops the extended_coord_corr parameter instead of using it for virial corrections; forward extended_coord_corr into the downstream calls — pass extended_coord_corr to atomic_model.forward_common_atomic (so the atomic model can apply coordinate corrections) and also supply extended_coord_corr into fit_output_to_model_output (so virial/correction logic runs on the transformed output) while leaving all other args unchanged.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@deepmd/dpmodel/model/spin_model.py`:
- Line 248: The tuple unpacking "nframes, nloc, nnei = nlist.shape" introduces
an unused variable `nnei` causing a linter warning; change the unpacking in
spin_model.py (the line using `nlist.shape`) to discard the third value (e.g.,
"nframes, nloc, _ = nlist.shape" or remove the third target) so only used
variables remain, then run ruff check/format.
In `@source/tests/pt_expt/model/test_spin_ener_model.py`:
- Line 403: Remove the unused local variable nall assigned from
ext_coord.shape[1] in test_spin_ener_model.py (around the test that references
ext_coord) to satisfy lint (Ruff F841); simply delete the line "nall =
ext_coord.shape[1]" or replace it with a used expression if the count is needed
elsewhere (search for ext_coord and the surrounding test function to locate the
assignment).
ℹ️ Review info
⚙️ Run configuration
Configuration used: Repository UI
Review profile: CHILL
Plan: Pro
Run ID: 302ec291-5d78-47e1-9bb8-0fb738497162
📒 Files selected for processing (10)
- deepmd/dpmodel/model/make_model.py
- deepmd/dpmodel/model/model.py
- deepmd/dpmodel/model/spin_model.py
- deepmd/pt_expt/model/__init__.py
- deepmd/pt_expt/model/make_model.py
- deepmd/pt_expt/model/spin_ener_model.py
- deepmd/pt_expt/model/spin_model.py
- deepmd/pt_expt/model/transform_output.py
- source/tests/consistent/model/test_spin_ener.py
- source/tests/pt_expt/model/test_spin_ener_model.py
Signed-off-by: Han Wang <92130845+wanghan-iapcm@users.noreply.github.com>
```python
def forward(
    self,
    coord: torch.Tensor,
    atype: torch.Tensor,
    spin: torch.Tensor,
    box: torch.Tensor | None = None,
    fparam: torch.Tensor | None = None,
    aparam: torch.Tensor | None = None,
    do_atomic_virial: bool = False,
) -> dict[str, torch.Tensor]:
```

⚠️ Code scanning / CodeQL warning: Signature mismatch in overriding method
```python
def forward_lower(
    self,
    extended_coord: torch.Tensor,
    extended_atype: torch.Tensor,
    extended_spin: torch.Tensor,
    nlist: torch.Tensor,
    mapping: torch.Tensor | None = None,
    fparam: torch.Tensor | None = None,
    aparam: torch.Tensor | None = None,
    do_atomic_virial: bool = False,
) -> dict[str, torch.Tensor]:
```

⚠️ Code scanning / CodeQL warning: Signature mismatch in overriding method
Actionable comments posted: 1
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@deepmd/pt_expt/model/__init__.py`:
- Around line 26-28: SpinEnergyModel is not registered with the BaseModel registry and conflicts with EnergyModel via model_type; add the decorator `@BaseModel.register("spin_ener")` above the SpinEnergyModel class definition in spin_ener_model.py and update `SpinEnergyModel.model_type = "spin_ener"` (and any related registry-key uses) so get_model() can discover and instantiate it via config without colliding with EnergyModel.
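The register-then-lookup pattern this comment refers to can be sketched generically. This is a toy registry for illustration only, not the actual deepmd `BaseModel` API; the method names and class bodies here are assumptions:

```python
class BaseModel:
    # shared class-level table: model_type key -> model class
    _registry: dict[str, type] = {}

    @classmethod
    def register(cls, key):
        """Class decorator: file the subclass under `key` for config lookup."""
        def wrap(subcls):
            if key in cls._registry:
                # duplicate keys are exactly the collision the review warns about
                raise KeyError(f"duplicate model_type {key!r}")
            cls._registry[key] = subcls
            return subcls
        return wrap

    @classmethod
    def get_class_by_type(cls, key):
        return cls._registry[key]

@BaseModel.register("ener")
class EnergyModel(BaseModel):
    pass

@BaseModel.register("spin_ener")  # a distinct key avoids colliding with "ener"
class SpinEnergyModel(BaseModel):
    pass

assert BaseModel.get_class_by_type("spin_ener") is SpinEnergyModel
```

With distinct keys, a config-driven `get_model()` can resolve either class from its `model_type` string without ambiguity.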
📒 Files selected for processing (3)
- deepmd/pt_expt/model/__init__.py
- deepmd/pt_expt/model/make_model.py
- deepmd/pt_expt/model/transform_output.py
🚧 Files skipped from review as they are similar to previous changes (1)
- deepmd/pt_expt/model/make_model.py
Actionable comments posted: 1
♻️ Duplicate comments (1)
deepmd/dpmodel/model/spin_model.py (1)
248: ⚠️ Potential issue | 🟡 Minor

`nnei` is still unused in `extend_nlist`; please rename it to `_nnei` (or `_`) to clear the Ruff warning.

Proposed fix:

```diff
-    nframes, nloc, nnei = nlist.shape
+    nframes, nloc, _nnei = nlist.shape
```

As per coding guidelines (`**/*.py`): always run `ruff check .` and `ruff format .` before committing changes or CI will fail.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@deepmd/dpmodel/model/spin_model.py` at line 248, The tuple unpacking in extend_nlist assigns nframes, nloc, nnei from nlist but nnei is unused and triggers Ruff; change the third variable name to _nnei (or _) in the unpacking within the extend_nlist function (where nframes, nloc, nnei = nlist.shape is declared) to silence the unused-variable warning and run ruff check/format before committing.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@deepmd/dpmodel/model/spin_model.py`:
- Around line 630-638: translated_output_def currently omits the virial keys
that call()/call_lower() now emit; update the translated_output_def
implementation in the SpinModel (or related class) to include "virial" and
"atom_virial" in the output schema whenever the model can compute gradients
(e.g., when backbone_model.do_grad_c(var_name) is true and do_atomic_virial is
possible). Specifically, add entries for "virial" and conditional "atom_virial"
matching the runtime names produced in model_predict (those set from
model_ret[f"{var_name}_derv_c_redu"] and model_ret[f"{var_name}_derv_c"]) so
schema and runtime keys align; apply the same change to the other
translated_output_def occurrence referenced in the file.
📒 Files selected for processing (6)
- deepmd/dpmodel/model/spin_model.py
- deepmd/jax/model/base_model.py
- deepmd/jax/model/dp_model.py
- deepmd/jax/model/dp_zbl_model.py
- deepmd/pt/model/model/spin_model.py
- deepmd/pt_expt/model/spin_ener_model.py
🚧 Files skipped from review as they are similar to previous changes (1)
- deepmd/pt_expt/model/spin_ener_model.py
Actionable comments posted: 1
♻️ Duplicate comments (1)
deepmd/dpmodel/model/spin_model.py (1)
630-638: ⚠️ Potential issue | 🟠 Major

Add virial keys to `translated_output_def()` to match emitted outputs.

`call()` / `call_lower()` now emit virial fields, but `translated_output_def()` still omits the virial schema keys. This leaves a runtime/schema mismatch for output-definition consumers.

Proposed fix:

```diff
 if self.backbone_model.do_grad_r(var_name):
     output_def["force"] = deepcopy(out_def_data[f"{var_name}_derv_r"])
     output_def["force"].squeeze(-2)
     output_def["force_mag"] = deepcopy(out_def_data[f"{var_name}_derv_r_mag"])
     output_def["force_mag"].squeeze(-2)
+if self.backbone_model.do_grad_c(var_name):
+    output_def["virial"] = deepcopy(out_def_data[f"{var_name}_derv_c_redu"])
+    output_def["virial"].squeeze(-2)
+    output_def["atom_virial"] = deepcopy(out_def_data[f"{var_name}_derv_c"])
+    output_def["atom_virial"].squeeze(-2)
 return output_def
```

Also applies to: 813-821, 824-846
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@deepmd/dpmodel/model/spin_model.py` around lines 630 - 638, translated_output_def() is missing schema entries for the newly emitted virial fields causing a runtime/schema mismatch; update translated_output_def() to include keys "virial" and "atom_virial" (matching the outputs set in call() / call_lower()) with the appropriate shape/type descriptors, and ensure any conditional inclusion logic mirrors the conditions used when populating model_predict (e.g., respect do_grad_c and do_atomic_virial); also add the same keys in the other translated_output_def() occurrences mentioned so the schema aligns with the outputs at emission sites.
🧹 Nitpick comments (3)
source/tests/pt_expt/model/test_spin_ener_model.py (3)
170-177: Include `virial` in serialize/deserialize parity checks.

This test validates round-trip correctness but currently skips `virial`, which is a core path touched in this PR.

Proposed patch:

```diff
-for key in ["energy", "atom_energy", "force", "force_mag", "mask_mag"]:
+for key in ["energy", "atom_energy", "force", "force_mag", "mask_mag", "virial"]:
     np.testing.assert_allclose(
         ret1[key].detach().cpu().numpy(),
         ret2[key].detach().cpu().numpy(),
         rtol=1e-10,
         atol=1e-10,
         err_msg=f"Mismatch in {key} after round-trip",
     )
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@source/tests/pt_expt/model/test_spin_ener_model.py` around lines 170 - 177, The parity test loop in test_spin_ener_model.py currently checks keys ["energy","atom_energy","force","force_mag","mask_mag"] but omits "virial"; update the keys list used in the loop (the one iterating over keys and calling np.testing.assert_allclose on ret1[...] vs ret2[...]) to include "virial" so the serialize/deserialize round-trip asserts also verify virial equality for ret1 and ret2.
323-332: Avoid repeated CPU→device transfers in virial finite-difference evaluations.

`np_infer` builds CPU tensors each step and `eval_model` moves them to `env.DEVICE`; constructing directly on `env.DEVICE` reduces overhead in this hot loop.

Proposed patch:

```diff
 def np_infer(new_cell):
+    stretched_coord = torch.tensor(
+        stretch_box(coord, cell, new_cell), dtype=dtype, device=env.DEVICE
+    ).unsqueeze(0)
+    new_cell_t = torch.tensor(
+        new_cell, dtype=dtype, device=env.DEVICE
+    ).unsqueeze(0)
     result = eval_model(
         self.model,
-        torch.tensor(
-            stretch_box(coord, cell, new_cell), device="cpu"
-        ).unsqueeze(0),
-        torch.tensor(new_cell, device="cpu").unsqueeze(0),
+        stretched_coord,
+        new_cell_t,
         atype,
         spin.unsqueeze(0),
     )
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@source/tests/pt_expt/model/test_spin_ener_model.py` around lines 323 - 332, np_infer is creating tensors on CPU and relying on eval_model to move them to env.DEVICE each finite-difference step; change the tensor constructions inside np_infer to use device=env.DEVICE (e.g., torch.tensor(stretch_box(...), device=env.DEVICE) and torch.tensor(new_cell, device=env.DEVICE)) and ensure spin.unsqueeze(0) is also on env.DEVICE so eval_model doesn't perform repeated CPU→device transfers. Keep the same shapes/unsqueeze calls and use the existing stretch_box/new_cell inputs and atype argument names so callers remain unchanged.
95-102: Consider extracting shared random-system setup into one helper.

Cell/coord/atype/spin initialization is repeated in several tests; a common helper would reduce duplication and drift.
Also applies to: 121-130, 155-162, 271-280, 310-319, 372-381
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@source/tests/pt_expt/model/test_spin_ener_model.py` around lines 95 - 102, Extract the repeated random-system setup (torch.Generator(device="cpu").manual_seed(GLOBAL_SEED), creation of cell, coord, atype, and spin using the test-local dtype and generator) into a single helper function (e.g., make_random_system or build_test_system) and call that helper from each test; ensure the helper accepts dtype and optional seed or generator and returns the tuple (generator, cell, coord, atype, spin) so existing tests that reference generator, cell, coord, atype, and spin keep using the same symbols without duplicating initialization code.
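One possible shape for such a helper, sketched with NumPy for brevity (the real tests would use `torch.Generator` and `env.DEVICE` as described above; the seed value, shapes, and helper name are assumptions):

```python
import numpy as np

GLOBAL_SEED = 20240101  # placeholder; the real tests define their own seed

def make_random_system(nloc=5, ntypes=2, seed=GLOBAL_SEED):
    """Build one random periodic frame: cell, coords, types, spins."""
    rng = np.random.default_rng(seed)
    # slightly perturbed cubic cell so the box is non-degenerate
    cell = 4.0 * np.eye(3) + 0.1 * rng.normal(size=(3, 3))
    coord = rng.uniform(size=(nloc, 3)) @ cell  # fractional -> Cartesian
    atype = rng.integers(0, ntypes, size=nloc)
    spin = 0.5 * rng.normal(size=(nloc, 3))
    return cell, coord, atype, spin

cell, coord, atype, spin = make_random_system()
assert coord.shape == spin.shape == (5, 3)
```

Because every test calls one function, a later change to the system layout (say, a different cell perturbation) lands in a single place instead of six copies.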
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@deepmd/dpmodel/model/spin_model.py`:
- Around line 347-364: The per-atom field "aparam" is copied unchanged when
doubling virtual spin atoms causing shape mismatches; in the block inside
compute_or_load_stat()/the spin-sampling loop where tmp_dict is built (same area
that expands "natoms"), detect if "aparam" is in sys and expand it the same way
as other per-atom arrays: get the array namespace via
array_api_compat.array_namespace(sys["aparam"]) or using the existing
expand_aparam helper, and concat the slices to duplicate the spin-local atoms
(e.g. xp.concat([2 * aparam[:, :2], aparam[:, 2:], aparam[:, 2:]], axis=-1) or
call expand_aparam(aparam, nloc * 2)); assign the result to tmp_dict["aparam"]
so per-atom shapes stay synchronized with coord/atype.
📒 Files selected for processing (2)
- deepmd/dpmodel/model/spin_model.py
- source/tests/pt_expt/model/test_spin_ener_model.py
…ing all api. fix tranform_output_def issue
Actionable comments posted: 1
🧹 Nitpick comments (2)
deepmd/pt_expt/model/spin_model.py (1)
126-139: Keep `deserialize()` model-type driven to avoid future backbone lock-in.

`SpinModel.deserialize()` currently hardcodes `DPEnergyAtomicModel`; consider dispatching from serialized backbone metadata so the base `SpinModel` stays extensible.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@deepmd/pt_expt/model/spin_model.py` around lines 126 - 139, SpinModel.deserialize currently hardcodes DPEnergyAtomicModel; change it to pick the backbone class from the serialized metadata instead. In SpinModel.deserialize use the model identifier stored in data["backbone_model"] (e.g. a "type" or "model_name" field) to resolve the correct class via make_model or your model registry, then call .deserialize on that instance (replace the direct DPEnergyAtomicModel reference), keeping Spin.deserialize(data["spin"]) and the cls(...) return path unchanged so the SpinModel remains extensible.

deepmd/pt_expt/model/spin_ener_model.py (1)

96-111: Extract shared lower-output translation logic to reduce drift risk.

`forward_lower()` and `forward_lower_exportable()` duplicate nearly the same key translation/squeeze steps; a shared helper would make future output-key changes safer.

Also applies to: 181-196
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@deepmd/pt_expt/model/spin_ener_model.py` around lines 96 - 111, Forward_lower() and forward_lower_exportable() duplicate the key mapping/squeeze logic that builds model_predict (keys like "atom_energy", "energy", "extended_mask_mag", "extended_force", "extended_force_mag", "virial", "extended_virial") and the conditional squeezes based on self.backbone_model.do_grad_r("energy") and do_grad_c("energy"); extract that block into a single helper method (e.g., _translate_model_output or build_model_predict) that takes model_ret and do_atomic_virial and returns the normalized model_predict dict, then call this helper from both forward_lower and forward_lower_exportable (and the other duplicated location around lines 181-196) so all key translations and squeeze operations are centralized.
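A minimal sketch of the suggested extraction, using plain dicts and NumPy stand-ins for tensors. The user-facing key names on the right follow the review comment; the backbone key names on the left, the helper name, and the exact squeeze handling are assumptions:

```python
import numpy as np

# mapping from backbone output keys to user-facing keys,
# mirroring the translation both forward_lower variants perform
_LOWER_KEY_MAP = {
    "energy_redu": "energy",
    "energy": "atom_energy",
    "mask_mag": "extended_mask_mag",
    "energy_derv_r": "extended_force",
    "energy_derv_r_mag": "extended_force_mag",
    "energy_derv_c_redu": "virial",
}

def translate_lower_output(model_ret):
    """Rename backbone keys and squeeze the size-1 component axis of
    force-like derivatives, so both forward_lower variants share one path."""
    out = {}
    for src, dst in _LOWER_KEY_MAP.items():
        if src in model_ret:
            val = model_ret[src]
            if src.endswith("_derv_r") or src.endswith("_derv_r_mag"):
                val = np.squeeze(val, axis=-2)  # drop the size-1 axis
            out[dst] = val
    return out

ret = {"energy_redu": np.ones((2, 1)), "energy_derv_r": np.ones((2, 4, 1, 3))}
out = translate_lower_output(ret)
assert set(out) == {"energy", "extended_force"}
assert out["extended_force"].shape == (2, 4, 3)
```

Centralizing the table means adding or renaming an output key is a one-line change that both forward paths pick up automatically.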
📒 Files selected for processing (4)
- deepmd/dpmodel/model/spin_model.py
- deepmd/pt_expt/model/spin_ener_model.py
- deepmd/pt_expt/model/spin_model.py
- source/tests/consistent/model/test_spin_ener.py
…load_stat

When computing statistics for spin models with aparam, the aparam array must be expanded to include virtual atoms (zeros appended) before being passed to the backbone model, matching how aparam is expanded during forward inference. Without this, the fitting net computes incorrect aparam statistics (the mean is diluted by a factor of 2).

Also make the dpmodel `general_fitting.compute_input_stats` array-API compatible (`np.concatenate`/`np.sum` → `xp.concat`/`xp.sum`) so it works when called with torch tensors from the pt_expt backend.

Add fparam and aparam (`numb_fparam=2`, `numb_aparam=3`) to the spin model consistency tests to exercise these code paths.
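The zero-padding described above can be illustrated with a small NumPy sketch. The shapes and the append-zeros convention follow the commit message; the helper name is hypothetical:

```python
import numpy as np

def expand_aparam_for_virtual_atoms(aparam):
    """Append zero rows for the virtual (spin) atoms so per-atom
    statistics run over the doubled atom axis, matching how aparam
    is expanded during forward inference."""
    nframes, nloc, numb_aparam = aparam.shape
    zeros = np.zeros((nframes, nloc, numb_aparam), dtype=aparam.dtype)
    # result: (nframes, 2 * nloc, numb_aparam)
    return np.concatenate([aparam, zeros], axis=1)

aparam = np.full((2, 3, 4), 1.0)
expanded = expand_aparam_for_virtual_atoms(aparam)
assert expanded.shape == (2, 6, 4)
# virtual atoms carry zero aparam, so the mean over the doubled
# atom axis is half the mean over the real atoms alone
assert np.isclose(expanded.mean(), 0.5)
```

Feeding the backbone the same doubled, zero-padded array at statistics time and at inference time is what keeps the computed aparam mean consistent between the two paths.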
Codecov Report

❌ Patch coverage is

Additional details and impacted files:

```
@@            Coverage Diff             @@
##           master    #5293      +/-   ##
==========================================
+ Coverage   82.30%   82.36%    +0.06%
==========================================
  Files         767      769        +2
  Lines       76984    77151      +167
  Branches     3659     3660        +1
==========================================
+ Hits        63359    63547      +188
+ Misses      12454    12432       -22
- Partials     1171     1172        +1
```

☔ View full report in Codecov by Sentry.
Summary
- Implement `SpinModel` and `SpinEnergyModel` in the pt_expt backend, supporting spin degrees of freedom for magnetic systems
- Make the dpmodel `SpinModel` array-API compatible so the same code works across numpy/torch/jax backends
- Add virial correction (`coord_corr_for_virial`) to dpmodel and pt_expt, matching the pt backend
- Fix `get_spin_model` in dpmodel to not mutate the caller's input data dict (pt backend already used `deepcopy`)
dpmodel (
deepmd/dpmodel/model/)spin_model.py: Replace allnp.*operations witharray_api_compatequivalents (xp.concat,xp.where,xp.zeroswithdevice=, slicing instead ofxp.split). Addcompute_or_load_statand virial correction support viacoord_corr_for_virial/extended_coord_corr.make_model.py: Threadcoord_corr_for_virialthroughcall_common→model_call_from_call_lower(extends to ghost atoms via mapping) →call_common_lower→forward_common_atomic.model.py: Addcopy.deepcopy(data)inget_spin_modelto prevent in-place mutation of input dict.pt_expt (
deepmd/pt_expt/model/)spin_model.py(new):@torch_modulewrapper inheriting from dpmodelSpinModel.spin_ener_model.py(new):SpinEnergyModelwithforward()/forward_lower()/forward_lower_exportable()providing user-facing output translation.make_model.py,transform_output.py: Acceptextended_coord_corrfor virial correction.Tests
test_spin_ener_model.py(new): Unit tests for output keys/shapes, serialize/deserialize round-trip, dpmodel consistency, force finite-difference, virial finite-difference, andtorch.exportexportability.test_spin_ener.py: Cross-backend consistency tests forcall/call_lower,compute_or_load_stat, and load-from-file. Virial output now compared across pt and pt_expt.Test plan
python -m pytest source/tests/pt_expt/model/ -v— all 28 tests passpython -m pytest source/tests/consistent/model/test_spin_ener.py -v— all 12 tests pass (18 skipped for uninstalled backends)torch.export.exportverified onforward_lower_exportablecompute_or_load_statload-from-file verified across dp/pt/pt_exptSummary by CodeRabbit
New Features
Bug Fixes
Tests