[tx] Introduce optimization step metrics dataclass #1191

pcmoritz merged 3 commits into NovaSky-AI:main
Conversation
/gemini review
Code Review
The pull request introduces a new OptimStepMetrics dataclass to standardize the reporting of optimization metrics in the JAX backend. It also includes a safety fix for gradient averaging and updates tests to reflect these changes. The most significant change is in the optim_step logic, which now proceeds with an optimizer update even when no gradients are accumulated, potentially applying weight decay unexpectedly.
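For orientation, here is a minimal sketch of what such a metrics dataclass could look like. This is not the PR's actual implementation: the field names and the `to_dict()` helper are assumptions modeled on the metric keys visible in the diff snippets below.

```python
# Hypothetical sketch of an optimization-step metrics dataclass (not the PR's code).
from dataclasses import dataclass

import jax


@dataclass
class OptimStepMetrics:
    grad_norm: jax.Array      # global gradient norm after accumulation
    learning_rate: jax.Array  # learning rate used for this step

    def to_dict(self) -> dict[str, float]:
        # Convert device scalars to plain floats under the "skyrl.ai/" namespace,
        # matching the keys shown in the diff below.
        return {
            "skyrl.ai/grad_norm": self.grad_norm.item(),
            "skyrl.ai/learning_rate": self.learning_rate.item(),
        }
```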
| "skyrl.ai/grad_norm": self.grad_norm.item(), | ||
| "skyrl.ai/learning_rate": self.learning_rate.item(), |
```diff
  if self.accumulated_grads.counts[adapter_index] == 0:
-     logger.warning(f"No accumulated gradients for model {model_id}, skipping optimizer step")
-     return types.OptimStepOutput(metrics={"skyrl.ai/learning_rate": learning_rate})
+     logger.warning(f"No accumulated gradients for model {model_id}; applying step with zero gradients")
```
The removal of the early return when counts[adapter_index] == 0 changes the behavior of optim_step. Previously, the step was skipped entirely. Now, the code proceeds to call _compute_grads_and_update, which applies an optimizer update with zero gradients. For optimizers like AdamW, this will still apply weight decay to the parameters, which might be unintended if the user expects the step to be a no-op when no gradients are present. If the goal is to ensure metrics are always returned, consider restoring the early return but returning an OptimStepOutput with the expected metrics.
| logger.warning(f"No accumulated gradients for model {model_id}; applying step with zero gradients") | |
| logger.warning(f"No accumulated gradients for model {model_id}, skipping optimizer step") | |
| return types.OptimStepOutput(metrics={"skyrl.ai/grad_norm": 0.0, "skyrl.ai/learning_rate": learning_rate}) |
I don't think the semantics of an empty gradient step are well specified, but applying an empty optimizer step seems like a reasonable thing to do and is also in line with reporting a zero gradient norm. This case shouldn't be common, and handling everything uniformly (zero gradient norm, apply optimizer step) seems like good semantics to me.
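As a rough sketch of the uniform handling described here, the step could always average the accumulated gradients (guarding the division so an empty accumulator yields zeros rather than NaNs, matching the gradient-averaging safety fix mentioned in the review summary), report the resulting gradient norm (zero in the empty case), and apply the optimizer update unconditionally. The function and variable names below are assumptions for illustration, not the PR's actual code.

```python
# Hypothetical sketch of the uniform optim_step path (not the PR's implementation).
import jax
import optax


def averaged_grads(grad_sums, count):
    # Guard against division by zero: an empty accumulator yields all-zero
    # gradients instead of NaNs.
    return jax.tree_util.tree_map(lambda g: g / max(count, 1), grad_sums)


def optim_step(params, opt_state, optimizer, grad_sums, count, learning_rate):
    grads = averaged_grads(grad_sums, count)
    grad_norm = optax.global_norm(grads)  # zero when nothing was accumulated
    updates, opt_state = optimizer.update(grads, opt_state, params)
    params = optax.apply_updates(params, updates)
    metrics = {
        "skyrl.ai/grad_norm": float(grad_norm),
        "skyrl.ai/learning_rate": float(learning_rate),
    }
    return params, opt_state, metrics
```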
This is in preparation for merging #1008 and to make it easier to introduce metrics.