refactor autoencoder tests (temporal decoder, cosmos, kvae, mochi) by akshan-main · Pull Request #13832 · huggingface/diffusers

akshan-main · 2026-05-29T09:20:23Z

What does this PR do?

Part of the ongoing modeling-test migration (following #13369 and #13153). Migrates the AutoencoderKLTemporalDecoder, AutoencoderKLCosmos, AutoencoderKLKVAE, and AutoencoderKLMochi test suites to the mixin-based structure (Config + ModelTesterMixin / TrainingTesterMixin / MemoryTesterMixin / NewAutoencoderTesterMixin).

Also fixes AutoencoderKLMochi.forward to propagate return_dict to decode. Previously, with return_dict=False, forward returned a tuple wrapping a DecoderOutput instead of a tuple containing a tensor.
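
For readers unfamiliar with the mixin-based layout mentioned above, here is a rough, self-contained sketch of the idea: a config class describes the model and its dummy inputs, and generic tester mixins are written only against that config. Every name here (ToyAutoencoder, ToyAutoencoderTesterConfig, get_init_dict, get_dummy_inputs, the Toy* mixins) is a hypothetical stand-in, and unittest is used only so the sketch runs on its own. It is not the actual code in tests/models/testing_utils.

```python
import unittest


class ToyAutoencoder:
    """Hypothetical stand-in for a model class; not a diffusers model."""

    def __init__(self, scale: float = 2.0):
        self.scale = scale

    def __call__(self, sample):
        # "Encode" by scaling, then "decode" by undoing the scaling.
        latent = [x * self.scale for x in sample]
        return [z / self.scale for z in latent]


class ToyAutoencoderTesterConfig:
    """Config half: which model to build and which inputs to feed it."""

    model_class = ToyAutoencoder
    main_input_name = "sample"

    def get_init_dict(self):
        return {"scale": 2.0}

    def get_dummy_inputs(self):
        return {"sample": [0.5, -1.0, 3.0]}


class ToyForwardTesterMixin:
    """Mixin half: a generic check written only against the config interface."""

    def test_output_matches_input_length(self):
        model = self.model_class(**self.get_init_dict())
        sample = self.get_dummy_inputs()[self.main_input_name]
        assert len(model(sample)) == len(sample)


class ToyDeterminismTesterMixin:
    """A second, independent group of checks that can be mixed in or left out."""

    def test_forward_is_deterministic(self):
        model = self.model_class(**self.get_init_dict())
        sample = self.get_dummy_inputs()[self.main_input_name]
        assert model(sample) == model(sample)


class TestToyAutoencoder(
    ToyAutoencoderTesterConfig,
    ToyForwardTesterMixin,
    ToyDeterminismTesterMixin,
    unittest.TestCase,
):
    """The concrete suite is just the config plus whichever mixins apply."""
```

The point of the split is that a per-model test file only has to supply the config, while each mixin contributes a reusable group of tests.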

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? (Discussed on Slack with @DN6.)
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@sayakpaul @DN6

sayakpaul · 2026-05-29T10:28:51Z

@askserge could you do a review?

github-actions

🤗 Serge says:

Overall

Clean migration of four autoencoder test suites to the new mixin-based structure. The pattern is consistent with other already-migrated tests in the repo. The model-side fix to AutoencoderKLMochi.forward is correct and well-motivated.

Correctness

Mochi forward fix — The old code called self.decode(z) (always return_dict=True), then manually checked if not return_dict: return (dec,), which would wrap a DecoderOutput in a tuple instead of returning a raw tensor tuple. The new code correctly propagates return_dict to decode, which already handles both branches. Good fix.
Dropped test_sharded_checkpoints_device_map skip — The old Mochi test skipped this test (with a multi-GPU error reason). The new ModelTesterMixin (from tests/models/testing_utils/common.py) does not define test_sharded_checkpoints_device_map, so the skip is correctly dropped.
Renamed test_effective_gradient_checkpointing → test_gradient_checkpointing_equivalence — Matches the method name in the new TrainingTesterMixin. Correct.
input_shape property removed — Not used anywhere in the new mixin framework or NewAutoencoderTesterMixin. Correct to drop.
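
To make the forward fix concrete, here is a minimal sketch of the before and after behaviour as described in the first point above. ToyVAE, forward_before, and forward_after are hypothetical names, DecoderOutput is a toy dataclass rather than the library's class, and plain lists stand in for tensors. This is not the actual AutoencoderKLMochi code.

```python
from dataclasses import dataclass


@dataclass
class DecoderOutput:
    """Toy stand-in for the library's output wrapper."""

    sample: list


class ToyVAE:
    def decode(self, z, return_dict=True):
        dec = [2 * v for v in z]  # placeholder for the real decoder
        if not return_dict:
            return (dec,)
        return DecoderOutput(sample=dec)

    def forward_before(self, z, return_dict=True):
        # Old pattern: decode() always runs with its default return_dict=True,
        # so `dec` is a DecoderOutput and the tuple below wraps that object.
        dec = self.decode(z)
        if not return_dict:
            return (dec,)
        return dec

    def forward_after(self, z, return_dict=True):
        # New pattern: let decode() handle both branches itself.
        return self.decode(z, return_dict=return_dict)


vae = ToyVAE()
old = vae.forward_before([1, 2], return_dict=False)
new = vae.forward_after([1, 2], return_dict=False)
print(type(old[0]).__name__)  # the wrapper leaks into the tuple
print(new)                    # a plain one-element tuple holding the data
```

With return_dict=True both variants return the same DecoderOutput, which is presumably why the old behaviour went unnoticed until a test exercised the tuple path.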

Minor observations

The generator property creates a new torch.Generator on every access. This is intentional (ensures fresh state per call) and matches the pattern in other migrated tests (test_models_autoencoder_kl.py, test_models_autoencoder_dc.py, etc.).
The main_input_name property is only consumed by MemoryTesterMixin (in test_layerwise_casting_training). All four configs correctly define it.
Pre-existing: the return type annotation on AutoencoderKLMochi.forward is torch.Tensor | torch.Tensor (line 1095, not in this diff) — should probably be DecoderOutput | tuple[torch.Tensor], but that's out of scope for this PR.
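
The generator observation above can be illustrated with a small sketch. It uses random.Random from the standard library as a stand-in for torch.Generator so that it runs without torch, and FreshGeneratorConfig, SharedGeneratorConfig, and draw are hypothetical names rather than anything from the test suite.

```python
import random


class FreshGeneratorConfig:
    @property
    def generator(self):
        # A new, identically seeded generator on every access, so each
        # consumer starts from the same state.
        return random.Random(0)


class SharedGeneratorConfig:
    def __init__(self):
        # One generator created once; its state advances across uses.
        self.generator = random.Random(0)


def draw(gen, n=3):
    """Draw n pseudo-random floats from the given generator."""
    return [gen.random() for _ in range(n)]


fresh = FreshGeneratorConfig()
shared = SharedGeneratorConfig()

# Two accesses to the property give reproducible, identical draws.
print(draw(fresh.generator) == draw(fresh.generator))
# Reusing one shared generator gives different draws the second time.
print(draw(shared.generator) == draw(shared.generator))
```

The trade-off is a small allocation per access in exchange for tests that do not depend on the order in which inputs are built.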

20 LLM turns · 28 tool calls · 89.2s · 439893 in / 4044 out tokens

akshan-main · 2026-05-29T10:35:32Z

@sayakpaul Every other VAE uses DecoderOutput | torch.Tensor as its return type annotation, so I'll make that change too.

akshan-main added 5 commits May 28, 2026 23:57

refactor autoencoder_kl_temporal_decoder tests

cc699e9

refactor autoencoder_kl_cosmos tests

1167d1d

refactor autoencoder_kl_kvae tests

2064526

fix return_dict propagation in AutoencoderKLMochi.forward

a790bae

refactor autoencoder_kl_mochi tests

d9b5436

github-actions Bot added the size/L (PR with diff > 200 LOC), models, and tests labels and removed the size/L (PR with diff > 200 LOC) label on May 29, 2026

add docstrings

f3a0388

github-actions Bot added the size/L (PR with diff > 200 LOC) label on May 29, 2026

github-actions Bot reviewed May 29, 2026


fix return type annotation

f373fa9

akshan-main wants to merge 7 commits into huggingface:main from akshan-main:tests-refactor-autoencoder-temporal-decoder

2 participants
