Some configs missing dataset.vocab_size in MockIterableDataset

**Describe the bug**

Some benchmarking configs, like https://github.com/NVIDIA-NeMo/Automodel/blob/main/examples/benchmark/configs/qwen3_moe_30b_te_deepep.yaml, use the MockIterableDataset but do not specify a vocab_size. This leads to ` TypeError: MockIterableDataset.__init__() missing 1 required positional argument: 'vocab_size'`
.
**Steps/Code to reproduce bug**

Run a recipe with qwen3_moe_30b_te_deepep.yaml


**Additional context**

There are multiple configs missing dataset.vocab_size, haven't found them all.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some configs missing dataset.vocab_size in MockIterableDataset #1286

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Some configs missing dataset.vocab_size in MockIterableDataset #1286

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions