Documentation: dataset teminology consistency by zhen0427 · Pull Request #1411 · PowerGridModel/power-grid-model

zhen0427 · 2026-05-28T07:45:49Z

This PR updates and aligns dataset terminology across the documentation and user-facing API comments.

Main changes :

introducing consistent terminology for:
- buffer type (row-based / columnar)
- buffer representation (dense / sparse)
- component data uniformity (uniform / non-uniform)
- serialization representation (compact_list / named_map)
removing legacy terminology such as:
- scenario homogeneity
- attribute homogeneity
- IDs match
updating Serialization.md
updating dataset terminology documentation
updating Python API docstrings for dense/sparse batch representations and indptr behavior

This PR only contains documentation and comment updates. No functional code changes are included.

Signed-off-by: zhen0427 <Zhen.Wang@alliander.com>

mgovers

good progress. here's a preliminary review because i noticed the (in)homogeneous thingy

mgovers · 2026-05-28T15:09:24Z

+It is required when a component uses `DenseComponentData`, since dense representation relies on a fixed attribute order.
+
+It may be omitted for components that only use `SparseComponentData`.



this is not correct. Both dense and sparse component data may or may not use Attributes. Instead, the Attributes section provides the list of attributes that is used when using use_compact_list=True (see also https://github.com/PowerGridModel/power-grid-model/pull/1411/changes#r3318722900 ).

Homogeneous / inhomogeneous is a different distinction, independent of sparse vs dense

mgovers · 2026-05-28T15:10:54Z

-A [`ComponentData`](#json-schema-component-data-object) object is either a
-[`HomogeneousComponentData`](#json-schema-homogeneous-component-data-object) object or an
-[`InhomogeneousComponentData`](#json-schema-inhomogeneous-component-data-object) object
+A [`ComponentData`](#json-schema-component-data-object) represents the data of a single component instance.
+
+It can be stored in either dense or sparse representation:

- [`ComponentData`](#json-schema-component-data-object):
-  [`HomogeneousComponentData`](#json-schema-homogeneous-component-data-object) |
-  [`InhomogeneousComponentData`](#json-schema-inhomogeneous-component-data-object)
+- [`DenseComponentData`](#json-schema-component-data-object-dense-representation)
+- [`SparseComponentData`](#json-schema-component-data-object-sparse-representation)

-#### JSON schema homogeneous component data object
+#### JSON schema component data object (dense representation)


here and below, same as before: this is a different way of slicing.

Co-authored-by: Martijn Govers <martijn.govers@alliander.com> Signed-off-by: Zhen Wang <Zhen.Wang@alliander.com>

sonarqubecloud · 2026-05-29T08:41:14Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

dataset teminology consistency

19d398d

Signed-off-by: zhen0427 <Zhen.Wang@alliander.com>

zhen0427 added the documentation Improvements or additions to documentation label May 28, 2026

zhen0427 and others added 2 commits May 28, 2026 10:14

Merge branch 'main' into feature/documentation-consistency

c9993af

Merge branch 'main' into feature/documentation-consistency

8f70896

mgovers reviewed May 28, 2026

View reviewed changes

Update docs/user_manual/dataset-terminology.md

344199b

Co-authored-by: Martijn Govers <martijn.govers@alliander.com> Signed-off-by: Zhen Wang <Zhen.Wang@alliander.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Documentation: dataset teminology consistency#1411

Documentation: dataset teminology consistency#1411
zhen0427 wants to merge 4 commits into
mainfrom
feature/documentation-consistency

zhen0427 commented May 28, 2026

Uh oh!

mgovers left a comment

Uh oh!

Uh oh!

mgovers May 28, 2026

Uh oh!

mgovers May 28, 2026

Uh oh!

Uh oh!

sonarqubecloud Bot commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		It is required when a component uses `DenseComponentData`, since dense representation relies on a fixed attribute order.

		It may be omitted for components that only use `SparseComponentData`.

Conversation

zhen0427 commented May 28, 2026

Uh oh!

mgovers left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mgovers May 28, 2026

Choose a reason for hiding this comment

Uh oh!

mgovers May 28, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sonarqubecloud Bot commented May 29, 2026

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants