Skip to content

Release 5.0.1#610

Merged
maismail merged 3 commits into
logicalclocks:branch-5.0from
maismail:release_5.0.1
Jun 30, 2026
Merged

Release 5.0.1#610
maismail merged 3 commits into
logicalclocks:branch-5.0from
maismail:release_5.0.1

Conversation

@maismail

Copy link
Copy Markdown
Contributor

No description provided.

manu-sj and others added 3 commits June 30, 2026 11:15
logicalclocks#580)

* [FSTORE-1938] Support chaining of Transformation Functions using an Execution DAG
https://hopsworks.atlassian.net/browse/FSTORE-1938

Document chaining of transformation functions across the user guides:
how the output of one function feeds another, how the execution DAG
resolves the order, how cycles and duplicate output columns are
rejected, and how the DAG is rendered from the UI and from the SDK
with visualize_transformations().

A Transformation Functions Performance Tuning subsection in the
transformation functions guide covers the node-parallel execution
model: the n_processes argument and its defaults per input shape,
pool pre-spawning through init_serving and init_batch_scoring, Arrow
shared-memory staging, and the HSFS_TF_POOL_START_METHOD override.

The model-dependent transformations guide notes that statistics for
chained functions are fit in dependency order on the data each
function sees. The on-demand transformations guide covers chains
whose intermediate output is dropped from the feature group. No
migration entry is included since the changes are backwards
compatible.

Signed-off-by: Manu Sathyarajan Joseph <manu.joseph@logicalclocks.com>
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

* [FSTORE-1938] Support chaining of Transformation Functions using an Execution DAG
https://hopsworks.atlassian.net/browse/FSTORE-1938

Restructure the performance tuning section so it reads in order: what
the n_processes argument is, how parallelism maps to the DAG, when it
pays off, online serving specifics, implementation notes. The previous
version stated the sequential default three times across the first
three paragraphs and placed the practical guidance after the
implementation internals.

Content changes: a call-shape distinction in the guidance (batch and
offline calls benefit from worker processes, single feature vectors
rarely do because the per-call dispatch cost usually exceeds the work),
and a note that pre-spawning the pool removes the startup cost but not
the per-call dispatch cost. Both reflect the measured behavior of the
online batch chaining benchmark in the loadtest repository.

Signed-off-by: Manu Sathyarajan Joseph <manu.joseph@hopsworks.ai>
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* [FSTORE-1938] Support chaining of Transformation Functions using an Execution DAG
https://hopsworks.atlassian.net/browse/FSTORE-1938

Rework the chaining documentation for reading order on all three
pages. The hub page now flows what chaining is, example, uniform
offline and online behavior, statistics over chains with a link to the
model-dependent page, cross-type chaining, and invalid configurations
last instead of interleaved. The model-dependent page gives the
statistics-over-chains behavior its own subsection instead of a single
dangling sentence after the example, and states that statistics are
fit on the train split, each transformation executes once, and the
fitted values are persisted for serving. The on-demand page leads with
the example like the other pages, and the example now demonstrates the
dropped-column claims it previously only stated: both the raw input
and the intermediate are dropped, leaving one stored output.

Signed-off-by: Manu Sathyarajan Joseph <manu.joseph@hopsworks.ai>
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* Improving docs

---------

Signed-off-by: Manu Sathyarajan Joseph <manu.joseph@logicalclocks.com>
Signed-off-by: Manu Sathyarajan Joseph <manu.joseph@hopsworks.ai>
Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
* Update app docs for root-based routing

Document the new Python app routing model in the Hopsworks app guide.\n\n- make root-based routing the default for new Streamlit and custom apps\n- describe the UI proxy path mode and readiness probe settings\n- mark APP_BASE_URL_PATH as legacy-only for migration\n- explain the browser mount prefix and the forwarded prefix header\n- update the app details page to show proxy path and readiness settings

* Document app base path routing and readiness

- Update the apps guide to use app base path terminology
- Describe how Hopsworks mounts apps under / or a user-chosen subpath
- Clarify the readiness path behavior and keep APP_BASE_URL_PATH as migration-only guidance

* Document legacy Python app migration guidance
@maismail maismail merged commit c7e47fe into logicalclocks:branch-5.0 Jun 30, 2026
1 of 2 checks passed
@maismail maismail deleted the release_5.0.1 branch June 30, 2026 11:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants