Tabular Deep Learning parameters by edgararuiz · Pull Request #454 · tidymodels/dials

edgararuiz · 2026-06-12T19:26:04Z

Closes #452

Adds 14 new parameters for deep learning models:

SAINT (via brulee):
- attention_type() - values: "column", "row", "both"
- dropout_hidden() - range: 0, 0.5
- dropout_last() - range: 0, 0.5
tdl package (brulee engine):
- bottleneck_units() - range: 2L, 25L
- dropout_attn() - range: 0, 0.5
- dropout_embedding() - range: 0, 0.5
- num_attn_blocks() - range: 1L, 6L
- num_attn_feat() - range: 8L, 64L
- num_attn_heads() - range: 1L, 8L
- num_embedding() - range: 8L, 64L
- penalty_average() - range: -15, -5 (log10 scale)
- penalty_type() - values: "L1", "L2"
- resid_at() - range: 2L, unknown()
- step_rate() - range: 0, 8 (log10 scale)

All new parameters are added to the pkgdown reference index.
Several existing .Rd files have incidental changes from roxygen2 dropping \docType{data} and \format{} blocks for values_* vectors.

Example:

library(dials)

attention_type()
#> Attention Type (qualitative)
#> 3 possible values include:
#> 'column', 'row', and 'both'

dropout_hidden()
#> Hidden Dropout Rate (quantitative)
#> Range: [0, 0.5]

dropout_last()
#> Final Layer Dropout Rate (quantitative)
#> Range: [0, 0.5]

saint_params <- parameters(attention_type(), dropout_hidden(), dropout_last())
grid_random(saint_params, size = 5)
#> # A tibble: 5 × 3
#>   attention_type dropout_hidden dropout_last
#>   <chr>                   <dbl>        <dbl>
#> 1 column                 0.286        0.447 
#> 2 row                    0.497        0.0539
#> 3 both                   0.0125       0.0425
#> 4 column                 0.331        0.305 
#> 5 both                   0.0550       0.285

- 11 parameter source files (R/param_*.R): bottleneck_units, dropout_attn, dropout_embedding, num_attn_blocks, num_attn_feat, num_attn_heads, num_embedding, penalty_average, penalty_type, resid_at, step_rate - 11 corresponding Rd documentation files (man/*.Rd) Modified files Updates - NEWS.md and _pkgdown.yml - tests/testthat/test-params.R — 11 new range assertions + 1 values assertion Roxygen2 cleanup (unrelated to our changes) 18 existing man/*.Rd files stripped of \docType{data}, \format{}, and \keyword{datasets} — side effect of running devtools::document() with a newer roxygen2

Adds attention_type(), dropout_hidden(), and dropout_last() parameters

topepo · 2026-06-13T17:23:06Z

+#' @inheritParams Laplace
+#'
+#' @details
+#' Used as a tuning parameter for `tabular_auto_int()` in the `tdl` package


I guess it is time to make a call on the new package name: ~~bartab~~ tabular.

I'll update this and start the renaming process.

topepo

Looks good. I changed the package name from tdl

hfrick · 2026-06-15T18:07:58Z

I usually do a separate PR for updating the roxygen2 version so that it's easier to check for diffs due to that vs diffs of other nature. I've updated this PR from main with those changes so that we now only have the changes related to the new parameters on here.

Are all these parameters ones you expect to reuse across different engines or models? If not, could you please group them in a meaningful way? So far we've grouped by engine, that might also be right choice here. Grouping helps with navigating the options/docs and the code base. The rest is probably very straightforward and I'll take a look once we've established if they can be grouped. 🙌

Groups parameters in 4 families. Attention, RLN, Restnet, and the more generic ones into the existing param_network script

edgararuiz · 2026-06-15T20:00:08Z

Hi @hfrick!

Thanks! So the new groupings will be:

Group	File	Doc topic	Parameters
Neural network (generic)	`R/param_network.R`	`?dropout`	`dropout`, `epochs`, `hidden_units`, `hidden_units_2`, `batch_size`, `dropout_hidden`, `dropout_last`, `num_embedding`, `dropout_embedding`
Attention-based tabular models	`R/param_attention.R`	`?attention-param`	`attention_type`, `dropout_attn`, `num_attn_heads`, `num_attn_blocks`, `num_attn_feat`
Regularization learning networks	`R/param_rln.R`	`?rln-param`	`penalty_average`, `step_rate`, `penalty_type`
Residual networks	`R/param_resnet.R`	`?resnet-param`	`bottleneck_units`, `resid_at`

hfrick

Thanks!

topepo · 2026-06-16T12:34:09Z

I have one more parameter to add. I'll put it in this PR today.

edgararuiz added 2 commits June 12, 2026 09:48

Adds Saint parameters

a16c809

Adds attention_type(), dropout_hidden(), and dropout_last() parameters

edgararuiz requested a review from topepo June 12, 2026 19:26

topepo reviewed Jun 13, 2026

View reviewed changes

tdl -> tabular

f013535

topepo reviewed Jun 13, 2026

View reviewed changes

edgararuiz requested a review from hfrick June 14, 2026 19:45

Merge commit 'e33f75fe02af748cb3ce87ff64ba7fd23557ee03'

ca9d29c

Groups parameters

ea72504

Groups parameters in 4 families. Attention, RLN, Restnet, and the more generic ones into the existing param_network script

hfrick approved these changes Jun 16, 2026

View reviewed changes

Merge branch 'main' into deep-learning

a352a29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tabular Deep Learning parameters#454

Tabular Deep Learning parameters#454
edgararuiz wants to merge 6 commits into
mainfrom
deep-learning

edgararuiz commented Jun 12, 2026

Uh oh!

topepo Jun 13, 2026

Uh oh!

topepo left a comment

Uh oh!

hfrick commented Jun 15, 2026

Uh oh!

edgararuiz commented Jun 15, 2026

Uh oh!

hfrick left a comment

Uh oh!

topepo commented Jun 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

edgararuiz commented Jun 12, 2026

Uh oh!

topepo Jun 13, 2026

Choose a reason for hiding this comment

Uh oh!

topepo left a comment

Choose a reason for hiding this comment

Uh oh!

hfrick commented Jun 15, 2026

Uh oh!

edgararuiz commented Jun 15, 2026

Uh oh!

hfrick left a comment

Choose a reason for hiding this comment

Uh oh!

topepo commented Jun 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants