perf: welford's algorithm for mean-var aggregation by ilan-gold · Pull Request #4147 · scverse/scanpy

ilan-gold · 2026-06-08T12:14:02Z

From discussions with @zboldyga.

This should then in theory be reused with #4143 instead of its custom moments calculation
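For readers unfamiliar with the technique, here is a minimal sketch of Welford's update applied per group. It is not the kernel from this PR: the function name `grouped_mean_var`, its parameters, and the dense NumPy input are assumptions made for illustration (the review snippet suggests the real kernels work on sparse data).

```python
import numpy as np


def grouped_mean_var(x, codes, n_groups, ddof=1):
    """Per-group mean and variance of each column, one pass over the rows."""
    n_cols = x.shape[1]
    count = np.zeros(n_groups, dtype=np.int64)
    mean = np.zeros((n_groups, n_cols), dtype=np.float64)
    m2 = np.zeros((n_groups, n_cols), dtype=np.float64)  # sum of squared deviations
    for row, g in zip(x, codes):
        count[g] += 1
        delta = row - mean[g]             # deviation from the old mean
        mean[g] += delta / count[g]       # updated running mean
        m2[g] += delta * (row - mean[g])  # old deviation times new deviation
    # Groups with fewer than ddof + 1 rows have no meaningful variance.
    with np.errstate(invalid="ignore", divide="ignore"):
        var = m2 / (count - ddof)[:, None]
    return mean, var


x = np.array([[1.0, 10.0], [3.0, 10.0], [2.0, 4.0], [6.0, 8.0]])
codes = np.array([0, 0, 1, 1])  # group label of each row
mean, var = grouped_mean_var(x, codes, n_groups=2)
```

The running state per group is just `(count, mean, m2)`, which is what makes it plausible to share with another moments calculation.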

Closes #
Tests included or not required because:

Release notes not necessary because:

ilan-gold · 2026-06-08T12:16:20Z

                out[cat, col] += data.data[j]


+@njit


We should make these `nogil`, or provide an option for fau to provide a `nogil` `njit`.
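A rough sketch of what such an option could look like. `make_njit` is a hypothetical helper, not an existing fau or scanpy function; only `numba.njit(nogil=...)` is a real numba call. The fallback decorator is there only so the sketch runs where numba is not installed.

```python
import numpy as np

try:
    from numba import njit
except ImportError:  # fallback so the sketch still runs without numba
    def njit(*args, **kwargs):
        # Support both the bare @njit form and the @njit(...) form.
        if len(args) == 1 and callable(args[0]) and not kwargs:
            return args[0]
        return lambda func: func


def make_njit(*, nogil=False):
    """Hypothetical factory: an njit decorator with `nogil` as an option."""
    return njit(nogil=nogil)


@make_njit(nogil=True)
def sum_by_category(data, codes, out):
    # Accumulate every row of `data` into the output row of its category.
    for i in range(data.shape[0]):
        cat = codes[i]
        for col in range(data.shape[1]):
            out[cat, col] += data[i, col]


data = np.array([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
codes = np.array([0, 1, 0])
out = np.zeros((2, 2))
sum_by_category(data, codes, out)
```

With `nogil=True`, numba releases the GIL while the compiled function runs, so other Python threads can proceed in the meantime.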

codecov · 2026-06-08T12:22:48Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 79.65%. Comparing base (62d4c97) to head (ff0ac25).
⚠️ Report is 1 commits behind head on main.
✅ All tests successful. No failed tests found.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #4147      +/-   ##
==========================================
- Coverage   79.68%   79.65%   -0.04%     
==========================================
  Files         120      120              
  Lines       12801    12795       -6     
==========================================
- Hits        10201    10192       -9     
- Misses       2600     2603       +3

Flag	Coverage Δ
hatch-test.low-vers	`78.90% <100.00%> (-0.04%)`	⬇️
hatch-test.pre	`79.52% <100.00%> (-0.03%)`	⬇️

Files with missing lines	Coverage Δ
src/scanpy/get/_aggregated.py	`92.66% <100.00%> (-0.65%)`	⬇️
src/scanpy/get/_kernels.py	`100.00% <ø> (ø)`

... and 1 file with indirect coverage changes

zboldyga

@ilan-gold I reviewed since this plays into our other work

I don't see any issues with the Welford implementation, lgtm!

Overall I'm new to njit, OMP, and TBB, but I see your point about nogil, and I was able to create some scenarios that trigger that path, so I agree with the suggestion. I also found that it slightly improves the chan njit work when fau falls back on the serial build, and I guess it would generally fix any of these types of things across scanpy, so maybe that does belong in fau?

There's probably some threading stuff I'm not fully grasping with scanpy, fau, those libraries yet though; I will be more confident in that in a few weeks. I figure I will revisit threading and numba as a whole as part of building a good understanding of these libraries, and if I find any thread issues throughout scanpy/fau I will raise them at that point.

ilan-gold · 2026-06-12T09:07:39Z

> so maybe that does belong in fau?

Yes, the issue with nogil is that it means your code is no longer threadsafe with respect to its inputs (in a certain mental model of things, I guess). So introducing this change needs some thought. I don't think anything in FAU actually alters its inputs, though, so we should be good.

> There's probably some threading stuff I'm not fully grasping with scanpy, fau, those libraries yet though; I will be more confident in that in a few weeks. I figure I will revisit threading and numba as a whole as part of building a good understanding of these libraries, and if I find any thread issues throughout scanpy/fau I will raise them at that point.

That would be amazing because it is something we have (clearly) struggled with
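To make the "does not alter its inputs" condition concrete, here is a small NumPy-only sketch. The function names are made up for illustration and nothing here is taken from fau. It marks a shared array read-only so that a kernel that writes to its input fails loudly, while read-only kernels can safely share it across threads.

```python
from concurrent.futures import ThreadPoolExecutor

import numpy as np


def column_sums(block):
    # Reads `block` only; the result is a freshly allocated array.
    return block.sum(axis=0)


def center_in_place(block):
    # Writes to its input: unsafe if another thread reads `block` concurrently.
    block -= block.mean(axis=0)


shared = np.arange(12, dtype=np.float64).reshape(4, 3)
shared.flags.writeable = False  # accidental writes now raise ValueError

# Several threads read the same input; none of them modifies it.
with ThreadPoolExecutor(max_workers=4) as pool:
    results = list(pool.map(column_sums, [shared] * 4))

try:
    center_in_place(shared)
    mutated = True
except ValueError:
    mutated = False  # the in-place kernel was rejected
```

Whether a check like this is practical for compiled kernels is a separate question; the sketch only illustrates the property being discussed.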

flying-sheep

nice! some questions.

scverse-benchmark · 2026-06-18T12:21:07Z

Benchmark changes

Change	Before [`39f1241`]	After [`25e6bfc`]	Ratio	Benchmark (Parameter)
+	131±1ms	182±2ms	1.39	preprocessing_counts.Agg.time_agg('var', False)
+	49.0±2ms	70.2±4ms	1.43	preprocessing_counts.Agg.time_agg('var', True)

Comparison: https://github.com/scverse/scanpy/compare/39f12414fea9cea9439a3d9f665d1e17636092a9..25e6bfcdbb14be0ca79a555e0939fe81dfe34bdb
Last changed: Mon, 22 Jun 2026 12:58:40 +0000

More details: https://github.com/scverse/scanpy/pull/4147/checks?check_run_id=82708559802

flying-sheep

Was bd85e03 supposed to be a response to the benchmark results? because it didn’t change them.

Looks great, but maybe the test can be made slightly more clear.
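The test under review is not shown on this page. As an illustration only, and assuming the "cancelling test" in the commit list guards against catastrophic cancellation, a check of that kind could compare a one-pass sum-of-squares formula and a Welford update against a reference on data with a large mean and small spread. All names below are hypothetical.

```python
import statistics


def naive_variance(values):
    # Textbook one-pass formula: (sum(x^2) - sum(x)^2 / n) / (n - 1).
    n = len(values)
    total = 0.0
    total_sq = 0.0
    for v in values:
        total += v
        total_sq += v * v
    return (total_sq - total * total / n) / (n - 1)


def welford_variance(values):
    # Running mean plus running sum of squared deviations (m2).
    mean = 0.0
    m2 = 0.0
    for n, v in enumerate(values, start=1):
        delta = v - mean
        mean += delta / n
        m2 += delta * (v - mean)
    return m2 / (len(values) - 1)


# Large offset, small spread: the true sample variance is 30.
values = [1e9 + 4, 1e9 + 7, 1e9 + 13, 1e9 + 16]
reference = statistics.variance(values)
naive = naive_variance(values)
welford = welford_variance(values)
```

On this input the naive formula subtracts two numbers near 4e18 and loses the answer to rounding, while the Welford update stays close to the reference.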

ilan-gold · 2026-06-22T10:14:14Z

> Was bd85e03 supposed to be a response to the benchmark results? because it didn’t change them.

It was certainly an attempt! The number did go down a bit, but not much

perf: welford's algorithm for mean-var

21f5ddc

ilan-gold added this to the 1.12.2 milestone Jun 8, 2026

ilan-gold changed the title ~~perf: welford's algorithm for mean-var~~ perf: welford's algorithm for mean-var aggregation Jun 8, 2026

chore: relnote

514bd17

ilan-gold commented Jun 8, 2026

ilan-gold requested a review from flying-sheep June 8, 2026 12:24

Merge branch 'main' into ig/welford

48230af

zboldyga reviewed Jun 12, 2026

ilan-gold added 2 commits June 12, 2026 11:07

Merge branch 'main' into ig/welford

afc24a1

Merge branch 'main' into ig/welford

35b0ff6

flying-sheep reviewed Jun 17, 2026

Comment thread docs/release-notes/4147.perf.md

Comment thread src/scanpy/get/_aggregated.py Outdated

Comment thread src/scanpy/get/_kernels.py

ilan-gold added 5 commits June 17, 2026 14:33

chore: add cancelling test

05daadb

chore: finish sentence

e437714

chore: relnote

ec72679

chore: add context

9dcfcc7

Merge branch 'main' into ig/welford

3fa0884

ilan-gold requested a review from flying-sheep June 17, 2026 13:23

ilan-gold added the benchmark label Jun 18, 2026

perf: less memory touches

bd85e03

flying-sheep approved these changes Jun 19, 2026

Comment thread tests/test_aggregated.py Outdated

Comment thread tests/test_aggregated.py Outdated

Comment thread tests/test_aggregated.py

refactor: cleanup

497809e

ilan-gold removed the benchmark label Jun 22, 2026

ilan-gold enabled auto-merge (squash) June 22, 2026 11:35

chore: csc benchmarks

25e6bfc

ilan-gold added benchmark and removed benchmark labels Jun 22, 2026

ilan-gold added 6 commits June 23, 2026 12:05

Merge branch 'main' into ig/welford

1a19312

Merge branch 'main' into ig/welford

81ae72b

fix: tests

9004cc0

Merge branch 'ig/welford' of github.com:scverse/scanpy into ig/welford

cc5ac95

Merge branch 'main' into ig/welford

eb03735

chore: spelling

ff0ac25

ilan-gold merged commit 6afdc6e into main Jun 23, 2026
14 checks passed

ilan-gold deleted the ig/welford branch June 23, 2026 13:08

meeseeksmachine mentioned this pull request Jun 23, 2026

Backport PR #4147 on branch 1.12.x (perf: welford's algorithm for mean-var aggregation) #4176

Merged

ilan-gold added a commit that referenced this pull request Jun 23, 2026

Backport PR #4147 on branch 1.12.x (perf: welford's algorithm for mea…

d364f06

…n-var aggregation) (#4176) Co-authored-by: Ilan Gold <ilanbassgold@gmail.com>

ilan-gold merged 19 commits into main from ig/welford

3 participants
