perf: Optimise `map_rows` performance for `Object` dtype #25702
alexander-beedie wants to merge 1 commit into pola-rs:main
e0ac212 to 18e87bf
Codecov Report
❌ Patch coverage is
Additional details and impacted files
@@            Coverage Diff             @@
##             main   #25702      +/-   ##
==========================================
- Coverage   80.55%   80.54%   -0.01%
==========================================
  Files        1756     1757       +1
  Lines      241908   242143     +235
  Branches     3040     3040
==========================================
+ Hits       194869   195036     +167
- Misses      46256    46325      +69
+ Partials      783      782       -1
I literally just removed all the code that special-cases all this stuff because it all wasn't being updated and was buggy :( Can we not make the generic path faster by profiling/optimizing?
Not sure how I'm supposed to know that code I can't see used to be there 🤷
I'll take a look tomorrow.
You're not, I'm not blaming you :)
Ref: #25688.
Added a dedicated `Object` path in `map_rows` that gets us a 50-60% speedup (on the given test-case). I think I can see a few opportunities to get a smaller speed-up for scalar/primitive types too, but will leave that for a separate PR.

Benchmark

Timings[^1] 🚀

Footnotes

[^1]: Tested using `make build-dist-release` on an Apple Silicon M4 Max
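
For anyone wanting to measure this locally, here is a minimal sketch of how `map_rows` over an `Object` column might be timed. It is not the benchmark from this PR: the `Point` class, the `norm_of_row` function, the helper name `bench_map_rows`, and the row count are all hypothetical, chosen only for illustration. It assumes a polars build where `DataFrame.map_rows` accepts `Object` columns and passes each row to the callback as a tuple.

```python
import math
import time
from dataclasses import dataclass


@dataclass
class Point:
    """Hypothetical Python object stored in an Object-dtype column."""

    x: float
    y: float


def norm_of_row(row: tuple) -> float:
    """Row callback: map_rows hands each row over as a tuple of column values."""
    (pt,) = row  # the frame below has a single Object column
    return math.hypot(pt.x, pt.y)


def bench_map_rows(n_rows: int = 100_000, repeats: int = 5) -> float:
    """Return the best wall-clock time (seconds) for map_rows on an Object column."""
    # Imported here so the helpers above stay usable without polars installed.
    import polars as pl

    points = [Point(float(i), float(i + 1)) for i in range(n_rows)]
    df = pl.DataFrame({"pt": pl.Series("pt", points, dtype=pl.Object)})

    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        df.map_rows(norm_of_row, return_dtype=pl.Float64)
        best = min(best, time.perf_counter() - start)
    return best
```

Calling `bench_map_rows()` on a release build before and after the change would give a rough comparison. The 50-60% figure quoted above comes from the PR's own test-case, not from this sketch.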