File tree Expand file tree Collapse file tree 1 file changed +4
-4
lines changed
UTILS/dfextensions/groupby_regression/docs Expand file tree Collapse file tree 1 file changed +4
-4
lines changed Original file line number Diff line number Diff line change @@ -192,7 +192,7 @@ df_out, df_group = make_parallel_fit_v4(
192192
193193Based on benchmarks on Apple M2 Max (12 cores):
194194
195- ![ Throughput by Engine] ( benchmarks/bench_out/throughput_by_engine.png )
195+ ![ Throughput by Engine] ( ../ benchmarks/bench_out/throughput_by_engine.png)
196196
197197** Key insights:**
198198- v4 is ** 76-194× faster** than robust on typical scenarios
@@ -415,14 +415,14 @@ python benchmarks/plot_groupby_regression_optimized.py
415415
416416# ## Scaling Analysis
417417
418- 
418+ 
419419
420420** Key findings:**
421421- v2/ v3: ~ 2. 5k - 15k groups/ s (stable across scales)
422422- v4: ~ 0. 5M - 1. 8M groups/ s (scales excellently)
423423- v4 performance dip at 5k groups (investigation ongoing)
424424
425- 
425+ 
426426
427427** Parallel scaling:**
428428- v2/ v3: Plateau at ~ 8 - 16 cores (process/ thread overhead)
@@ -449,7 +449,7 @@ See your benchmark results for machine-specific scaling behavior.
449449
450450# ## Speedup Summary
451451
452- 
452+ 
453453
454454** Typical speedups (v4 vs robust):**
455455- Small groups (5 rows): 76 ×
You can’t perform that action at this time.
0 commit comments