Skip to content

docs(speed-bench): add M3 Ultra 60-core Q2 and Q4 benchmarks#324

Open
unsaltedbutter-ai wants to merge 1 commit into
antirez:mainfrom
unsaltedbutter-ai:bench/m3-ultra-60core
Open

docs(speed-bench): add M3 Ultra 60-core Q2 and Q4 benchmarks#324
unsaltedbutter-ai wants to merge 1 commit into
antirez:mainfrom
unsaltedbutter-ai:bench/m3-ultra-60core

Conversation

@unsaltedbutter-ai
Copy link
Copy Markdown
Contributor

ds4-bench sweeps (ctx 2048 to 65536, 128 gen tokens) on a Mac Studio M3 Ultra with the 60-core GPU and 256 GB RAM, captured at commit ba00a8a. Q4 prefill matches Q2 within about 1 percent across the range, and generation runs about 4 percent slower. Includes generated SVG plots.

ds4-bench sweeps (ctx 2048 to 65536, 128 gen tokens) on a Mac Studio
M3 Ultra with the 60-core GPU and 256 GB RAM, captured at commit ba00a8a.
Q4 prefill matches Q2 within about 1 percent across the range, and
generation runs about 4 percent slower. Includes generated SVG plots.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant