via localmaxxing
Localmaxxing hosts community-submitted benchmarks for local LLM inference.
Users compare tok/s, TTFT, and hardware efficiency across GPUs, Apple Silicon, and CPU setups. The site tracks speed and helps identify optimal hardware configurations.