New · Blackwell PRO 6000 addedv0.4.1
The GPU index
for local inference
Independent benchmarks across consumer cards, workstation Blackwell, Apple Silicon, and DGX Spark. Find the cheapest hardware that fits the model you actually want to run.
7
GPUs tracked
5
Models profiled
105
Benchmarks run
2h ago
Last update
01 // Editorial picks · April 2026
The shortlist
Best overall#01
RTX PRO 6000 Blackwell
Blackwell · NVIDIA
VRAM
96 GB
Tok/s
142
Bandwidth
1792 GB/s
Price
$8,499
96GB. Period.
Best consumer#02
GeForce RTX 5090
Blackwell · NVIDIA
VRAM
32 GB
Tok/s
138
Bandwidth
1792 GB/s
Price
$1,999
32GB at $2k
Best value#04
GeForce RTX 3090
Ampere · NVIDIA
VRAM
24 GB
Tok/s
64
Bandwidth
936 GB/s
Price
$749
24GB used <$800
Best for 200B+#06
Apple M3 Ultra
M3 Ultra · Apple
VRAM
512 GB
Tok/s
72
Bandwidth
819 GB/s
Price
$9,499
512GB unified
03 // Will it fit?
Pick a model.
See what runs it.
Hardware is wasted if it can't load the weights you care about. Start with the model — we'll tell you the cheapest GPU that fits.
Model
Quantization
Estimated VRAM required
78 GB
Compatible GPUs
4 / 7
04 // Field notes
From the lab
Benchmark2026-04-22
Running Qwen3 235B on a single Mac Studio
We pushed Apple's M3 Ultra with 512GB unified memory to its limits. Here's what 22 tok/s of dense inference actually feels like.
M. Chen12 min read
Comparison2026-04-14
RTX PRO 6000 Blackwell vs H100: which one for your home lab?
96GB at $8.5k vs. 80GB at $30k. We profiled both on Qwen3 72B Q8.
S. Kapoor9 min read
Buying Guide2026-04-08
The 2026 used RTX 3090 buyer's guide
Mining cards, OEM pulls, dual-fan vs blower — what to look for and what to avoid in today's market.
J. Voss14 min read