Notes from the lab
Long-form benchmarks, buying guides, and write-ups on local AI hardware. Updated weekly.
Running Qwen3 235B on a single Mac Studio
We pushed Apple's M3 Ultra with 512GB unified memory to its limits. Here's what 22 tok/s of dense inference actually feels like.
Best GPUs for Running AI Models Locally in 2026: Ranked by tok/s per Dollar
We benchmarked 7 GPUs from $749 to $9,499 on Qwen3 32B with llama.cpp. The RTX 3090 at $749 used delivers the best value. The RTX 5090 at $1,999 is the best overall. Here is every data point.
RTX PRO 6000 Blackwell vs H100: Which One for Your Home Lab? (2026)
96GB at $8.5k vs 80GB at $30k. We profiled both on Qwen3 72B Q8 with llama.cpp. The RTX PRO 6000 wins on value. The H100 wins on throughput. Here is every benchmark.
The 2026 Used RTX 3090 Buyer's Guide: Mining Cards, OEM Pulls & What to Avoid
The RTX 3090 remains the best $/VRAM GPU for local AI in 2026. 24GB for under $800. Here is exactly what to look for, what to avoid, and where to buy.
DGX Spark, three months in
128GB of unified memory in a 1.2kg desktop. Worth $4k? Depends on what you're optimizing for.
FP8 vs Q4: how much quality are you actually losing?
Perplexity isn't the whole story. We ran human evals across 6 quantization schemes.
Cooling Blackwell: the case for water
600W in a triple-slot air card means 600W in your office. Here's the AIO data.
Get the index, delivered Mondays.
New benchmarks, price drops, and one well-tested buying recommendation. No spam.