Evaluating Custom Inference Hardware
Groq, Cerebras, and other custom silicon promise 10x speed. Here's how to evaluate them without getting burned.
4 posts tagged with "hardware"
H100 spot at $0.15/1M tokens. A100 on-demand at $0.40/1M. API at $1.00/1M. Here's the full comparison.
An H100 costs roughly 2x an A100 but delivers roughly 2x the memory bandwidth. For decode-bound inference, that math matters.
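A minimal sketch of the math behind that excerpt: for decode-bound inference, tokens/sec scales roughly with memory bandwidth, so cost per token is price divided by throughput. All figures below are illustrative assumptions, not vendor benchmarks; the function name and numbers are hypothetical.

```python
def cost_per_mtok(hourly_price, tokens_per_sec):
    """Dollars per 1M tokens for a GPU running at full decode utilization."""
    tokens_per_hour = tokens_per_sec * 3600
    return hourly_price / (tokens_per_hour / 1e6)

# Illustrative figures only: assume the H100 costs 2x the A100's hourly
# price and, being decode-bound, sustains ~2x the aggregate tokens/sec
# thanks to ~2x the memory bandwidth.
a100_price, a100_tps = 2.00, 3200   # $/hr, tokens/sec across a batch
h100_price, h100_tps = 4.00, 6400   # 2x price, ~2x bandwidth -> ~2x tokens/sec

print(cost_per_mtok(a100_price, a100_tps))  # same cost per 1M tokens...
print(cost_per_mtok(h100_price, h100_tps))  # ...but each token arrives in half the time
```

Under these assumptions the two GPUs tie on cost per token, so the H100's edge is latency (and batch headroom), not unit economics. The ratios, not the absolute numbers, drive the conclusion.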
GPUs dominate LLM inference. TPUs offer interesting economics. Here's how to think about the choice.