7 posts tagged with "economics"

The Real Cost: Fine-tuning vs Prompting
Prompting has high per-call cost but zero upfront investment. Fine-tuning has low per-call cost but significant upfront investment. The crossover point matters.
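The crossover framing reduces to one division: upfront cost over per-call savings. A minimal sketch, with all dollar figures assumed for illustration (they are not from the post):

```python
def breakeven_calls(upfront_cost: float,
                    prompt_cost_per_call: float,
                    finetuned_cost_per_call: float) -> float:
    """Number of calls at which fine-tuning's upfront investment is
    recovered by its cheaper per-call price."""
    savings_per_call = prompt_cost_per_call - finetuned_cost_per_call
    if savings_per_call <= 0:
        return float("inf")  # fine-tuning never pays off
    return upfront_cost / savings_per_call

# Assumed example: $5,000 fine-tuning run, $0.02/call prompted
# vs $0.004/call fine-tuned.
calls = breakeven_calls(5_000, 0.02, 0.004)
print(f"break-even at ~{calls:,.0f} calls")
```

Below that call volume, prompting wins; above it, the fine-tune pays for itself.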
H100 spot at $0.15/1M tokens. A100 on-demand at $0.40/1M. API at $1.00/1M. Here's the full comparison.
GPU cost is just the beginning. Egress, logging, on-call—add 40% to your compute estimate for the real number.
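That rule of thumb is a one-line multiplier. A sketch using the post's 40% figure; the example compute bill is an assumption:

```python
OVERHEAD_FACTOR = 0.40  # egress + logging + on-call, per the rule of thumb

def true_monthly_cost(compute_cost: float) -> float:
    """Compute estimate adjusted to the all-in number."""
    return compute_cost * (1 + OVERHEAD_FACTOR)

# Assumed example: $10k/month of GPU time lands near $14k all-in.
print(true_monthly_cost(10_000))
```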
$4/hour vs $10/hour sounds great. But conversion cost, ecosystem limitations, and operational overhead change the math.
When does self-hosting break even? Here's the formula, the variables, and the 6-month reality check most teams skip.
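One common version of the break-even framing, sketched with assumed variable names and figures (not the post's actual formula):

```python
def months_to_breakeven(setup_cost: float,
                        monthly_self_host_cost: float,
                        monthly_api_cost: float) -> float:
    """Months until self-hosting's setup cost is repaid by the gap
    between the API bill and the self-hosted bill."""
    monthly_savings = monthly_api_cost - monthly_self_host_cost
    if monthly_savings <= 0:
        return float("inf")  # the API is already cheaper; don't self-host
    return setup_cost / monthly_savings

# Assumed example: $30k setup, $8k/month self-hosted vs $15k/month API.
print(months_to_breakeven(30_000, 8_000, 15_000))  # a bit over 4 months
```

The 6-month reality check is then just: is that number comfortably under six, after the 40% overhead adjustment?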
Everyone wants to self-host LLMs to save money. Most shouldn't. Here's the math on when it actually makes sense.
Input tokens are cheap. Output tokens are expensive. The physics of transformer inference explains why, and what you can do about it.
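The billing asymmetry shows up immediately in a per-request cost calc. Prices here are assumptions for illustration; the point is the input/output gap, which is several-fold on most APIs:

```python
PRICE_IN = 1.00   # $ per 1M input tokens (assumed)
PRICE_OUT = 4.00  # $ per 1M output tokens (assumed)

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of one API call at the assumed per-token prices."""
    return (input_tokens / 1e6) * PRICE_IN + (output_tokens / 1e6) * PRICE_OUT

# Same total tokens, opposite shapes:
summarize = request_cost(50_000, 500)   # big prompt, short answer
generate = request_cost(500, 50_000)    # short prompt, long answer
print(f"summarize: ${summarize:.4f}  generate: ${generate:.4f}")
```

The generation-heavy call costs several times more than the summarization call despite identical total token counts, which is why trimming output length pays off far faster than trimming prompts.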