All Tags

#context

5 posts tagged with "context"

Why Tokens at Position 50K Get Ignored

Attention scores decay with distance. By position 50K, tokens may have near-zero influence. Positional encodings have practical limits, regardless of window size.

Getting 95% Quality at 12% Cost

Most queries don't need the full context. Selecting the right 12% often preserves 95% of quality at a fraction of the cost and latency.