All Tags

#precision

2 posts tagged with "precision"

When to Use FP8 for Inference

H100's FP8 gives near-FP16 quality at near-INT8 speed. It's becoming the new default. Here's when and how to use it.