All Tags

#inference

3 posts tagged with "inference"

When to Use FP8 for Inference

H100's FP8 gives near-FP16 quality at near-INT8 speed. It's becoming the new default. Here's when and how to use it.