Skip to content

[Common/PyTorch] Grouped-quantize kernels for 1D and 2D FP8 block-scaling#3135

Open
denera wants to merge 5 commits into
NVIDIA:mainfrom
denera:common/fp8-block-scaling-grouped-quantize
Open

[Common/PyTorch] Grouped-quantize kernels for 1D and 2D FP8 block-scaling#3135
denera wants to merge 5 commits into
NVIDIA:mainfrom
denera:common/fp8-block-scaling-grouped-quantize

Move GEMM-swizzled scale helpers out of mxfp8 namespace

f76cf77
Select commit
Loading
Failed to load commit list.