fp8 scaled discussion
Great work!
I wanted to enquire about how you do fp8 scaled conversion and if you know fp8 matmul as well.
thank you
I currently use my own WIP module. Not everything works though.
ofc I know about fp8 matmul.
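For context, here is a minimal sketch of what per-tensor fp8 scaled conversion typically looks like in PyTorch. This illustrates the general technique only, not the author's WIP module; the helper names to_fp8_scaled and fp8_linear_reference are made up for the example, and it assumes a PyTorch build with float8 dtypes.

```python
# Sketch of per-tensor fp8 "scaled" conversion (illustrative, not the WIP module).
import torch

FP8_DTYPE = torch.float8_e4m3fn
FP8_MAX = torch.finfo(FP8_DTYPE).max  # about 448 for e4m3fn

def to_fp8_scaled(weight: torch.Tensor):
    # One scale per tensor: chosen so the largest |value| maps to the fp8 max.
    amax = weight.abs().max().clamp(min=1e-12)
    scale = (amax / FP8_MAX).float()  # dequant factor: fp8_value * scale ~ original
    fp8_weight = (weight / scale).clamp(-FP8_MAX, FP8_MAX).to(FP8_DTYPE)
    return fp8_weight, scale

def fp8_linear_reference(x: torch.Tensor, fp8_weight: torch.Tensor, scale: torch.Tensor):
    # Reference path: dequantize back to the activation dtype and matmul.
    # A real fp8 matmul would instead pass the fp8 tensors and scales to an
    # fp8 GEMM kernel (e.g. torch._scaled_mm on Ada/Hopper GPUs).
    w = fp8_weight.to(x.dtype) * scale.to(x.dtype)
    return x @ w.t()

# Usage
w = torch.randn(128, 64, dtype=torch.bfloat16)
x = torch.randn(4, 64, dtype=torch.bfloat16)
w_fp8, s = to_fp8_scaled(w)
y = fp8_linear_reference(x, w_fp8, s)
```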
Does it make sense to download the int8 version if I have an RTX 4090, and will I get any performance boost?
No, it makes no sense at all haha. You have a 4000-series card. You want fp8, and if you want to ensure fast generation speed, use the --fast fp8_matmul launch argument in ComfyUI.
There's no fp8_matmul flag in ComfyUI.
it's --fast fp8_matrix_mult
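(For reference: assuming the standard main.py entry point, that would be a launch like python main.py --fast fp8_matrix_mult.)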