
Help #30

Open
ganjuzhizihuai opened this issue Jul 8, 2024 · 1 comment
Comments

@ganjuzhizihuai

Hello, when I run the code and print the parameter information of the quantized model, why is the parameter dtype still float32 after the quantization layers have been swapped in?

Quantized Layer: layer3.2.conv1
Weight dtype: torch.float32
Weight range: -0.37524235248565674 to 0.42818304896354675
Quant scale: Parameter containing:

@zhutmost
Owner

Sorry for the delayed response. My code stores the raw floating-point weights instead of the quantized weights, so the parameter dtype remains float32. You can recover the quantized values by dividing the saved floating-point weights by the saved scale s and then rounding.
Alternatively, you can modify the code to also save the quantized weights.
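
For reference, a minimal sketch of that recovery step. The clamp bounds and the state-dict key names below are assumptions (not taken from this repo); check them against the quantizer configuration you actually trained with:

```python
import torch

def recover_int_weights(weight: torch.Tensor, scale: torch.Tensor,
                        num_bits: int = 8, symmetric: bool = True) -> torch.Tensor:
    """Recover integer weight codes from stored float weights and the learned scale.

    Assumes an LSQ-style scheme: w_q = clamp(round(w / s), Qn, Qp).
    The (Qn, Qp) bounds here are an assumption; match them to your config.
    """
    if symmetric:
        qn, qp = -(2 ** (num_bits - 1)), 2 ** (num_bits - 1) - 1
    else:
        qn, qp = 0, 2 ** num_bits - 1
    return torch.clamp(torch.round(weight / scale), qn, qp).to(torch.int32)

# Example usage on a saved checkpoint (key names are hypothetical):
# state = torch.load("checkpoint.pth", map_location="cpu")
# w = state["layer3.2.conv1.weight"]       # raw float32 weights
# s = state["layer3.2.conv1.quan_w_fn.s"]  # learned quantization scale
# w_int = recover_int_weights(w, s, num_bits=8)
```

Multiplying `w_int` back by `s` should give a dequantized tensor close to the stored float weights, which is a quick sanity check that the scale and bounds you used are the right ones.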
