What is the purpose of weight quantization and activation quantization? #3176
CoinCheung asked this question in Q&A (unanswered, 0 replies)
Hi,

I could not find enough documentation on weight/activation quantization. I can think of two possible purposes. The first is to speed up training with 8-bit or 4-bit computation, for which the weights or activations must be quantized. The second is quantization-aware training (QAT), in which we want the model to adapt during training to the precision it will later be quantized to. Could you tell me which of these is the purpose of weight/activation quantization in DeepSpeed?
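For concreteness, here is a minimal sketch of the difference between the two usages, written in plain PyTorch. This is not DeepSpeed's API; every function name and detail below is illustrative only (a common way these two ideas are implemented, not necessarily how DeepSpeed does it).

```python
# Illustrative sketch only -- not DeepSpeed code. Shows the contrast between
# QAT-style "fake" quantization and actual low-precision computation.
import torch

def fake_quantize(x: torch.Tensor, num_bits: int = 8) -> torch.Tensor:
    """QAT-style quantization: snap values to a low-precision grid but keep
    the tensor in fp32, so training adapts to the quantization error."""
    qmax = 2 ** (num_bits - 1) - 1
    scale = x.abs().max().clamp(min=1e-8) / qmax
    q = (x / scale).round().clamp(-qmax - 1, qmax)
    # Straight-through estimator: the forward pass sees quantized values,
    # while gradients flow through as if no rounding had happened.
    return x + (q * scale - x).detach()

def int8_matmul(w: torch.Tensor, x: torch.Tensor) -> torch.Tensor:
    """Speed-oriented quantization: actually compute in int8 and rescale.
    Real kernels would call hardware int8 GEMMs; this only shows the idea."""
    w_scale = w.abs().max().clamp(min=1e-8) / 127
    x_scale = x.abs().max().clamp(min=1e-8) / 127
    w_q = (w / w_scale).round().clamp(-128, 127).to(torch.int8)
    x_q = (x / x_scale).round().clamp(-128, 127).to(torch.int8)
    # Accumulate the integer product in int32, then dequantize the result.
    out = (w_q.to(torch.int32) @ x_q.to(torch.int32)).to(torch.float32)
    return out * (w_scale * x_scale)
```

In the first function the arithmetic stays in fp32 (no speedup; the goal is robustness to quantization), while in the second the matmul itself runs on integer values (the goal is throughput and memory savings).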