Conversation
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
for more information, see https://pre-commit.ci
yiliu30
left a comment
There was a problem hiding this comment.
Overall LGTM, left a few comments.
|
RTN:
Tuning:
|
Thanks for the data, add mmlu and mmlu pro please |
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
for more information, see https://pre-commit.ci
|
ut depends on #1525 |
|
I will create another PR to optimize the quant function for rtn/opt_rtn/tuning @wenhuach21 |
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
for more information, see https://pre-commit.ci
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
for more information, see https://pre-commit.ci
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
Description
Support block-wise fp8 quant
#959
Type of Change