Skip to content

Comments

Int8 + microscaling support for kv cache formats.#841

Open
copybara-service[bot] wants to merge 1 commit intodevfrom
test_874047973
Open

Int8 + microscaling support for kv cache formats.#841
copybara-service[bot] wants to merge 1 commit intodevfrom
test_874047973

Conversation

@copybara-service
Copy link

Int8 + microscaling support for kv cache formats.
Right now multiplication is done by converting to corresponding float format.
Can yield up to 2x improvements for membw constrained shapes

Right now multiplication is done by converting to corresponding float format.
Can yield up to 2x improvements for membw constrained shapes

PiperOrigin-RevId: 874047973
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

0 participants