Is there a way to make it use an int8 matmul? Does it currently compute in bf16 even when the model is in int8? Thank you so much.