torch transformers accelerate sentencepiece bitsandbytes auto-gptq