torch transformers numpy pandas tokenizers sentencepiece rdkit altair<5 scikit-learn