asjc-classification/scibert_multilabel_asjc_classifier
Text Classification
•
0.1B
•
Updated
•
115
Natural Language Processing, Scientometrics
We present the first multi-label classification model built on the ASJC taxonomy that reliably assigns subject categories to individual documents—including those published in general-science or interdisciplinary journals—using Title, Container Title, and Abstract metadata.
Traditional ASJC classification approaches are limited by incomplete sources, journal-level labels, or single-label assignments. This project provides:
If you use this work, please cite:
@article{Gusenbauer.2025,
author = {Gusenbauer, Michael and Endermann, Jochen and Huber, Harald and Strasser, Simon and Granitzer, Andreas-Nizar and Ströhle, Thomas},
year = {2025},
title = {Fine-tuning SciBERT to enable ASJC-based assessments of the disciplinary orientation of research collections},
keywords = {All Science Journal Classification;Disciplinary coverage;Fine-tuning;multi-label classification;SciBERT;Transformer-based language models},
issn = {0138-9130},
journal = {Scientometrics},
doi = {10.1007/s11192-025-05490-0},
}