ASR Models
Collection
Automatic Speech Recognition models • 2 items • Updated
This model is a finetuned version of Facebook's Wav2Vec2 XLS-R 300M for Galician on the datasets Common Voice Corpus 17.0, Open SLR77, FalAI, Fleurs and Nos_ParlaSpeech-GL.
This model has been tested in the test splits of the Galician OpenSLR dataset, the Galician Common Voice 17.0 dataset, the FalAI dataset, the Galician FLEURS dataset and Nos_Parlaspeech-GL. The results are shown in the following tables:
| Corpus | WER | CER | RTF |
|---|---|---|---|
| Common Voice 17.0 | 7.85 | 1.66 | 0.0085 |
| Open SLR77 | 12.04 | 3.82 | 0.0087 |
| FalAI | 4.39 | 1.17 | 0.0260 |
| FLEURS | 15.83 | 5.08 | 0.0091 |
| Nos_Parlaspeech-GL | 8.92 | 2.65 | 0.0114 |
If you use this model, please cite as follows:
Moscoso Sánchez, Antonio; Magariños, Carmen; Castedo, Carla. 2025. Nos_ASR-wav2vec2-xls-r-300m-gl. URL: https://huggingface.co/proxectonos/Nos_ASR-wav2vec2-xls-r-300m-gl
Base model
facebook/wav2vec2-xls-r-300m