Model Card for Model ID

This is a fine-tuned version of the OpenAI Whisper-small model for ASR in Russian on the Golos Farfield dataset.

Model Details

  • Base Model: openai/whisper-small
  • Fine-tuning dataset: Golos Farfield
  • Language: Russian
  • Training Regime: The model was fine-tuned for 600 steps with a learning rate of 1e-5.

How to Get Started with the Model

You can use this model directly with the Hugging Face pipeline API:

from transformers import pipeline

pipe = pipeline("automatic-speech-recognition", model="feelmadrain/whisper-small-ru")
result = pipe("your_audio.wav")
print(result["text"])

Results

The model achieves a Word Error Rate (WER) of ~ 16.7 % on the validation data and ~ 20.1 % on the test data

{'test_loss': 0.2500078082084656, 'test_wer_ortho': 20.090670484371273, 'test_wer': 20.078740157480315, 'test_runtime': 606.858, 'test_samples_per_second': 3.156, 'test_steps_per_second': 0.789}

Downloads last month
3
Safetensors
Model size
0.2B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for feelmadrain/whisper-small-ru

Finetuned
(3207)
this model

Dataset used to train feelmadrain/whisper-small-ru