mozilla-foundation/common_voice_17_0
Updated • 6.15k • 18
How to use xbilek25/whisper-medium-en-cv-4.2 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="xbilek25/whisper-medium-en-cv-4.2") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("xbilek25/whisper-medium-en-cv-4.2")
model = AutoModelForSpeechSeq2Seq.from_pretrained("xbilek25/whisper-medium-en-cv-4.2")This model is a fine-tuned version of openai/whisper-medium.en on the Common Voice 17.0 dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.2332 | 0.1667 | 2250 | 0.4139 | 12.7057 |
| 0.0826 | 1.1667 | 4500 | 0.4543 | 14.2596 |
| 0.0267 | 2.1667 | 6750 | 0.4961 | 14.5338 |
| 0.0066 | 3.1667 | 9000 | 0.5053 | 14.6252 |
| 0.0019 | 4.1667 | 11250 | 0.5349 | 13.9854 |
| 0.0011 | 5.1667 | 13500 | 0.5540 | 13.3455 |
Base model
openai/whisper-medium.en