DTU54DL/common-accent
Viewer • Updated • 10.5k • 213 • 10
How to use Scav6411/speecht5_tts_voxpopuli_nl with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-to-audio", model="Scav6411/speecht5_tts_voxpopuli_nl") # Load model directly
from transformers import AutoProcessor, AutoModelForTextToSpectrogram
processor = AutoProcessor.from_pretrained("Scav6411/speecht5_tts_voxpopuli_nl")
model = AutoModelForTextToSpectrogram.from_pretrained("Scav6411/speecht5_tts_voxpopuli_nl")This model is a fine-tuned version of microsoft/speecht5_tts on the DTU54DL/common-accent dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.5484 | 5.6180 | 1000 | 0.5067 |
| 0.5304 | 11.2360 | 2000 | 0.4926 |
| 0.5346 | 16.8539 | 3000 | 0.4876 |
| 0.5235 | 22.4719 | 4000 | 0.4884 |
Base model
microsoft/speecht5_tts