-
Notifications
You must be signed in to change notification settings - Fork 605
Closed
Description
I'm trying to fine-tune the LibriTTS checkpoint on ~1 hour of LJSpeech but get poor results. Could you please give me some directions or help to spot the issue?
How I fine-tuned:
- Pulled the latest changes from the repo
- Replaced
Data/train_list.txt
with a copy that only has the first 1000 lines (~1 hour for training) - Changed batch_size to 4 and max_len to 100, otherwise it doesn't fit into the memory of my 4090 (24GB).
- After training it for 50-100 epochs, I tested new checkpoints with both
Inference_LibriTTS.ipynb
andInference_LJSpeech.ipynb
notebooks by changing themultispeaker
parameter in the config to true/false. Inference_LJSpeech.ipynb
produces very noisy results with a poor pronunciation.Inference_LibriTTS.ipynb
with reference audio from LJSpeech has a good pronunciation, but there are noticeable noises (example - https://voca.ro/1nQ8Ltjhsh9y)
Thank you again for the awesome project!
Metadata
Metadata
Assignees
Labels
No labels