Skip to content

【PaddleSpeech No.13、14】补全合成系列中的脚本中参数缺失 #4013

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 5 commits into from
Mar 31, 2025

Conversation

Echo-Nie
Copy link
Contributor

PR types
Function optimization, Docs

PR changes
Docs, Others

Describe
本次修改主要包含:

修改examples/vctk/tts3和examples/vctk/ernie_sat

脚本优化: 为 run.sh 中的合成阶段添加 --stage 参数,支持通过0/1选择 PWGAN/HiFiGAN 作为声码器;在 synthesize.sh 和 synthesize_e2e.sh 中实现多声码器切换逻辑

文档完善: 在 README.md 中补充 stage 参数说明,明确 vocoder 选择逻辑。

Issue链接:#3997

@luotao1 @zxcd

Copy link

paddle-bot bot commented Mar 16, 2025

Thanks for your contribution!

@luotao1 luotao1 changed the title 【Doc】补全合成系列中的脚本中参数缺失No.13-14 【PaddleSpeech No.13、14】补全合成系列中的脚本中参数缺失 Mar 17, 2025
@luotao1 luotao1 added the HappyOpenSource 快乐开源活动issue与PR label Mar 17, 2025
@Echo-Nie
Copy link
Contributor Author

@zxcd pls review

@@ -27,10 +27,11 @@ if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
fi

if [ ${stage} -le 2 ] && [ ${stop_stage} -ge 2 ]; then
# synthesize, vocoder is pwgan
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
# synthesize, vocoder is hifigan by default stage 0
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Only stage 0 support does not need to be written.

fi

if [ ${stage} -le 3 ] && [ ${stop_stage} -ge 3 ]; then
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize_e2e.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
# synthesize, vocoder is pwgan by default stage 0, stage 1 will use hifigan as vocoder
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

stage control task_name, not vocoder. Follow code examples/vctk/ernie_sat/local/synthesize_e2e.sh

```
`--stage` controls the vocoder model during synthesis, which can be `0` or `1`, use `pwgan` or `hifigan` model as vocoder.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

also add this line in synthesize_e2e.sh in below

@@ -27,12 +27,12 @@ if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
fi

if [ ${stage} -le 2 ] && [ ${stop_stage} -ge 2 ]; then
# synthesize, vocoder is pwgan by default
# synthesize, vocoder is pwgan by default stage 0, stage 1 will use hifigan as vocoder
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

miss --stage 0 ?

Copy link
Collaborator

@zxcd zxcd left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@zxcd zxcd merged commit 1f377a4 into PaddlePaddle:develop Mar 31, 2025
5 checks passed
@Echo-Nie Echo-Nie deleted the vctkUpdate branch April 9, 2025 15:46
@zxcd zxcd added this to the r1.5.0 milestone May 14, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants