【PaddleSpeech No.7-10】补全合成系列中的脚本中参数缺失 #4008

Echo-Nie · 2025-03-15T10:00:26Z

PR types

Function optimization, Docs

PR changes

Docs, Others

Describe

本次修改主要包含：

对 examples/csmsc/ 文件夹下的 tts0、tts2、tts3；以及examples/csmsc/tts3_rhy/ 下的READEME.md文档和run.sh脚本均进行修改。其中，在修改过程中发现 tts3 下还有README_cn.md文档，也对其同时进行修改。
脚本优化： 为 run.sh 中的合成阶段添加 --stage 参数，根据对应的sh下文件的合成阶段进行stage添加
文档完善： 在 README.md 中补充 stage 参数说明，明确 vocoder 选择逻辑，优化文档措辞，如将0 or 1 or 2 or 3 ...改为0-4。

Issue链接：#3997

@luotao1 @zxcd

…EAMDE.md修改：补充 stage 参数说明，明确 vocoder 选择逻辑

…失内容（不在开源活动范畴内）

paddle-bot · 2025-03-15T10:00:32Z

Thanks for your contribution!

zxcd · 2025-03-18T06:58:36Z

建议：是否可以让开发者们参考 tss3 下的中文文档为项目的其他文件构建类似的中文文档呢？

如果您看到了文档缺失的部分，可以提出来，我会把它继续新增到快乐开源任务中。

examples/csmsc/tts0/run.sh

…update the READEM to be consistent with the script

zxcd · 2025-03-18T07:01:08Z

examples/csmsc/tts2/run.sh

 fi

 if [ ${stage} -le 3 ] && [ ${stop_stage} -ge 3 ]; then
-    # synthesize_e2e, vocoder is pwgan by default
-    CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize_e2e.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
+    # synthesize_e2e, vocoder is pwgan by default stage 0, stage 1 will use hifigan as vocoder


same with above

zxcd · 2025-03-18T07:02:05Z

examples/csmsc/tts3/README_cn.md

 ```
+`--stage` 用于合成过程中控制声码器模型，可取值为 `0` 或 `1`，分别对应使用 `pwgan` 或 `hifigan` 模型作为声码器。


same with above. pls check all files.

also change this.

zxcd · 2025-03-19T07:00:13Z

examples/csmsc/tts0/README.md

 ```
+`--stage` controls the vocoder model during synthesis, which can be `0` or `1` or `2` or `3`, use `pwgan` or `multi band melgan` or `style melgan` or `hifigan`model as vocoder.


why don't use dict to present this message?
such as use stage 0-4 to select the vocoder to use {pwgan, multi band melgan, ....}
This kind of expression is a bit cumbersome now.

OK, I checked the README and sh files in the four folders under csmsc and believe there should be no issues.

Echo-Nie

@zxcd pls review

zxcd · 2025-03-24T09:09:28Z

examples/csmsc/tts3_rhy/README.md

 1. **source path**.
 2. preprocess the dataset.
 3. train the model.
 4. synthesize wavs.
    - synthesize waveform from `metadata.jsonl`.
+    - use stage `1,3,4` to select the vocoder to use {`multi band melgan`, `hifigan`, `wavernn`}


add the usage of synthesize.sh and synthesize_e2e.sh like other files.

zxcd · 2025-03-24T09:09:57Z

examples/csmsc/tts3_rhy/README.md

@@ -14,11 +14,13 @@ Remember in our repo, you should add `--rhy-with-duration` flag to obtain the rh
 Assume the path to the dataset is `~/datasets/BZNSYP`.
 Assume the path to the MFA result of CSMSC is `./baker_alignment_tone`.
 Run the command below to
+


extra space

also change ths file add stage information?

zxcd · 2025-03-24T09:11:58Z

examples/csmsc/tts3_rhy/run.sh

@@ -28,11 +28,12 @@ if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
 fi

 if [ ${stage} -le 2 ] && [ ${stop_stage} -ge 2 ]; then
-    # synthesize, vocoder is pwgan by default
-    CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
+    # synthesize, vocoder is pwgan by default stage 0, stage 1 will use hifigan as vocoder


zxcd · 2025-03-24T09:12:23Z

examples/csmsc/tts3/README_cn.md

 ```
+`--stage` 用于合成过程中控制声码器模型，可取值为 `0` 或 `1`，分别对应使用 `pwgan` 或 `hifigan` 模型作为声码器。


also change this.

Echo-Nie · 2025-04-03T13:19:22Z

This PR has too much content and is a bit messy, so close

Echo-Nie added 6 commits March 14, 2025 20:44

run.sh修改：为 synthesize 和 synthesize_e2e 添加 --stage 参数控制 vocoder 模型选择，R…

a8252c4

…EAMDE.md修改：补充 stage 参数说明，明确 vocoder 选择逻辑

添加run.sh中stage参数相关的注释

74d6eaf

补全合成系列中的脚本中参数缺失：csmsc/tts0

b893b8e

补全合成系列中的脚本中参数缺失：csmsc/tts2

e3c0a5a

补全合成系列中的脚本中参数缺失：csmsc/tts3；为了保证文档的一致性，在更新语英文文档的同时也补充其中的README_cn文档中的缺…

5d5de12

…失内容（不在开源活动范畴内）

补全合成系列中的脚本中参数缺失：csmsc/tts3_rhy

3e1ba86

paddle-bot bot added the contributor label Mar 15, 2025

mergify bot added Example README labels Mar 15, 2025

Echo-Nie mentioned this pull request Mar 15, 2025

PaddleSpeech 快乐开源活动 (2025 H1) #3997

Closed

Echo-Nie changed the title ~~PaddleSpeech 快乐开源活动【任务二：No.7-10】~~ 【Doc】补全合成系列中的脚本中参数缺失No.7-10 Mar 16, 2025

luotao1 assigned luotao1 and zxcd Mar 17, 2025

luotao1 changed the title ~~【Doc】补全合成系列中的脚本中参数缺失No.7-10~~ 【PaddleSpeech No.7-10】补全合成系列中的脚本中参数缺失 Mar 17, 2025

luotao1 added the HappyOpenSource 快乐开源活动issue与PR label Mar 17, 2025

zxcd reviewed Mar 18, 2025

View reviewed changes

examples/csmsc/tts0/run.sh Outdated Show resolved Hide resolved

Echo-Nie and others added 5 commits March 18, 2025 21:24

Merge branch 'PaddlePaddle:develop' into csmscUpdate

b2bae2f

update the vocoder of synthesize and synthesize_e2e to 4 stages, and …

d8f1d03

…update the READEM to be consistent with the script

update csmsc/tts2

bb8ea1a

update csmsc/tts3

39cead0

update csmsc/tts3_rhy

6077fa5

zxcd reviewed Mar 19, 2025

View reviewed changes

update examples/csmsc/README.md and examples/csmsc/{tts0,tts2,tts3}

e46a165

Echo-Nie commented Mar 21, 2025

View reviewed changes

zxcd reviewed Mar 24, 2025

View reviewed changes

fix the errors 2025/3/24

bef985c

Echo-Nie closed this Apr 3, 2025

Echo-Nie deleted the csmscUpdate branch April 9, 2025 15:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

【PaddleSpeech No.7-10】补全合成系列中的脚本中参数缺失 #4008

【PaddleSpeech No.7-10】补全合成系列中的脚本中参数缺失 #4008

Uh oh!

Echo-Nie commented Mar 15, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Mar 15, 2025

Uh oh!

zxcd commented Mar 18, 2025

Uh oh!

Uh oh!

zxcd Mar 18, 2025

Uh oh!

zxcd Mar 18, 2025

Uh oh!

zxcd Mar 24, 2025

Uh oh!

zxcd Mar 19, 2025 •

edited

Loading

Uh oh!

Echo-Nie Mar 19, 2025

Uh oh!

Echo-Nie left a comment

Uh oh!

zxcd Mar 24, 2025

Uh oh!

zxcd Mar 24, 2025

Uh oh!

zxcd Mar 31, 2025

Uh oh!

zxcd Mar 24, 2025

Uh oh!

zxcd Mar 24, 2025

Uh oh!

Echo-Nie commented Apr 3, 2025

Uh oh!

Uh oh!

		```
		`--stage` 用于合成过程中控制声码器模型，可取值为 `0` 或 `1`，分别对应使用 `pwgan` 或 `hifigan` 模型作为声码器。

		```
		`--stage` controls the vocoder model during synthesis, which can be `0` or `1` or `2` or `3`, use `pwgan` or `multi band melgan` or `style melgan` or `hifigan`model as vocoder.

【PaddleSpeech No.7-10】补全合成系列中的脚本中参数缺失 #4008

【PaddleSpeech No.7-10】补全合成系列中的脚本中参数缺失 #4008

Uh oh!

Conversation

Echo-Nie commented Mar 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR types

PR changes

Describe

Uh oh!

paddle-bot bot commented Mar 15, 2025

Uh oh!

zxcd commented Mar 18, 2025

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zxcd Mar 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Echo-Nie left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Echo-Nie commented Apr 3, 2025

Uh oh!

Uh oh!

Echo-Nie commented Mar 15, 2025 •

edited

Loading

zxcd Mar 19, 2025 •

edited

Loading