[Hackathon 7th] Fix the json model path for pir in `predict.py` of `panns` #3914

megemini · 2024-11-26T14:20:50Z

PR types

Bug fixes

PR changes

Others

Describe

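Fix the json model path for pir in predict.py of panns

This PR fixes two problems:

paddle was not imported (was this file ever actually run?)
The json model path used for pir models was wrong

Note: both problems surfaced while running the PaddleSpeech/examples/esc50 test:

$ CUDA_VISIBLE_DEVICES=0 ./run.sh 4 cpu ./export ~/datasets/5-9032-A-0.wav

However, even with both problems fixed, the model still cannot run inference, so the remaining cause still needs to be tracked down.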

@zxcd @Liyulingyue

paddle-bot · 2024-11-26T14:20:59Z

Thanks for your contribution!

megemini · 2024-11-27T12:14:34Z

Update 20241127

The inference problem is now solved. It came down to the dimension of feat in paddlespeech/cls/exps/panns/deploy/predict.py, plus the fact that mkldnn cannot be used.

zxcd · 2024-11-28T07:38:42Z

paddlespeech/cls/exps/panns/deploy/predict.py

@@ -74,12 +75,18 @@ def __init__(self,
        self.batch_size = batch_size

        model_file = os.path.join(model_dir, "inference.pdmodel")
+        if not os.path.exists(model_file):


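Suggest checking directly whether a .json file exists: if it does, model_file = .json; if not, model_file = .pdmodel. Error reporting is then handled in one place, at line 85.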
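The suggestion above can be sketched roughly as follows. This is a minimal illustration, not the merged code: the helper name `resolve_model_file` is hypothetical, and only the file names `inference.json` and `inference.pdmodel` come from the diff. The helper deliberately does not raise when neither file exists, so that a single later existence check can report the error.

```python
import os


def resolve_model_file(model_dir: str) -> str:
    """Pick the model file path, preferring the pir json format.

    Hypothetical helper: returns the .json path when it exists,
    otherwise falls back to the legacy .pdmodel path. It does not
    check that the fallback exists; that is left to the caller.
    """
    json_file = os.path.join(model_dir, "inference.json")
    if os.path.exists(json_file):
        return json_file
    return os.path.join(model_dir, "inference.pdmodel")
```

Keeping the fallback unconditional means a missing model is reported once, by whatever check follows, rather than in two separate branches.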

zxcd

LGTM

zxcd · 2024-12-03T09:21:42Z

paddlespeech/cls/exps/panns/deploy/predict.py

@@ -73,13 +74,18 @@ def __init__(self,
                 enable_mkldnn=False):
        self.batch_size = batch_size

-        model_file = os.path.join(model_dir, "inference.pdmodel")
+        if os.path.exists(os.path.join(model_dir, "inference.json")):


Same comment as in #3923.

megemini · 2024-12-03T13:47:04Z

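@zxcd Regarding the removal of unsqueeze in this PR: I tested it, and versions 2.6.2 and 2.5.1 do not need unsqueeze either.

Also, checking feat.dim() == 1 does not seem to be needed here, for example something like this: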

        feat = paddle.transpose(feat, perm=[1, 0])
        if feat.dim() == 1:
            feat = feat.unsqueeze(0)

If feat can be transposed with perm=[1, 0], its dim cannot be 1, so adding this check serves no purpose.
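The argument above can be illustrated with NumPy standing in for paddle. This is only a sketch under that assumption: `np.transpose` with a two-axis permutation rejects anything that is not 2-D, and `paddle.transpose` with `perm=[1, 0]` is assumed to behave analogously. The helper name `can_swap_two_axes` is made up for the example.

```python
import numpy as np


def can_swap_two_axes(x: np.ndarray) -> bool:
    """Return True if x accepts the two-axis permutation (1, 0)."""
    try:
        np.transpose(x, (1, 0))
        return True
    except ValueError:
        # NumPy raises ValueError when the permutation length
        # does not match the number of array dimensions.
        return False


two_d = np.zeros((3, 2))   # e.g. a (n_mels, time) feature
one_d = np.zeros(3)        # a 1-D array

swapped_ndim = np.transpose(two_d, (1, 0)).ndim
```

Any array that survives the transpose is still 2-D afterwards, so a later check for a 1-D result can never be true.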

zxcd · 2024-12-04T03:13:43Z

paddlespeech/cls/exps/panns/deploy/predict.py

@@ -55,8 +58,7 @@ def extract_features(files: str, **kwargs):

        feature_extractor = LogMelSpectrogram(sr, **kwargs)
        feat = feature_extractor(paddle.to_tensor(waveforms[i]))
-        feat = paddle.transpose(feat, perm=[1, 0]).unsqueeze(0)
-
+        feat = paddle.transpose(feat, perm=[1, 0])


A single audio file still needs unsqueeze; suggest adding an if check.

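unsqueeze should not be needed here. The later code uses np.stack, and np.stack adds a new dimension at axis=0 by itself, which is exactly the batch dimension we need. If unsqueeze were added here, the later code would have to use np.vstack instead, otherwise there would be one dimension too many.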

    >>> import numpy as np
    >>> a = np.random.rand(2, 3)
    >>> b = [a]
    >>> np.stack(b, axis=0)
    array([[[0.67850623, 0.57210335, 0.21218978],
            [0.10639948, 0.49831181, 0.45858706]]])
    >>> np.stack(b, axis=0).shape
    (1, 2, 3)
    >>> b = [a, a]
    >>> np.stack(b, axis=0).shape
    (2, 2, 3)
    >>> np.vstack(b)
    array([[0.67850623, 0.57210335, 0.21218978],
           [0.10639948, 0.49831181, 0.45858706],
           [0.67850623, 0.57210335, 0.21218978],
           [0.10639948, 0.49831181, 0.45858706]])
    >>> np.vstack(b).shape
    (4, 3)
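To complement the session above, the following sketch covers the case it does not show: what happens when each feature has already been unsqueezed. The feature shape `(5, 4)` is an arbitrary stand-in for a (time, n_mels) feature, and `feat[np.newaxis, ...]` is used here as the NumPy equivalent of `unsqueeze(0)`; neither comes from the PR itself.

```python
import numpy as np

# Hypothetical single-file feature with shape (time, n_mels).
feat = np.zeros((5, 4))

feats_plain = [feat]                  # feature kept 2-D, no unsqueeze
feats_unsq = [feat[np.newaxis, ...]]  # feature unsqueezed to (1, 5, 4)

# np.stack inserts a new leading axis, which becomes the batch axis.
stacked_plain = np.stack(feats_plain, axis=0)

# Stacking already-unsqueezed features yields one axis too many.
stacked_unsq = np.stack(feats_unsq, axis=0)

# np.vstack concatenates along the existing first axis instead.
vstacked_unsq = np.vstack(feats_unsq)

print(stacked_plain.shape, stacked_unsq.shape, vstacked_unsq.shape)
```

So either combination works for a single file: 2-D features with `np.stack`, or unsqueezed features with `np.vstack`. Mixing unsqueeze with `np.stack` is the case that produces the extra dimension.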

zxcd

LGTM

[Fix] panns predict.py

1c540d8

paddle-bot bot added the contributor label Nov 26, 2024

megemini added 2 commits November 27, 2024 15:32

[Update] path exists

0fc74d9

[Fix] disable mkldnn and transpose dimension

29ec0a6

zxcd reviewed Nov 28, 2024

[Update] model_file check json first

2d96506

megemini requested a review from zxcd November 28, 2024 09:50

zxcd previously approved these changes Dec 2, 2024

zxcd reviewed Dec 3, 2024

[Update] satisty version

42b0572

megemini dismissed zxcd’s stale review via 42b0572 December 3, 2024 10:54

megemini requested a review from zxcd December 3, 2024 10:55

megemini added 4 commits December 3, 2024 20:34

[Update] satisty version

e23d758

[Update] satisty version

e8998b9

Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech …

dd79cd6

…into fix_ex_esc50

[Update] config disable_mkldnn

8d2f176

zxcd reviewed Dec 4, 2024

megemini requested a review from zxcd December 4, 2024 05:24

[Update] unsqueeze

2279e6f

zxcd approved these changes Dec 5, 2024

zxcd merged commit f582cb6 into PaddlePaddle:develop Dec 5, 2024
5 checks passed

GreatV mentioned this pull request Mar 5, 2025

PaddleSpeech 1.5.0 Release Note #3996

Closed
