Add diarization recipe v3 #347

xx205 · 2024-08-11T15:55:50Z

Add diarization recipe v3 for voxconverse dataset.

Highlights

update silero-vad to v5.1 from v3.1
new diarization method umap+hdbscan

Results

Dev set

system	MISS	FA	SC	DER
This repo (with oracle SAD)	2.3	0.0	1.3	3.6
This repo (with system SAD)	3.4	0.6	1.4	5.4
DIHARD 2019 baseline ¹	11.1	1.4	11.3	23.8
DIHARD 2019 baseline w/ SE ¹	9.3	1.3	9.7	20.2
(SyncNet ASD only) ¹	2.2	4.1	4.0	10.4
(AVSE ASD only) ¹	2.0	5.9	4.6	12.4
(proposed) ¹	2.4	2.3	3.0	7.7

Test set

system MISS FA SC DER

This repo (with oracle SAD) 1.6 0.0 1.9 3.5

This repo (with system SAD) 3.8 1.7 1.8 7.4

Spot the conversation: speaker diarisation in the wild, https://arxiv.org/pdf/2007.01216.pdf ↩ ↩² ↩³ ↩⁴ ↩⁵

…init

czy97 · 2024-08-19T09:17:10Z

The news part should be updated

czy97 · 2024-08-19T11:09:38Z

I think it is better to link the local directory and path.sh file directly if we reuse them.

Update News section in README.md

Update clustering method

czy97

Well Done!

czy97 · 2024-08-19T09:07:01Z

examples/voxconverse/v3/README.md

+  * Refer to [voxceleb sv recipe](https://github.com/wenet-e2e/wespeaker/tree/master/examples/voxceleb/v2)
+  * [pretrained model path](https://wespeaker-1256283475.cos.ap-shanghai.myqcloud.com/models/voxceleb/voxceleb_resnet34_LM.onnx)
+* Speaker activity detection model: oracle SAD (from ground truth annotation) or system SAD (VAD model pretrained by silero, https://github.com/snakers4/silero-vad)
+* Clustering method: spectral clustering


The clustering method should be umap + dbscan?

czy97 · 2024-08-19T11:03:26Z

wespeaker/cli/speaker.py

@@ -29,7 +29,7 @@
 from wespeaker.cli.utils import get_args
 from wespeaker.models.speaker_model import get_speaker_model
 from wespeaker.utils.checkpoint import load_checkpoint
-from wespeaker.diar.spectral_clusterer import cluster
+from wespeaker.diar.umap_clusterer import cluster


@JiJiJiang I am not sure whether we should change the client script.

Yes, just keep it as the better one.

czy97 · 2024-08-19T11:06:37Z

wespeaker/diar/make_system_sad.py


 import torch
+import silero_vad
 from wespeaker.utils.file_utils import read_scp


 def get_args():
    parser = argparse.ArgumentParser(description='')


should we also edit the v1 and v2 version, if we change the arguments of this script?

Yes, also update the results if change into silero vad v5.1.

* Add diarization recipe v3 * resolve pylint issues and add missing modifications * eliminate trailing whitespace * deterministic clustering; update README.md * fix args usage in umap_clusterer.py * local import; remove unused diarization args; self.model.eval() when init * compact embedding clustering procedure into a single source file * link to local and path.sh; update requirements.txt and extract_emb.py * fix lint error: extract_emb.py * Update README.md Update News section in README.md * Update voxconverse/v3/README.md Update clustering method * Update README.md --------- Co-authored-by: Zhengyang Chen <chenzhengyang117@gmail.com>

xx205 added 4 commits August 11, 2024 15:51

Add diarization recipe v3

46707ab

resolve pylint issues and add missing modifications

fdcf72a

eliminate trailing whitespace

700dfe0

deterministic clustering; update README.md

77c340d

xx205 requested review from cdliang11 and JiJiJiang August 12, 2024 02:44

xx205 added 3 commits August 12, 2024 04:49

fix args usage in umap_clusterer.py

7636e32

local import; remove unused diarization args; self.model.eval() when …

2731690

…init

compact embedding clustering procedure into a single source file

4ac134d

xx205 requested review from wsstriving and czy97 August 12, 2024 16:27

xx205 and others added 6 commits August 19, 2024 16:20

link to local and path.sh; update requirements.txt and extract_emb.py

f894bb2

Merge branch 'master' into voxconverse_v3

5f9e416

fix lint error: extract_emb.py

69d2134

Update README.md

03c0e48

Update News section in README.md

Update voxconverse/v3/README.md

a33c1ce

Update clustering method

Update README.md

78e52f8

czy97 approved these changes Aug 20, 2024

View reviewed changes

czy97 merged commit 5ac089e into wenet-e2e:master Aug 20, 2024
4 checks passed

xx205 deleted the voxconverse_v3 branch August 20, 2024 14:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add diarization recipe v3 #347

Add diarization recipe v3 #347

Uh oh!

xx205 commented Aug 11, 2024 •

edited

Loading

Uh oh!

czy97 commented Aug 19, 2024

Uh oh!

czy97 commented Aug 19, 2024

Uh oh!

czy97 left a comment

Uh oh!

czy97 Aug 19, 2024

Uh oh!

czy97 Aug 19, 2024

Uh oh!

JiJiJiang Aug 20, 2024

Uh oh!

czy97 Aug 19, 2024

Uh oh!

JiJiJiang Aug 20, 2024

Uh oh!

Uh oh!

Uh oh!

Add diarization recipe v3 #347

Add diarization recipe v3 #347

Uh oh!

Conversation

xx205 commented Aug 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Highlights

Results

Footnotes

Uh oh!

czy97 commented Aug 19, 2024

Uh oh!

czy97 commented Aug 19, 2024

Uh oh!

czy97 left a comment

Choose a reason for hiding this comment

Uh oh!

czy97 Aug 19, 2024

Choose a reason for hiding this comment

Uh oh!

czy97 Aug 19, 2024

Choose a reason for hiding this comment

Uh oh!

JiJiJiang Aug 20, 2024

Choose a reason for hiding this comment

Uh oh!

czy97 Aug 19, 2024

Choose a reason for hiding this comment

Uh oh!

JiJiJiang Aug 20, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

xx205 commented Aug 11, 2024 •

edited

Loading