-
Notifications
You must be signed in to change notification settings - Fork 149
Add diarization recipe v3 #347
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
I think it is better to link the local directory and path.sh file directly if we reuse them. |
Update News section in README.md
Update clustering method
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Well Done!
examples/voxconverse/v3/README.md
Outdated
* Refer to [voxceleb sv recipe](https://github.com/wenet-e2e/wespeaker/tree/master/examples/voxceleb/v2) | ||
* [pretrained model path](https://wespeaker-1256283475.cos.ap-shanghai.myqcloud.com/models/voxceleb/voxceleb_resnet34_LM.onnx) | ||
* Speaker activity detection model: oracle SAD (from ground truth annotation) or system SAD (VAD model pretrained by silero, https://github.com/snakers4/silero-vad) | ||
* Clustering method: spectral clustering |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The clustering method should be umap + dbscan?
@@ -29,7 +29,7 @@ | |||
from wespeaker.cli.utils import get_args | |||
from wespeaker.models.speaker_model import get_speaker_model | |||
from wespeaker.utils.checkpoint import load_checkpoint | |||
from wespeaker.diar.spectral_clusterer import cluster | |||
from wespeaker.diar.umap_clusterer import cluster |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@JiJiJiang I am not sure whether we should change the client script.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, just keep it as the better one.
|
||
import torch | ||
import silero_vad | ||
from wespeaker.utils.file_utils import read_scp | ||
|
||
|
||
def get_args(): | ||
parser = argparse.ArgumentParser(description='') |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
should we also edit the v1 and v2 version, if we change the arguments of this script?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, also update the results if change into silero vad v5.1.
* Add diarization recipe v3 * resolve pylint issues and add missing modifications * eliminate trailing whitespace * deterministic clustering; update README.md * fix args usage in umap_clusterer.py * local import; remove unused diarization args; self.model.eval() when init * compact embedding clustering procedure into a single source file * link to local and path.sh; update requirements.txt and extract_emb.py * fix lint error: extract_emb.py * Update README.md Update News section in README.md * Update voxconverse/v3/README.md Update clustering method * Update README.md --------- Co-authored-by: Zhengyang Chen <chenzhengyang117@gmail.com>
Add diarization recipe v3 for voxconverse dataset.
Highlights
Results
Dev set
Test set
Footnotes
Spot the conversation: speaker diarisation in the wild, https://arxiv.org/pdf/2007.01216.pdf ↩ ↩2 ↩3 ↩4 ↩5