Embedded Segmental K-Means (ES-KMeans)

Overview

This repository implements unsupervised acoustic word segmentation and clustering using the embedded segmental K-means (ES-KMeans) algorithm. The algorithm is described in:

H. Kamper, K. Livescu, and S. J. Goldwater, "An embedded segmental K-means model for unsupervised segmentation and clustering of speech," in Proc. ASRU, 2017. [arXiv]

Please cite this paper if you use the code.
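
As a rough intuition for what the algorithm does, the sketch below alternates between two steps: segmenting an utterance with dynamic programming, and re-estimating K-means cluster means from fixed-dimensional embeddings of the resulting segments. This is a toy illustration and not the interface of the eskmeans package. The names `embed`, `segment` and `toy_eskmeans` are hypothetical, the embedding is an assumed simple downsampling, the segment cost is assumed to be a duration-weighted squared distance to the nearest mean, and the mean update is a plain unweighted average. See the paper and the example notebooks for the actual formulation.

```python
import numpy as np


def embed(frames, n_samples=3):
    """Toy embedding (assumption): pick evenly spaced frames and flatten them."""
    idx = np.linspace(0, len(frames) - 1, n_samples).round().astype(int)
    return frames[idx].ravel()


def segment(frames, means, max_len):
    """Find the segmentation with the lowest total cost by dynamic programming.

    Each candidate segment costs its duration times the squared distance
    from its embedding to the nearest cluster mean.
    """
    T = len(frames)
    best = np.full(T + 1, np.inf)  # best[t]: lowest cost of segmenting frames[:t]
    best[0] = 0.0
    back = np.zeros(T + 1, dtype=int)  # back[t]: start of the last segment ending at t
    for t in range(1, T + 1):
        for s in range(max(0, t - max_len), t):
            x = embed(frames[s:t])
            cost = (t - s) * np.min(np.sum((means - x) ** 2, axis=1))
            if best[s] + cost < best[t]:
                best[t] = best[s] + cost
                back[t] = s
    # Trace back from the end of the utterance to recover the boundaries
    bounds = []
    t = T
    while t > 0:
        s = int(back[t])
        bounds.append((s, t))
        t = s
    return bounds[::-1]


def toy_eskmeans(frames, n_clusters, max_len, n_iter=5, seed=0):
    """Alternate between segmentation and K-means mean updates (toy version)."""
    frames = np.asarray(frames, dtype=float)
    rng = np.random.default_rng(seed)
    T = len(frames)
    # Initialise the means from embeddings of randomly placed segments
    starts = rng.integers(0, T - max_len + 1, size=n_clusters)
    means = np.stack([embed(frames[s:s + max_len]) for s in starts])
    for _ in range(n_iter):
        bounds = segment(frames, means, max_len)
        X = np.stack([embed(frames[s:t]) for s, t in bounds])
        dists = ((X[:, None, :] - means[None, :, :]) ** 2).sum(axis=-1)
        labels = np.argmin(dists, axis=1)
        for k in range(n_clusters):
            if np.any(labels == k):
                means[k] = X[labels == k].mean(axis=0)  # unweighted (simplification)
    return bounds, labels


# Usage on random stand-in "features": 20 frames of dimension 2
frames = np.random.default_rng(1).normal(size=(20, 2))
bounds, labels = toy_eskmeans(frames, n_clusters=3, max_len=5)
print(bounds)   # list of (start, end) frame indices
print(labels)   # cluster index assigned to each segment
```

The returned boundaries tile the utterance with segments of at most `max_len` frames, and each segment receives one cluster label, which is the joint segmentation-and-clustering output the overview refers to.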

Installation

Dependencies can be installed in a conda environment:

conda env create -f environment.yml
conda activate eskmeans

Run the unit tests:

nosetests -v

Examples

Several example notebooks are given in examples/. The examples/eskmeans_example.ipynb notebook provides a step-by-step example of the ES-KMeans algorithm. This notebook can also be opened directly in Colab.

Recipe

The code here only provides the main algorithm and some supporting utilities. A complete recipe where ES-KMeans is applied to the Buckeye English and NCHLT Xitsonga datasets is available here.

License

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

The module utils/theta_oscillator.py is a derivation of Adriana Stan's Python implementation available at https://github.com/speech-utcluj/thetaOscillator-syllable-segmentation, which was released under the same license. Concrete changes to Adriana's code are listed in the documentation at the top of the module. I also list Adriana as a contributor below.
