PolyglotDB

PolyglotDB is a Python package for storing, phonetically analyzing, and querying speech corpora. It works with corpora of any size and is built to scale to very large ones. It constructs a "polyglot" NoSQL database that mirrors the structure of phonetic data, and it provides a consistent Python API for interacting with the underlying databases. The online documentation, which includes tutorials and case studies, is available at http://polyglotdb.readthedocs.io/en/latest/.

Citation

McAuliffe, Michael, Elias Stengel-Eskin, Michaela Socolof, and Morgan Sonderegger (2017). Polyglot and Speech Corpus Tools: a system for representing, integrating, and querying speech corpora. In Proceedings of Interspeech 2017, pp. 3887–3891. https://doi.org/10.21437/Interspeech.2017-1390.
