Rethinking algorithm design and development in speech processing

authored by
Thilo Stadelmann, Yinghui Wang, Matthew Smithy, Ralph Ewerth, Bernd Freisleben
Abstract

Speech processing is typically based on a set of complex algorithms requiring many parameters to be specified. When parts of the speech processing chain do not behave as expected, trial and error is often the only way to investigate the reasons. In this paper, we present a research methodology to analyze unexpected algorithmic behavior by making (intermediate) results of the speech processing chain perceivable and intuitively comprehensible by humans. The workflow of the process is explicated using a real-world example leading to considerable improvements in speaker clustering. The described methodology is supported by a software toolbox available for download.

External Organisation(s)
Philipps-Universität Marburg
Type
Conference contribution
Pages
4476-4479
No. of pages
4
Publication date
07.10.2010
Publication status
Published
Peer reviewed
Yes
ASJC Scopus subject areas
Computer Vision and Pattern Recognition
Electronic version(s)
https://doi.org/10.1109/ICPR.2010.1087 (Access: Closed)