Using linguistic information to detect overlapping speech

Geiger, Jürgen T; Eyben, Florian; Evans, Nicholas; Schuller, Björn; Rigoll, Gerhard
INTERSPEECH 2013, 14th Annual Conference of the
International Speech Communication Association, August 25-29, 2013, Lyon, France

Overlapping speech is still a major cause of error in many speech processing applications, currently without any satisfactory solution. This paper considers the problem of detecting segments of overlapping speech within meeting recordings. Using an HMM-based framework recordings are segmented into
intervals containing non-speech, speech and overlapping speech. New to this contribution is the use of linguistic information, where spoken content is used to improve overlap detection. Using language models for speech and overlap, an overlap score is created for every spoken word and used as an additional fea-
ture within the HMM framework. Experiments conducted on the AMI corpus demonstrate the potential of the proposed linguistic features.

DOI
Type:
Conference
City:
Lyon
Date:
2013-08-25
Department:
Digital Security
Eurecom Ref:
4020
Copyright:
© ISCA. Personal use of this material is permitted. The definitive version of this paper was published in INTERSPEECH 2013, 14th Annual Conference of the
International Speech Communication Association, August 25-29, 2013, Lyon, France and is available at : http://dx.doi.org/10.21437/Interspeech.2013-195
See also:

PERMALINK : https://www.eurecom.fr/publication/4020