Keyword spotting for video soundtrack indexing

Gelin, Philippe; Wellekens, Christian J
ICASSP 1996, 21st IEEE International Conference on Acoustics, Speech, and Signal Processing, May 7-10, 1996, Atlanta, USA

The amount of available video information is increasing dramatically with the development of multimedia applications. As a consequence, content-based retrieval tools are urgently needed for fast and easy access to multimedia databases, as well as to movies and recorded video news. In particular, queries may rely on off-line indexing. Keyword spotting on video soundtracks could be of great help in this indexing process and could in the future be combined with pattern or event recognition based on the strictly visual information. Specific constraints for this application are identified and a solution based on phonemic lattices is proposed. The word spotter achieves indexing on open vocabularies uttered by any speaker. It is fast enough for practical applications and does not require much additional stored information.


Type:
Conference
City:
Atlanta
Date:
1996-05-07
Department:
Digital Security
Eurecom Ref:
442
Copyright:
© 1996 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PERMALINK: https://www.eurecom.fr/publication/442