Congress detail
Authors: Gonzalo Sad; Lucas D. Terissi; Juan C. Gómez.
Abstract: The performance of classical speech recognition techniques based on audio features degrades in noisy environments. Including visual features related to mouth movements in the recognition process improves system performance. This paper proposes an isolated-word speech recognition system based on audio-visual features. The proposed system combines three classifiers based on audio, visual, and audio-visual information, respectively. An audio-visual database composed of utterances of the digits in Spanish is used to test the proposed system. The experimental results show a significant improvement in recognition rates across a wide range of signal-to-noise ratios.
Meeting type: Conference.
Type of work: Full paper.
Production: Isolated Spanish Digit Recognition based on Audio-Visual Features.
Scientific meeting: XIX Congreso Argentino de Ciencias de la Computación - CACIC 2013.
Meeting place: Mar del Plata.
Organizing Institution: Red de Universidades Nacionales con carreras en Informática (RedUNCI).
Published: Yes
Publication place: Mar del Plata
Meeting month: 10
Year: 2013.