The Experts below are selected from a list of 137178 Experts worldwide ranked by ideXlab platform

Alexander Waibel - One of the best experts on this subject based on the ideXlab platform.

Julian Kratt - One of the best experts on this subject based on the ideXlab platform.

Florian Metze - One of the best experts on this subject based on the ideXlab platform.

Rainer Stiefelhagen - One of the best experts on this subject based on the ideXlab platform.

Gerasimos Potamianos - One of the best experts on this subject based on the ideXlab platform.

  • Speech Recognition, Audio-Visual
    Encyclopedia of Language & Linguistics, 2006
    Co-Authors: Gerasimos Potamianos
    Abstract:

    Audio-visual Speech Recognition refers to the automatic transcription of Speech into text by exploiting information present in the video of the speaker's mouth region, in addition to the traditionally used acoustic signal. The use of visual information in automatic Speech Recognition is also known as automatic Speechreading or lipreading, and has been motivated by the bimodality of human Speech production and perception, coupled with the fact that audio-only Speech Recognition is not robust in noisy acoustic environments. Audio-visual Speech Recognition systems significantly outperform their audio-only counterparts, especially under ideal visual and noisy audio conditions. Incorporating visual information into Speech Recognition requires two new components: the visual front end, which detects the speaker's mouth area and extracts informative visual Speech features from it, and the integration of the visual features into the Speech Recognition process. The most commonly adopted designs of these components are discussed here.