Visual Speech Recognition

Author/s

Ioannis Pitas (AUTH)

About the resource/s

This lecture gives an overview of Visual Speech Recognition, which has many applications in Human-centered Computing, Image and Video Analysis, and Social Media Analytics. It covers the following topics in detail: Visual Speech Recognition, Visemes and Phonemes, Face Detection, Landmark Localization, Lip Reading, Speech Reading Beyond the Lips, Audio-Visual Speech Recognition, Deep Audio-Visual Speech Recognition, Convolutional Neural Networks, Recurrent Neural Networks, Overlapped Speech, Speaker-Targeted AVSR Models, Visual Speech Recognition for Mobile Devices, Visual Speech Recognition Datasets, and Experiments on Each Dataset.

Other Sources