MIRACL - Publications : Human-Machine-Interaction-via-Visual-Speech-Spotting

Acceuil

Publications

Dernières mise à jour :

2022-05-12 12:45:10

Auteur princpal :

Auteur secondaire(s) :

- Achraf Ben-Hamadou

- Walid MAHDI

Titre :

Human Machine Interaction via Visual Speech Spotting

Conférence :

ACIVS 2015

Mois :

octobre

Année :

2015

Journal, revue, proceedings ... :

Pays :

Italie

Ville :

Catania

Type de publication :

Conférence intertnationale

Abstract :

In this paper, we propose an automatic visual speech spotting system adapted for RGB-D cameras and based on Hidden Markov

Models (HMMs). Our system is based on two main processing blocks,

namely, visual feature extraction and speech spotting and recognition.

In feature extraction step, the speaker’s face pose is estimated using a

3D face model including a rectangular 3D mouth patch used to precisely

extract the mouth region. Then, spatio-temporal features are computed

on the extracted mouth region. In the second step, the speech video is

segmented by finding the starting and the ending points of meaningful

utterances and recognized using Viterbi algorithm. The proposed system

is mainly evaluated on an extended version of the MIRACL-VC1 dataset.

Experimental results demonstrate that the proposed system can segment

and recognize key utterances with a recognition rates of 83 % and a reliability of 81.4 %.