Poster I

VALID: A New Practical Audio-Visual Database, and Comparative Results

Niall A. Fox (1), Brian A. O’Mullane (1) and Richard B. Reilly (1)

(1) Dept. of Electronic and Electrical Engineering, University College Dublin, Belfield, Dublin 4, Ireland

Abstract
Audio, face, and multi-modal person recognition systems deployed in non-controlled scenarios typically perform worse than systems developed in highly controlled environments. To facilitate the development of robust audio, face, and multi-modal person recognition systems, the new, large, and realistic multi-modal (audio-visual) VALID database was acquired in a noisy “real world” office scenario with no control over illumination or acoustic noise. In this paper we describe the acquisition and content of the VALID database, which consists of five recording sessions of 106 subjects over a period of one month. We report speaker identification experiments using visual speech features extracted from the mouth region, and compare performance on the uncontrolled VALID database with performance on the controlled XM2VTS database. The best VALID and XM2VTS based accuracies are 63.21% and 97.17% respectively. This gap highlights the degrading effect of an uncontrolled illumination environment and the importance of this database for deploying real-world applications. The VALID database is available to the academic community through http://ee.ucd.ie/validdb/.

Contact Information

Niall A. Fox
Email: niall.fox@ee.ucd.ie
URL: http://wwwdsp.ucd.ie/

Brian A. O’Mullane
Email: brian.omullane@ee.ucd.ie
URL: http://wwwdsp.ucd.ie/

Richard B. Reilly
Email: richard.reilly@ucd.ie
URL: http://wwwdsp.ucd.ie/