Paper
19 January 2009 Face and lip tracking in unconstrained imagery for improved automatic speech recognition
Brandon Crow, Jane Xiaozheng Zhang
Author Affiliations +
Proceedings Volume 7257, Visual Communications and Image Processing 2009; 72571Y (2009) https://doi.org/10.1117/12.817092
Event: IS&T/SPIE Electronic Imaging, 2009, San Jose, California, United States
Abstract
When combined with acoustical speech information, visual speech information (lip movement) significantly improves Automatic Speech Recognition (ASR) in acoustically noisy environments. Previous research has demonstrated that visual modality is a viable tool for identifying speech. However, the visual information has yet to become utilized in mainstream ASR systems due to the difficulty in accurately tracking lips in real-world conditions. This paper presents our current progress in tracking face and lips in visually challenging environments. Findings suggest the mean shift algorithm performs poorly for small regions, in this case the lips, but it achieves near 80% accuracy for facial tracking.
© (2009) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Brandon Crow and Jane Xiaozheng Zhang "Face and lip tracking in unconstrained imagery for improved automatic speech recognition", Proc. SPIE 7257, Visual Communications and Image Processing 2009, 72571Y (19 January 2009); https://doi.org/10.1117/12.817092
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Laser induced plasma spectroscopy

Detection and tracking algorithms

Video

Facial recognition systems

Visualization

Information visualization

RGB color model

RELATED CONTENT

Automatic lip reading by using multimodal visual features
Proceedings of SPIE (February 03 2014)
Visual-language modal hybrid tracking algorithm
Proceedings of SPIE (December 01 2023)
Lip-reading enhancement for law enforcement
Proceedings of SPIE (September 28 2006)

Back to Top