Whispery voice is a type of voice quality in which the vocal folds do not vibrate and the airflow through the resonating cavity is modulated. Previous studies have shown that there is no fundamental frequency but has formant envelope in whispery voice. This is why the listener can recognize the voice quality. In this study, acoustic and spectral measures are extracted to investigate the formant pattern and voice quality of whispery voice. The results show that the differences in harmonic amplitudes (H4-H2K, H2K-H5K) are more effective in distinguishing whispered speech from modal voice than the differences in harmonic amplitudes at low frequencies (H1-H2, H2-H4). In addition, the values of Harmonic-to-Noise Ratio (HNR) and Cepstral Peak Prominence (CPP) of whispered voice were significantly lower than that of modal phonation, and the acoustic energy of whispery voice was also significantly reduced. As to formant frequency, the lower formant frequency of whispered vowels became higher compared to those of modal phonation. These findings can not only reveal the acoustic characteristics of whispery voice, but also provide theoretical foundation for whisper automatic recognition.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.