With the development of information technology and artificial intelligence, speech synthesis plays a significant role in human-computer interaction. However, current speech synthesis techniques still lack naturalness and expressiveness, so synthesized speech does not yet approach the standard of natural language. Another problem is that human-computer interaction based on speech synthesis is too monotonous to support a mechanism driven by the user's subjective state. This thesis reviews the historical development of speech synthesis and summarizes the general pipeline of the technique, pointing out that prosody generation is an important module in the synthesis process. Building on further research, a new human-computer interaction method is introduced that uses the eye-activity patterns of reading to control and drive prosody generation, enriching the forms of synthesis. The present state of speech synthesis technology is reviewed in detail. On the premise that eye-gaze data can be extracted, a speech synthesis method driven in real time by the eye-movement signal is proposed that can express the speaker's real speech rhythm: while the reader silently reads a corpus, reading information such as the eye-gaze duration per prosodic unit is captured, and a hierarchical prosodic duration model is established to determine the duration parameters of the synthesized speech. Finally, analysis verifies the feasibility of the proposed method.
KEYWORDS: Eye, Eye models, Control systems, Visual process modeling, Cognitive modeling, Mining, Signal processing, Motion controllers, Human-computer interaction, Systems modeling
Eye tracking has become a principal method for analyzing recognition issues in human-computer interaction, and capturing images of the human eye is the key problem in eye tracking. Based on further research, a new human-computer interaction method is introduced to enrich the form of speech synthesis. We propose a method of Implicit Prosody mining based on eye-image capture technology: parameters are extracted from images of the reader's eyes, used to control and drive prosody generation in speech synthesis, and a prosodic model with high simulation accuracy is established. The duration model is a key issue in prosody generation. For this model, the paper puts forward a new idea: obtain the eye-gaze duration during reading from the captured eye images, and synchronize this duration with the pronunciation duration in speech synthesis. Eye movement during reading is a comprehensive, multi-factor interactive process involving fixations, saccades, and regressions; therefore, how to extract the appropriate information from the eye images must be considered, and the gaze regularities of the eyes must be obtained as references for modeling. Based on an analysis of three current eye-movement control models and the characteristics of Implicit Prosody reading, the relative independence of the text speech-processing system and the eye-movement control system is discussed. It is shown that, under the same level of text familiarity, the eye-gaze duration during reading and the duration of internal (subvocal) pronunciation are synchronous. An eye-gaze duration model based on the prosodic structure of the Chinese language is presented, replacing previous machine-learning and probability-forecasting methods, to obtain the reader's real internal reading rhythm and to synthesize speech with a personalized rhythm. This research enriches the forms of human-computer interaction and has practical significance and application prospects for assisted speech interaction for the disabled. Experiments show that Implicit Prosody mining based on eye-image capture gives the synthesized speech more flexible expression.
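The core idea above, synchronizing per-prosodic-unit gaze duration with pronunciation duration, can be sketched as a simple rescaling: within each prosodic unit, baseline phone durations are scaled so their total matches the gaze duration the eye tracker measured for that unit. This is a minimal illustration under that assumption; the function names and the proportional-scaling rule are hypothetical, not the paper's actual hierarchical model.

```python
def scale_unit_durations(base_durations, gaze_duration_ms):
    """Scale baseline phone durations (ms) within one prosodic unit so
    their total matches the reader's eye-gaze duration for that unit."""
    total = sum(base_durations)
    if total <= 0:
        raise ValueError("baseline durations must sum to a positive value")
    factor = gaze_duration_ms / total
    return [d * factor for d in base_durations]

def drive_prosody(units):
    """units: list of (base_durations, gaze_duration_ms) per prosodic unit.
    Returns the flat list of gaze-driven phone durations."""
    driven = []
    for base, gaze in units:
        driven.extend(scale_unit_durations(base, gaze))
    return driven

# Example: two prosodic units with gaze durations from the eye tracker.
units = [([120.0, 80.0], 300.0),          # unit 1: 200 ms baseline, 300 ms gaze
         ([100.0, 100.0, 50.0], 200.0)]   # unit 2: 250 ms baseline, 200 ms gaze
print(drive_prosody(units))
```

A reader who lingers on a unit thus stretches its synthesized duration in proportion, which is one plausible way to carry the "personalized rhythm" into the synthesizer's duration parameters.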
To establish a measurement basis in a non-cooperative environment, this paper proposes an autonomous position-and-posture servo tracking method based on laser-light-guided monocular vision. The line of a linear laser projected onto a plane serves as the horizontal basis, while the laser light modulated by projection onto the reference plane is taken as the servo target. The position-and-posture change information of the modulated laser light is obtained by a monocular vision system, from which the attitude angle of the laser line can be calculated. The attitude angle is transmitted to a parallel tracking platform in real time and controls the movement of the platform to follow the laser light. The tracking-angle parameters of the parallel tracking platform under different positions and postures were verified with an inclinometer, which demonstrates the validity and effectiveness of the method. For the remaining measurement errors, the paper analyzes possible causes and offers feasible suggestions for further improving the precision of the system.
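The in-plane attitude angle of the projected laser line can be illustrated with a minimal sketch: given two points on the detected laser stripe in the camera image, the angle follows from the arctangent of the line's slope. This is a hedged simplification; the paper's actual pipeline involves camera calibration and the full position/posture computation, and the sign convention below assumes the usual image coordinates.

```python
import math

def laser_attitude_angle(p1, p2):
    """Estimate the in-plane attitude angle (degrees) of a projected laser
    line from two of its endpoints in the image. p1, p2: (x, y) pixel
    coordinates. Note: with image y increasing downward, the sign of the
    angle is flipped relative to the usual mathematical convention."""
    dx = p2[0] - p1[0]
    dy = p2[1] - p1[1]
    return math.degrees(math.atan2(dy, dx))

# A horizontal stripe reads 0 degrees; a 45-degree tilt reads 45 degrees.
print(laser_attitude_angle((100, 200), (300, 200)))  # 0.0
print(laser_attitude_angle((0, 0), (100, 100)))      # 45.0
```

In the described system this angle would be streamed to the parallel tracking platform each frame as the servo reference.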
KEYWORDS: Digital watermarking, Image encryption, Image processing, Chemical elements, Computer security, Digital imaging, Feature extraction, Multimedia, Image restoration, Information science
To improve image encryption strength, this paper proposes an image encryption method based on a parasitic audio watermark, which relies on dual messages in the image domain and the speech domain for encryption protection. The method uses a Chinese phonetic synthesis algorithm to synthesize audio from the embedded text, then segments the sentence into prosodic phrases and obtains the complete set of initial-consonant and compound-vowel elements that reflect the audio features of the statement. These elements are sampled and scrambled, combined with an image watermark, and the composite is embedded into the image to be encrypted in the frequency domain, so that the processed image carries the image watermark information and parasitically hosts the audio feature information. After watermark extraction, the audio is re-synthesized with the same phonetic synthesis algorithm and compared with the original. Experiments show that no decryption method confined to either the image domain or the speech domain alone can break the protection, and the image gains higher encryption strength and security through the double encryption.
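The scrambling of the initial-consonant and compound-vowel elements can be sketched as a keyed, invertible permutation applied before embedding and inverted during extraction. The abstract does not specify the actual scrambling scheme or the frequency-domain embedding, so the functions below are an illustrative assumption only.

```python
import random

def scramble_elements(elements, key):
    """Apply a keyed permutation to the phonetic element sequence before
    embedding. Returns the scrambled list and the permutation order
    needed to invert it. (Sketch only; not the paper's exact scheme.)"""
    rng = random.Random(key)
    order = list(range(len(elements)))
    rng.shuffle(order)
    return [elements[i] for i in order], order

def unscramble_elements(scrambled, order):
    """Invert the keyed permutation during watermark extraction."""
    out = [None] * len(scrambled)
    for dst, src in enumerate(order):
        out[src] = scrambled[dst]
    return out

# Initial consonants and compound vowels of a sample phrase.
elements = ["zh", "ong", "g", "uo"]
scrambled, order = scramble_elements(elements, key=42)
assert unscramble_elements(scrambled, order) == elements
```

Because the permutation is keyed, an attacker working only in the image domain recovers scrambled phonetic data, which is consistent with the dual-domain protection claimed above.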