Analysis of Stream-Dependent Tying Structure for HMM-based Speech Synthesis

Abstract:

In the conventional HMM-based speech synthesis framework, spectral features are modeled in a single stream, and stream-dependent tree-based clustering is then applied to tie the model parameters. In this paper, we investigate several different stream-dependent tying structures for spectral features by splitting the feature vector into several streams. One splitting approach is to place each feature dimension in its own stream. Another is to split the static and dynamic features into different streams. Although splitting spectral features into different streams ignores the correlation of context dependency between them, the number of model parameters can be optimized for each stream after stream-dependent clustering. The experimental results show that both splitting approaches can improve the quality of synthesized speech. However, the quality of synthesized speech became worse when the two splitting approaches were combined.

Keywords: HMM-based speech synthesis; stream-dependent tying structure

Authors: Zhi-Peng Yu, Yi-Jian Wu, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda

Affiliation: Nagoya Institute of Technology, Japan

Conference type: International conference

Conference name: 9th International Conference on Signal Processing (ICSP08)

Conference location: Beijing

Conference language: English

Online publication date: 2008-10-26 (date first posted on the Wanfang platform; not the paper's publication date)
