Mongolian Grapheme to Phoneme Conversion by Using Hybrid Approach
Grapheme to phoneme(G2P)conversion is the assignment of converting word to its pronunciation.It has important applications in text-tospeech(TTS),speech recognition and sounds-like queries in textual databases.In this paper,we present the first application of sequence-to-sequence(Seq2Seq)Long Short-Term Memory(LSTM)model with the attention mechanism for Mongolian G2P conversion.Furthermore,we propose a novel hybrid approach of combining rules with Seq2Seq LSTM model for Mongolian G2P conversion,and implement the Mongolian G2P conversion system.The experimental results show that: Adopting Seq2Seq LSTM model can obtain better performance than traditional methods of Mongolian G2P conversion,and the hybrid approach further improves G2P conversion performance.The word error rate(WER)relatively reduces by 10.8%and the phoneme error rate(PER)approximately reduces by 1.6%through comparing with the Mongolian G2P conversion method being used based on the joint-sequence models,which completely meets the practical requirements of Mongolian G2P conversion.
Mongolian Grapheme-to-phoneme Sequence-to-sequence LSTM
Zhinan Liu Feilong Bao Guanglai Gao Suburi
College of Computer Science,Inner Mongolia University,Huhhot 010021,China Inner Mongolia Public Security Department,Huhhot 010021,China
国际会议
2018自然语言处理与中文计算国际会议(NLPCC2018)
呼和浩特
英文
40-50
2018-08-26(万方平台首次上网日期,不代表论文的发表时间)