Using SIMD Technology to Speed up Likelihood Computation in HMM-based Speech Recognition Systems
Most state-of-the-art LVCSR systems are based on continuous density HMMs, which are typically implemented using Gaussian mixture distributions. Such statistical modeling systems usually operate slower than real-time, largely because of the heavy computational overhead of the likelihood computation. The objective of our research is to investigate application of modern SIMD technology to speed up the likelihood computation without degrading the recognition accuracy. In this paper, the likelihood computation of continuous density HMMs is analyzed to show that the conventional way of sequential computing is time-consuming and the likelihood computation itself can be implemented in parallel. A SIMD-based algorithm which can carry out parallel likelihood computation is presented in this paper. Likelihood computation modules in HTK3.4 toolkit have been modified with SIMD instructions to implement this algorithm. Experiments on TIMIT and WSJO corpora show that the SIMD-based data-level parallelism can significantly reduce the time overhead for likelihood computation.
Jianlin Ou Jun Cai Qian Lin
Department of Cognitive Science, Xiamen University, 361005 Xiamen, China Department of Cognitive Science, Xiamen University, 361005 Xiamen, China Groupe Parole, LORIA-CNRS &
国际会议
2008 International Conference on Audio,Language and Image Processing(2008国际声音、语言、图像过程大会)
镇江
英文
123-127
2008-07-07(万方平台首次上网日期,不代表论文的发表时间)