会议专题

Unsupervised Query-by-example spoken term detection based on DPHMM tokenizer

  This paper investigates the use of Dirichlet process hidden Markov model(DPHMM)tokenizer for the template matching based query-by-example spoken term detection(QbE-STD)task.DPHMM can be obtained following an unsupervised iterative procedure without any training transcriptions.The STD performance of the DPHMM tokenizer is evaluated on TIMIT Corpus.We construct three kinds of DPHMM based QbE-STD systems.Compared to GMM baseline system,posteriorgram based system has a better detection precision,acoustic unit has a better detection speed,and pseudo feedback based system has a good balance between precision and speed.Although nonparametric DPHMM is still worse than supervised language-matched phoneme recognizer,but it outperforms other unsupervised tokenizers and language-mismatched phoneme recognizer.

unsupervised Dirichlet process hidden Markov model query-by-example spoken term detection pseudo feedback

Cao Jiankai Zhang Lianhai

NDSC Zhengzhou,China

国际会议

2017 IEEE 2nd Advanced Information Technology,Electronic and Automation Control Conference(IAEAC 2017)(2017 IEEE 第2届先进信息技术、电子与自动化控制国际会议)

重庆

英文

1321-1325

2017-03-25(万方平台首次上网日期,不代表论文的发表时间)