会议专题

Transcribing Bach Chorales Using Non-negative Matrix Factorisation

This paper discusses our research on polyphonic music transcription using non-negative matrix factorization (NMF). The application of NMF in polyphonic transcription has two known limitations (i) the transcription output is a permutation of the input source signals (e.g., the following polyphonic input notes c, e, g and b may produce polyphonic output notes in the following order c, b, g and e) and (ii) the accuracy of the transcription depends on the accuracy of the factor r where r is the actual number of active pitches. This work proposes a novel approach by exploiting a tone model to tackle both the permutation of transcription output and the unknown factoring r issues. In our current implementation, the tone model is learned from the training data consisting of the pitches of the desired instrument. This approach offers an effective exploitation of the domain knowledge (i.e., tone model of each pitch). The empirical results show that the proposed tone-model initialised NMF (ICTM-NMF) could significantly improve the transcription output accuracy.

Somnuk Phon-Amnuaisuk

Music Informatic Research Group, Faculty of Creative Industry, University Tunku Abdul Rahman (UTAR), Petaling Jaya Campus, Selangor Darul Ehsan, Malaysia

国际会议

第十届中国虚拟现实年会

上海

英文

688-693

2010-10-20(万方平台首次上网日期,不代表论文的发表时间)