会议专题

Musical Visualization and FO Estimation Using Neural Network

This paper investigates how to extend ability of feed forward neural network for purposes of musical note visualization and FO estimation. We set experiments to find the best features that introduce high generalization rate and good FO estimation result per single audio frame. These features were extracted from spectral data, autocorrelation, auditory filter bank, and modified Ceptral methods. The samples in our experiments were generated using real musical instrument sound recordings. To compare all investigated features, we trained 56 neural networks with random mixtures up to 4 simultaneous notes and evaluated with both random note combinations and chord patterns. The experiments shown that using features from auditory filter bank, our system gives better estimation results for musical instrument signals with variations in both amplitude and phase. Finally, we evaluated visualization of our system using audio signals from both synthesizer and CD recordings.

Pat Taweewat

School of Electrical and Information Engineering, The University of Sydney, NSW, Australia

国际会议

2010 International Conference on Audio,Language and Image Processing(2010年音频、语言与图像处理国际会议 ICALIP 2010)

上海

英文

346-352

2010-11-23(万方平台首次上网日期,不代表论文的发表时间)