Transcribing Bach Chorales Using Particle Swarm Optimisations
This paper reports a novel application of particle swarm optimisation (PSO) to the polyphonic transcription task. The system transforms an input audio signal into activation strengths of pitches in the desired range. The transformation proceeds from audio information in the time domain to the frequency domain and, finally, to activation strengths of pitches (a.k.a. a piano-roll representation). We can infer the likely sounding pitches by comparing the observed activation strengths of the input audio to reference Tone-models. Although each Tone-model is learned offline from the pitches one wishes to transcribe, this process often only approximates the Tone-model characteristics, owing to variations in volume and other effects introduced by the manner of note execution. Hence, predicting sounding notes based solely on Tone-models gives inaccurate predictions. Here, we apply PSO to search for an optimum aggregation of different predicted pitches that best represents the input activation strengths. We describe our problem formulation and the design of our approach. The experimental results suggest that the approach has potential for the task of polyphonic transcription.
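The search step described in the abstract can be illustrated with a minimal sketch. It is not the author's implementation: the tone-model values, the eight-bin activation vectors, the squared-error fitness, the swarm parameters, and the 0.5 detection threshold are all assumptions chosen for illustration. Each particle holds one weight per candidate pitch, and a standard global-best PSO looks for the weighted sum of tone-models closest to the observed activation.

```python
import random

# Hypothetical tone-models: activation strength over 8 bins per pitch.
# The values are invented; real tone-models would be learned offline.
TONE_MODELS = {
    "C4": [1.0, 0.0, 0.5, 0.0, 0.0, 0.2, 0.0, 0.0],
    "E4": [0.0, 1.0, 0.0, 0.5, 0.0, 0.0, 0.2, 0.0],
    "G4": [0.0, 0.0, 1.0, 0.0, 0.5, 0.0, 0.0, 0.2],
    "C5": [0.0, 0.0, 0.0, 1.0, 0.0, 0.5, 0.0, 0.0],
}
NAMES = list(TONE_MODELS)


def mix(weights):
    """Aggregate the tone-models using one weight per pitch."""
    n_bins = len(TONE_MODELS[NAMES[0]])
    return [sum(w * TONE_MODELS[n][b] for w, n in zip(weights, NAMES))
            for b in range(n_bins)]


def error(weights, observed):
    """Squared error between the aggregated models and the observation."""
    return sum((o - m) ** 2 for o, m in zip(observed, mix(weights)))


def pso(observed, n_particles=30, n_iters=200, seed=0):
    """Global-best PSO over pitch weights constrained to [0, 1]."""
    rng = random.Random(seed)
    dim = len(NAMES)
    pos = [[rng.random() for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_err = [error(p, observed) for p in pos]
    g = min(range(n_particles), key=pbest_err.__getitem__)
    gbest, gbest_err = pbest[g][:], pbest_err[g]
    w, c1, c2 = 0.7, 1.5, 1.5  # inertia, cognitive and social coefficients
    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                vel[i][d] = (w * vel[i][d]
                             + c1 * rng.random() * (pbest[i][d] - pos[i][d])
                             + c2 * rng.random() * (gbest[d] - pos[i][d]))
                # Clamp each weight to the allowed range.
                pos[i][d] = min(1.0, max(0.0, pos[i][d] + vel[i][d]))
            err = error(pos[i], observed)
            if err < pbest_err[i]:
                pbest[i], pbest_err[i] = pos[i][:], err
                if err < gbest_err:
                    gbest, gbest_err = pos[i][:], err
    return dict(zip(NAMES, gbest)), gbest_err


# Synthetic observation: C4 and G4 sounding at different volumes.
observed = mix([0.9, 0.0, 0.7, 0.0])
weights, best_err = pso(observed)
detected = sorted(p for p, w in weights.items() if w > 0.5)
print(detected, round(best_err, 6))
```

In this toy setting the tone-models overlap in some bins (C4 and G4 both contribute to the third bin), which is why matching each model independently can mislead and a joint search over the weights is useful. A real system would evaluate one such search per analysis frame.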
Particle swarm optimisation; Polyphonic transcription; Tone-models; Transcribing Bach chorales
Somnuk Phon-Amnuaisuk
Music Informatics Research Group, Faculty of Business and Computing, Brunei Institute of Technology, Brunei Darussalam
International conference
4th International Conference, ICSI 2013 (4th International Conference on Swarm Intelligence)
Harbin
English
192-199
2013-06-12 (date first made available online on the Wanfang platform; not the paper's publication date)