Determining a Suitable Desired Factors for Nonnegative Matrix Factorization in Polyphonic Music Transcription

Nonnegative Matrix Factorization (NMF) is a technique that factorizes a given matrix V into W and H: NMF(V-mn) -> WmrHrn. NMF has been applied to various problem domains. So far there has been no fixed rule to select the number of NW factoring (i.e. the desired factor r). In music transcription,...

Full description

Bibliographic Details
Main Authors: Sophea, Seng, Phon-Amnuaisuk, Somnuk
Format: Book Section
Language:English
Published: IEEE Xplore 2007
Subjects:
Online Access:http://shdl.mmu.edu.my/3186/
http://shdl.mmu.edu.my/3186/1/Determining%20a%20Suitable%20Desired%20Factors%20for%20Nonnegative%20Matrix%20Factorization%20in%20Polyphonic%20Music%20Transcription.pdf
Description
Summary:Nonnegative Matrix Factorization (NMF) is a technique that factorizes a given matrix V into W and H: NMF(V-mn) -> WmrHrn. NMF has been applied to various problem domains. So far there has been no fixed rule to select the number of NW factoring (i.e. the desired factor r). In music transcription, the value r has the semantic of the desired musical line. If the r is not set correctly, the transcription output will not be accurate. In this paper, we present a tactic to get around this issue. We found that by concatenating an input audio stream with another audio stream with prior knowledge of r, where r >= the number of musical lines; we are guaranteed to get the correct number of musical lines in the transcription output. In our approach, NMF is applied to extract musical notes from FFT coefficients extracted from input polyphonic audio streams. The output from our experiment is encouraging and we hope to explore this further in the future.