On the Perceptual Relevance of Objective Source Separation Measures for Singing Voice Separation
2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 1–5
1543 MUSHRA test,audio signal processing,audio source separation method,Correlation,Distortion,Distortion measurement,information retrieval,Instruments,ITU-R BS,MUSHRA,music,Music Information Retrieval,objective source separation,perceptual relevance,Radio frequency,Signal processing algorithms,singing voice separation,Singing Voice Separation,source separation,Source separation,Source Separation,SVS algorithm
- U. Gupta
- E. Moore
- A. Lerch
Singing Voice Separation (SVS) is a task which uses audio source separation methods to isolate the vocal component from the background accompaniment for a song mix. This paper discusses the methods of evaluating SVS algorithms, and determines how the current state of the art measures correlate to human perception. A modified ITU-R BS.1543 MUSHRA test is used to get the human perceptual ratings for the outputs of various SVS algorithms, which are correlated with widely used objective measures for source separation quality. The results show that while the objective measures provide a moderate correlation with perceived intelligibility and isolation, they may not adequately assess the overall perceptual quality.