The MTG-Jamendo Dataset for Automatic Music Tagging

Venue

Machine Learning for Music Discovery Workshop at the International Conference on Machine Learning (ICML 2019)

Publication Year

2019

Authors

  • Dmitry Bogdanov
  • Minz Won
  • Philip Tovstogan
  • Alastair Porter
  • Xavier Serra

Abstract

We present the MTG-Jamendo Dataset, a new open dataset for music auto-tagging. It is built using music available at Jamendo under Creative Commons licenses and tags provided by content uploaders. The dataset contains over 55,000 full audio tracks with 195 tags from genre, instru- ment, and mood/theme categories. We provide elaborated data splits for researchers and report the performance of a simple baseline approach on five different sets of tags: genre, instrument, mood/theme, top-50, and overall.