Links
Abstract
Music recommendation systems are pivotal in the escalating success of online music platforms. These systems, tailored for individual users, rely on two strategies: recommendations based on a user’s listened song components or on choices made by similar users [1]. Among these musical components, genre stands out as crucial, both for system construction and music classification. Despite challenges in genre definition and automatic classification performance stagnation, recent studies highlight users’ continued preference for genre-based recommendations [2]. Genre classification faces subjectivity due to cultural influences, leading to ambiguous definitions, making it challenging to categorize songs [2]. Utilizing audio data, web scraping, and occasionally text or image data, genre classification often benefits from multimodal approaches, outperforming unimodal methods [3]. Current approaches, including deep learning techniques and matrix factorization, are prevalent in this field [4]. This work is an applied statistics project initiated by Deezer , a music platform. The project aims to build a model associating genre labels with each music title in a database using audio signal data and listener-related information, operating within a multilabel classification framework.
Collaborators
This work has been done in a group work with Zineb Bentires , Sirine Louati , and Louise Sirven , under the supervision of Léa Briand from Deezer Paris as part of the Applied statistics course at ENSAE Paris.
References
[1] Y. Koren, R. Bell and C. Volinsky, “Matrix Factorization Techniques for Recommender Systems,” in Computer, vol. 42, no. 8, pp. 30-37, Aug. 2009, doi: 10.1109/MC.2009.263.
[2] McKay, Cory & Fujinaga, Ichiro. (2006). Musical Genre Classification: Is It Worth Pursuing and How Can It be Improved?.
[3] Oramas, S., Nieto, O., Barbieri, F. et Serra, X. (2017). Multi-label music genre classification from audio, text, and images using deep features.
[4] Choi, K., Fazekas, G., & Sandler, M. (2016). Automatic tagging using deep convolutional neural networks.