Statistical and Visual Analysis of Audio, Text, and Image Features for Multi-Modal Music Genre Recognition

We present a multi-modal genre recognition framework that covers the audio, text, and image modalities through features extracted from the audio signals, lyrics, and album cover images of music tracks. In contrast to pure learning of features by a neural network as done in the related work, handcrafted fe...

Bibliographic Details
Main Authors: Ben Wilkes, Igor Vatolkin, Heinrich Müller
Format: Article
Language: EN
Published: MDPI AG 2021
Subjects:
Q
Online Access: https://doaj.org/article/260d78d3e8cc474fbad690f2379f312d