Statistical and Visual Analysis of Audio, Text, and Image Features for Multi-Modal Music Genre Recognition
We present a multi-modal genre recognition framework that considers the modalities audio, text, and image by features extracted from audio signals, album cover images, and lyrics of music tracks. In contrast to pure learning of features by a neural network as done in the related work, handcrafted fe...
Saved in:
| Main Authors: | Ben Wilkes, Igor Vatolkin, Heinrich Müller |
|---|---|
| Format: | article |
| Language: | EN |
| Published: |
MDPI AG
2021
|
| Subjects: | |
| Online Access: | https://doaj.org/article/260d78d3e8cc474fbad690f2379f312d |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Using machine learning analysis to interpret the relationship between music emotion and lyric features
by: Liang Xu, et al.
Published: (2021) -
Audio Feature Engineering for Occupancy and Activity Estimation in Smart Buildings
by: Gabriela Santiago, et al.
Published: (2021) -
Bird Species Identification Using Spectrogram Based on Multi-Channel Fusion of DCNNs
by: Feiyu Zhang, et al.
Published: (2021) -
Grammar features and discourse style in digital genres: The case of science-focused crowdfunding projects
by: Pérez-Llantada,Carmen
Published: (2021) -
Boosting COVID-19 Image Classification Using MobileNetV3 and Aquila Optimizer Algorithm
by: Mohamed Abd Elaziz, et al.
Published: (2021)