大工至善|大学至真分享 http://blog.sciencenet.cn/u/lcj2212916

博文

[转载]【计算机科学】【2017.03】利用深度网络学习高级音乐音频特征

已有 1849 次阅读 2019-7-14 22:05 |系统分类:科研笔记|文章来源:转载


本文为美国罗彻斯特理工学院作者:Madeleine Daigneau)的硕士论文67

 

音乐是一种反映和表达情感的手段个人对音乐的偏好因人而异会受情境和环境因素的影响本研究在尝试开发音频信号替代特征提取方法的启发下分析了利用深层网络结构从频域表现出来的音乐音频数据中提取特征基于图像的网络模型被设计成对图像特征具有鲁棒性和精确性的学习者因此本研究开发了基于图像的ImageNet深度网络模型实现从音乐声谱图中学习特征数据

 

本研究亦探讨在训练网络模型前利用音源分离工具对音效进行预处理源分离的使用允许网络模型对突出音轨特征的学习并使用这些功能改进分类结果从数据中提取的特征用于突出音轨的特征然后以此训练分类器以风格流派和自动标记分类对音乐数据进行识别将每个模型的结果与最先进的音乐曲目分类和标签预测方法进行了对比结果表明采用输入源分离的更深层次的网络可以获得最佳效果

 

Music is a means of reflecting and expressing emotion. Personal preferences in music vary between individuals, influenced by situational and environmental factors. Inspired by attempts to develop alternative feature extraction methods for audio signals, this research analyzes the use of deep network structures for extracting features from musical audio data represented in the frequency domain. Image-based network models are designed to be robust and accurate learners of image features. As such, this research develops image-based ImageNet deep network models to learn feature data from music audio spectrograms. This research also explores the use of an audio source separation tool for preprocessing the musical audio before training the network models. The use of source separation allows the network model to learn features that highlight individual contributions to the audio track, and use those features to improve classification results. The features extracted from the data are used to highlight characteristics of the audio tracks, which are then used to train classifiers that categorize the musical data for genre and autotag classifications. The results obtained from each model are contrasted with state-of-the-art methods of classification and tag prediction for musical tracks. Deeper networks with input source separation are shown to yield the best results.

 

引言

1.1 音乐信息检索

1.2 深度网络

1.3 音频/音乐信息

项目背景

2.1 音乐与深度学习

2.2 数据集

研究方法

3.1 研究框架

3.2 音频预处理

3.3 深度学习模型

研究结果

3.1 音乐流派分类

3.2 标签预测

未来研究工作展望

结论 


更多精彩文章请关注公众号:qrcode_for_gh_60b944f6c215_258.jpg




https://blog.sciencenet.cn/blog-69686-1189503.html

上一篇:[转载]【源码】圆管或矩形管空心钢筋混凝土柱的相互作用图
下一篇:[转载]【源码】中性点箝位(NPC)多电平逆变器(三电平)仿真
收藏 IP: 60.169.30.*| 热度|

0

该博文允许注册用户评论 请点击登录 评论 (0 个评论)

数据加载中...
扫一扫,分享此博文

Archiver|手机版|科学网 ( 京ICP备07017567号-12 )

GMT+8, 2024-4-19 12:25

Powered by ScienceNet.cn

Copyright © 2007- 中国科学报社

返回顶部