大工至善|大学至真分享 http://blog.sciencenet.cn/u/lcj2212916

博文

[转载]【计算机科学】【2015】用于语句分类的卷积神经网络

已有 1556 次阅读 2019-7-16 17:55 |系统分类:科研笔记|文章来源:转载


本文为加拿大滑铁卢大学作者:Yahui Chen)的硕士论文62

 

知识库支持问答(KB-supported QA)系统的目标是通过从知识数据库中获取答案来回答自然语言的查询知识数据库以实体关系三种形式存储知识QA系统通过提取实体和关系对来理解问题本论文的目的是识别出一个问题中的关系型候选人我们为这个具有挑战性的任务定义了一个多标签分类问题基于单词的word2vec表示形式我们提出了两种卷积神经网络(CNN)来解决多标签分类问题并行CNN和深度CNN并行CNN包含四个并行卷积层而深度CNN包含两个串行卷积层两个模型的卷积层都捕获了局部语义特征最大时间池化层放在最后一个卷积层的顶部以选择全局语义特性具有dropout的完全连接层用于概括所有特征实验表明这两种模型在很大程度上优于传统的支持向量分类(SVC)方法此外我们观察到深度CNN比并行CNN具有更好的性能这表明深度结构比宽而浅的网络具有更强的语义学习能力

 

The goal of a Knowledge Base–supported Question Answering (KB-supported QA) system is to answer a query natural language by obtaining the answer from a knowledge database, which stores knowledge in the form of (entity, relation, value) triples. QA systems understand questions by extracting entity and relation pairs. This thesis aims at recognizing the relation candidates inside a question. We define a multi-label classification problem for this challenging task. Based on the word2vec representation of words, we propose two convolutional neural networks (CNNs) to solve the multi-label classification problem, namely Parallel CNN and Deep CNN. The Parallel CNN contains four parallel convolutional layers while Deep CNN contains two serial convolutional layers. The convolutional layers of both the models capture local semantic features. A max over time pooling layer is placed on the top of the last convolutional layer to select global semantic features. Fully connected layers with dropout are used to summarize the features. Our experiments show that these two models outperform the traditional Support Vector Classification (SVC)–based method by a large margin. Furthermore, we observe that Deep CNN has better performance than Parallel CNN, indicating that the deep structure enables much stronger semantic learning capacity than the wide but shallow network.

 

引言

项目背景

相关工作

数据集与环境

主要研究结果

结论

附录 实现CNNPython代码 


更多精彩文章请关注公众号:qrcode_for_gh_60b944f6c215_258.jpg



https://blog.sciencenet.cn/blog-69686-1189830.html

上一篇:[转载]【新书推荐】【2018.07】缝隙波导阵列天线:理论,分析与设计
下一篇:[转载]【源码】OpenCossan:用于不确定性量化和管理的工具箱
收藏 IP: 60.169.68.*| 热度|

0

该博文允许注册用户评论 请点击登录 评论 (0 个评论)

数据加载中...
扫一扫,分享此博文

Archiver|手机版|科学网 ( 京ICP备07017567号-12 )

GMT+8, 2024-4-20 10:21

Powered by ScienceNet.cn

Copyright © 2007- 中国科学报社

返回顶部