Paraphrase identification based on hierarchical neural network

YUAN Lei; GAO Shu; GUO Miao; YUAN Ziyong

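Supervised by: Ministry of Industry and Information Technology of the People's Republic of China. Sponsored by: Harbin Institute of Technology. Editor-in-Chief: LI Longqiu. ISSN 0367-6234. CN 23-1235/T.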

Cite this article: YUAN Lei, GAO Shu, GUO Miao, YUAN Ziyong. Paraphrase identification based on hierarchical neural network[J]. Journal of Harbin Institute of Technology, 2020, 52(10): 175. DOI: 10.11918/201910183

(School of Computer Science and Technology, Wuhan University of Technology, Wuhan 430000, China)

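Abstract:

Paraphrase identification (PI) is widely used in question answering, plagiarism detection, personalized recommendation, and related fields. Existing PI methods lack an effective feature extraction mechanism, so a new PI model is proposed. Instead of the traditional "encoding-matching" pattern, the model follows an "encoding-matching-extraction" pattern, adding a feature extraction layer that further extracts classification information. The model consists of six layers: input, embedding, encoding, matching, feature extraction, and output. In the encoding layer, an attention-based contextual bidirectional long short-term memory network (BiLSTM) encodes the text context, making full use of both the forward and the backward context of a sentence. In the matching layer, several matrix operations capture sentence-pair matching information from different perspectives. In the feature extraction layer, an Xception network extracts classification information from the matching results more effectively. In addition, a multi-feature fusion approach concatenates GloVe pretrained word vectors, character vectors, and additional feature vectors to form the final word representation, which carries richer semantic information than ordinary word vectors. Experimental results show that the model achieves competitive results on two public datasets, Quora and SemEval-2015 PIT, taken as representatives of large datasets and of small-to-medium datasets, respectively.

Keywords: natural language processing; paraphrase identification; Xception; attention mechanism; bidirectional long short-term memory network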

DOI: 10.11918/201910183

CLC number: TP391

Document code: A

Funding: National Natural Science Foundation of China (51679180)
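
The abstract says only that the matching layer applies several matrix operations to the two encoded sentences; it does not name them. As a minimal sketch, assuming dot product, cosine similarity, and Euclidean distance as the operations (an illustrative choice, not the authors' stated set), each operation yields one m x n interaction matrix, and the resulting stack can be treated as a multi-channel input for a convolutional feature extractor. The function name `match_channels` and the toy vectors are hypothetical.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def norm(u):
    """Euclidean length of a vector."""
    return math.sqrt(dot(u, u))


def match_channels(sent_a, sent_b, eps=1e-8):
    """Build one m x n interaction matrix per matching operation.

    sent_a: list of m token vectors (encoder outputs for sentence A).
    sent_b: list of n token vectors (encoder outputs for sentence B).
    The three operations are assumptions made for illustration only.
    """
    dot_m = [[dot(u, v) for v in sent_b] for u in sent_a]
    cos_m = [[dot(u, v) / (norm(u) * norm(v) + eps) for v in sent_b]
             for u in sent_a]
    dist_m = [[math.dist(u, v) for v in sent_b] for u in sent_a]
    return {"dot": dot_m, "cosine": cos_m, "euclidean": dist_m}


# Toy encoder outputs: sentence A has 3 tokens, sentence B has 2 tokens.
a = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
b = [[1.0, 0.0], [0.0, 2.0]]
channels = match_channels(a, b)
print(channels["dot"])  # [[1.0, 0.0], [0.0, 2.0], [1.0, 2.0]]
```

Each matrix has one row per token of the first sentence and one column per token of the second, so sentence pairs of different lengths would need padding or resizing before being fed to a fixed-size image model.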

Paraphrase identification based on hierarchical neural network

YUAN Lei,GAO Shu,GUO Miao,YUAN Ziyong

(School of Computer Science and Technology, Wuhan University of Technology, Wuhan 430000, China)

Abstract:

Paraphrase identification is widely used in question answering systems, plagiarism detection, and personalized recommendation. Since existing paraphrase identification techniques lack an effective feature extraction mechanism, a new paraphrase identification model is proposed. Unlike previous works, which normally adopt the “encoding-matching” mode, the proposed model adopts the “encoding-matching-extraction” mode, adding a feature extraction layer to better acquire classification information. The proposed model consists of six layers: input layer, embedding layer, encoding layer, matching layer, feature extraction layer, and output layer. The encoding layer uses a contextual bi-directional long short-term memory network (BiLSTM) with self-attention to encode the context of sentences, making full use of contextual information in both the forward and reverse directions of a sentence. The matching layer uses several matrix operations to obtain sentence-pair matching information from different angles. The feature extraction layer uses Xception as the feature extractor to better extract classification information from the matching results. Moreover, GloVe word vectors, character vectors, and additional feature vectors are concatenated to form the final embeddings, which carry richer information than ordinary pretrained embeddings. Results show that the proposed model achieves competitive results on two public datasets: Quora Question Pairs (as a representative of large datasets) and SemEval-2015 PIT (as a representative of small and medium datasets).

Key words: natural language processing; paraphrase identification; Xception; attention mechanism; BiLSTM
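
Xception, the feature extractor named in the abstract, is built from depthwise separable convolutions: a per-channel spatial convolution followed by a 1x1 (pointwise) convolution that mixes channels. The sketch below shows that single operation in plain Python on a toy two-channel input, of the kind a matching layer might emit. It is not the authors' network: the helper names, kernels, and weights are hypothetical, and padding, stride, bias, nonlinearities, and residual connections are all omitted.

```python
def depthwise_conv(channels, kernels):
    """Valid (no padding), stride-1 2D cross-correlation, one kernel per channel.

    channels: list of C matrices (H x W); kernels: list of C matrices (k x k).
    """
    out = []
    for x, k in zip(channels, kernels):
        kh, kw = len(k), len(k[0])
        h, w = len(x) - kh + 1, len(x[0]) - kw + 1
        out.append([[sum(x[i + di][j + dj] * k[di][dj]
                         for di in range(kh) for dj in range(kw))
                     for j in range(w)]
                    for i in range(h)])
    return out


def pointwise_conv(channels, weights):
    """1x1 convolution: each output channel is a weighted sum of input channels.

    weights: list of O rows, each holding C mixing coefficients.
    """
    h, w = len(channels[0]), len(channels[0][0])
    return [[[sum(wc * ch[i][j] for wc, ch in zip(row, channels))
              for j in range(w)]
             for i in range(h)]
            for row in weights]


def separable_conv(channels, kernels, weights):
    """Depthwise step followed by pointwise step."""
    return pointwise_conv(depthwise_conv(channels, kernels), weights)


x = [
    [[1, 2, 3], [4, 5, 6], [7, 8, 9]],  # channel 0
    [[1, 0, 1], [0, 1, 0], [1, 0, 1]],  # channel 1
]
kernels = [
    [[1, 0], [0, 1]],  # sums the main diagonal of each 2x2 window
    [[1, 1], [1, 1]],  # sums every entry of each 2x2 window
]
weights = [[1, 1], [1, -1]]  # two output channels: sum and difference
y = separable_conv(x, kernels, weights)
print(y)  # [[[8, 10], [14, 16]], [[4, 6], [10, 12]]]
```

For a k x k kernel with C input and O output channels, a standard convolution has k·k·C·O weights, while the separable form has k·k·C + C·O, which is why the factorization is attractive for a feature extractor stacked on top of an encoder.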
