A new method for extracting domain terminology

PEI Bing-zhen; CHEN Xiao-rong; HU Yi; LU Ru-zhan

Supervised by: Ministry of Industry and Information Technology of the People's Republic of China
Sponsored by: Harbin Institute of Technology
Editor-in-chief: Yu Zhou
ISSN: 1005-9113
CN: 23-1378/T

Related citation:

PEI Bing-zhen, CHEN Xiao-rong, HU Yi, LU Ru-zhan. A new method for extracting domain terminology [J]. Journal of Harbin Institute of Technology (New Series), 2009, 16(2): 289-296. DOI: 10.11916/j.issn.1005-9113.2009.02.029.


Author Name	Affiliation
PEI Bing-zhen	Dept. of Computer Science and Engineering, Shanghai Jiaotong University, Shanghai 200030, China, peibzgz@163.com; College of Computer Science and Technology, Guizhou University, Guiyang 550025, China
CHEN Xiao-rong	College of Computer Science and Technology, Guizhou University, Guiyang 550025, China
HU Yi	Dept. of Computer Science and Engineering, Shanghai Jiaotong University, Shanghai 200030, China, peibzgz@163.com
LU Ru-zhan	Dept. of Computer Science and Engineering, Shanghai Jiaotong University, Shanghai 200030, China, peibzgz@163.com

Abstract:

This article proposes a new general, highly efficient algorithm for extracting domain terminology. The algorithm is domain-independent and applies multiple layers of filters, combining statistics-oriented and rule-oriented methods. Exploiting the features of domain terminology and characteristics unique to Chinese, it first generates multi-word unit (MWU) candidates and then filters them through multiple strategies. Test results show that the algorithm is feasible and effective.
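
The abstract does not specify the filters themselves, so the following is only a minimal sketch of the general "generate candidates, then filter in layers" pattern it describes, not the authors' algorithm. Everything in it is an assumption made for illustration: the input is taken to be already segmented into tokens, candidates are plain n-grams, the statistical layer is a simple frequency threshold, and the rule layer rejects candidates that begin or end with a stopword. The function names and the `min_freq` and `max_len` parameters are hypothetical.

```python
from collections import Counter


def generate_candidates(tokens, max_len=3):
    """Count every contiguous n-gram of 2..max_len tokens as an MWU candidate."""
    counts = Counter()
    for n in range(2, max_len + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


def frequency_filter(candidates, min_freq=2):
    """Statistical layer (assumed): keep candidates seen at least min_freq times."""
    return {c: f for c, f in candidates.items() if f >= min_freq}


def boundary_filter(candidates, stopwords):
    """Rule layer (assumed): drop candidates that start or end with a stopword."""
    return {
        c: f
        for c, f in candidates.items()
        if c[0] not in stopwords and c[-1] not in stopwords
    }


def extract_terms(tokens, stopwords, min_freq=2, max_len=3):
    """Generate MWU candidates, then pass them through each filter layer in turn."""
    candidates = generate_candidates(tokens, max_len)
    candidates = frequency_filter(candidates, min_freq)
    candidates = boundary_filter(candidates, stopwords)
    # Most frequent first; ties broken alphabetically for a stable order.
    return sorted(candidates, key=lambda c: (-candidates[c], c))


if __name__ == "__main__":
    text = "the neural network trains the neural network on data"
    print(extract_terms(text.split(), stopwords={"the", "on"}))
```

Ordering the cheap frequency filter before the rule filter is a design choice of this sketch; for Chinese text, a word segmenter would have to produce the token list first.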

Key words: domain terminology; multi-word unit (MWU); automatic extract; filter

DOI: 10.11916/j.issn.1005-9113.2009.02.029

CLC Number: TP391

Fund:
