Please submit manuscripts in either of the following two submission systems

    ScholarOne Manuscripts

  • ScholarOne
  • 勤云稿件系统

  • 登录

Search by Issue

  • 2024 Vol.31
  • 2023 Vol.30
  • 2022 Vol.29
  • 2021 Vol.28
  • 2020 Vol.27
  • 2019 Vol.26
  • 2018 Vol.25
  • 2017 Vol.24
  • 2016 vol.23
  • 2015 vol.22
  • 2014 vol.21
  • 2013 vol.20
  • 2012 vol.19
  • 2011 vol.18
  • 2010 vol.17
  • 2009 vol.16
  • No.1
  • No.2

Supervised by Ministry of Industry and Information Technology of The People's Republic of China Sponsored by Harbin Institute of Technology Editor-in-chief Yu Zhou ISSNISSN 1005-9113 CNCN 23-1378/T

期刊网站二维码
微信公众号二维码
Related citation:PEI Bing-zhen,CHEN Xiao-rong,HU Yi,LU Ru-zhan.A new method for extracting domain terminology[J].Journal of Harbin Institute Of Technology(New Series),2009,16(2):289-296.DOI:10.11916/j.issn.1005-9113.2009.02.029.
【Print】   【HTML】   【PDF download】   View/Add Comment  Download reader   Close
←Previous|Next→ Back Issue    Advanced Search
This paper has been: browsed 619times   downloaded 320times 本文二维码信息
码上扫一扫!
Shared by: Wechat More
A new method for extracting domain terminology
Author NameAffiliation
PEI Bing-zhen Dept.of Computer Science and Engineering, Shanghai Jiaotong University, Shanghai 200030, China, peibzgz@163.com
College of Computer Science and Technology, Guizhou University, Guiyang 550025, China 
CHEN Xiao-rong College of Computer Science and Technology, Guizhou University, Guiyang 550025, China 
HU Yi Dept.of Computer Science and Engineering, Shanghai Jiaotong University, Shanghai 200030, China, peibzgz@163.com 
LU Ru-zhan Dept.of Computer Science and Engineering, Shanghai Jiaotong University, Shanghai 200030, China, peibzgz@163.com 
Abstract:
This article proposes a new general, highly efficient algorithm for extracting domain terminologies. This domain-independent algorithm with multi-layers of filters is a hybrid of statistic-oriented and rule-oriented methods. Utilizing the features of domain terminologies and the characteristics that are unique to Chinese, this algorithm extracts domain terminologies by generating multi-word unit (MWU) candidates at first and then filtering the candidates through multi-strategies. Our test results show that this algorithm is feasible and effective.
Key words:  domain terminology  multi-word unit (MWU)  automatic extract  filter
DOI:10.11916/j.issn.1005-9113.2009.02.029
Clc Number:TP391
Fund:

LINKS