引用本文: | 刘美玲,赵铁军,郑德权,于摇洋.面向TDT的动态多文档文摘研究[J].哈尔滨工业大学学报,2010,42(11):1767.DOI:10.11918/j.issn.0367-6234.2010.11.019 |
| .Research on dynamic multi-document summarization by topic detection and tracking technology[J].Journal of Harbin Institute of Technology,2010,42(11):1767.DOI:10.11918/j.issn.0367-6234.2010.11.019 |
|
摘要: |
应用话题检测技术中的聚类思想,分析了动态多文档文摘时间特性.利用时间信息的阈值变化得到不同的多文档聚类结果,进而生成基于动态网页信息数据流的多文档文摘.针对不同阈值的多文档文摘比较,了解时间信息在动态的多文档文摘中的重要性.实验证明在TDT的话题检测技术中,可通过调整α值的方法来平衡TDT的输出结果,进而生成质量更好的动态多文档文摘。 |
关键词: 话题检测 时间信息 动态阈值 多文档文摘 |
DOI:10.11918/j.issn.0367-6234.2010.11.019 |
分类号:TP391.1 |
基金项目:国家高技术研究发展计划重点资助项目(2006AA010108);国家自然科学基金重点资助项目(60736044) |
|
Research on dynamic multi-document summarization by topic detection and tracking technology |
|
1.School of Computer Science and Technology,Harbin Institute of Technology,Harbin 150001,China;2.College of Information and Computer Engineering,Northeast Forestry University,Harbin 150040,China)
|
Abstract: |
In this paper time characteristics of dynamic multi-documents summarization has been analyzed by the clustering idea in topic detection technology.From the change of time information threshold value,differ-ent multi-document clustering and multi-documents summarization for the dynamic Web information data stream can be generated.By compared with different threshold values,the importance of time information in a dynamic multi-document summarization is understanded.Experimental result means that you can counterpoise the output of TDT by adjusting α value and generate better quality dynamic multi-document summarization. |
Key words: topic detection temporal properties dynamic threshold multi-document summarization |