An improved ML-kNN approach for multi-label text categorization

doi:10.11918/j.issn.0367-6234.2013.11.008

Home > Archive>Volume 45, Issue 11, 2013 >45-49. DOI:10.11918/j.issn.0367-6234.2013.11.008

An improved ML-kNN approach for multi-label text categorization
DOI:
                        10.11918/j.issn.0367-6234.2013.11.008
                    
CSTR:
                        
Author:
                        
Affiliation:(School of Computer Science and Technology, Harbin Institute of Technology, 150001 Harbin, China)
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Conventional kNN algorithms ignore label correlations when being applied to multi-label text categorization. To cover this shortage, an improved Multi-label kNN approach for text categorization is proposed. A specific distance metric based on KL divergence is derived to measure the similarity between individual documents. Based on statistical information gained from the label sets of neighboring documents, a fuzzy maximum a posteriori principle is utilized to conjecture the label sets of the unlabeled documents. Different from ML-kNN, the proposed approach can exploit label correlations to improve classification performance effectively. Experiments on three benchmark datasets using 5 popular multi-label evaluation metrics suggest that the proposed approach achieves superior performance to some well-established multi-label learning algorithms, such as ML-kNN、Rank-SVM and BoosTexter.

Reference

Cited by

Get Citation

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:
Revised:
Adopted:
Online: November 30,2013
Published:

Publication Statement

Journal Subscription

Get Citation

Related Videos

Share

Article Metrics

History

Article QR Code