Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/32396
Title: A novel kernel for text classification based on semantic and statistical information
Authors: Yao, H; Zhang, B; Zhang, P; Li, M
Keywords: text categorization; semantic information; statistical information; support vector machine
Issue Date: 7-Nov-2018
Publisher: Slovak Academy of Sciences
Citation: Yao, H. et al. (2018) 'A novel kernel for text classification based on semantic and statistical information', Computing and Informatics, 37 (4), pp. 992-1010. doi: 10.4149/cai_2018_4_992.
Abstract: In text categorization, a document is usually represented by a vector space model, which can accomplish the classification task but cannot handle synonymy and polysemy in Chinese. This paper presents a novel approach that takes into account both semantic and statistical information to improve the accuracy of text classification. The proposed approach computes semantic information based on HowNet and statistical information based on a kernel function with class-based weighting. According to our experimental results, the proposed approach achieves state-of-the-art or competitive results compared with traditional approaches such as k-Nearest Neighbor (KNN) and Naive Bayes, and with deep learning models such as convolutional networks.
URI: https://bura.brunel.ac.uk/handle/2438/32396
DOI: https://doi.org/10.4149/cai_2018_4_992
ISSN: 1335-9150
Other Identifiers: ORCiD: Maozhen Li https://orcid.org/0000-0002-0820-5487
Appears in Collections: Dept of Electronic and Electrical Engineering Research Papers

Files in This Item:
File: FullText.pdf
Description: Copyright © 2018 Slovak Academy of Sciences. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/).
Size: 913.85 kB
Format: Adobe PDF


This item is licensed under a Creative Commons License.