Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/8787
Title: | An ontology enhanced parallel SVM for scalable spam filter training |
Authors: | Caruana, G Li, M Liu, Y |
Keywords: | Spam filtering;Support vector machine;Parallel computing;Classification;MapReduce |
Issue Date: | 2013 |
Publisher: | Elsevier |
Citation: | Neurocomputing, 108, 45 - 57, 2013 |
Abstract: | Spam, under a variety of shapes and forms, continues to inflict increased damage. Varying approaches including Support Vector Machine (SVM) techniques have been proposed for spam filter training and classification. However, SVM training is a computationally intensive process. This paper presents a MapReduce based parallel SVM algorithm for scalable spam filter training. By distributing, processing and optimizing the subsets of the training data across multiple participating computer nodes, the parallel SVM reduces the training time significantly. Ontology semantics are employed to minimize the impact of accuracy degradation when distributing the training data among a number of SVM classifiers. Experimental results show that ontology based augmentation improves the accuracy level of the parallel SVM beyond the original sequential counterpart. |
Description: | This is the post-print version of the final paper published in Neurocomputing. The published article is available from the link below. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. Copyright @ 2013 Elsevier B.V. |
URI: | http://www.sciencedirect.com/science/article/pii/S0925231212008910 http://bura.brunel.ac.uk/handle/2438/8787 |
DOI: | http://dx.doi.org/10.1016/j.neucom.2012.12.001 |
ISSN: | 0925-2312 |
Appears in Collections: | Electronic and Electrical Engineering Dept of Electronic and Electrical Engineering Research Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Fulltext.pdf | 1.73 MB | Adobe PDF | View/Open |
Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.