Title: Analysing outliers cautiously
Authors: Liu, X; Cheng, G; Wu, J
Keywords: Data mining; Knowledge based systems; Measurement errors; Medical computing; Self-organising feature maps
Issue Date: 2002
Publisher: IEEE
Citation: IEEE Transactions on Knowledge and Data Engineering 14: 432-437, Apr 2002
Abstract: Outliers are difficult to handle because some of them can be measurement errors, while others may represent phenomena of interest, something "significant" from the viewpoint of the application domain. Statistical and computational methods have been proposed to detect outliers, but further analysis of outliers requires much relevant domain knowledge. In our previous work (1994), we suggested a knowledge-based method for distinguishing between the measurement errors and phenomena of interest by modelling "real measurements" - how measurements should be distributed in an application domain. In this paper, we make this distinction by modelling measurement errors instead. This is a cautious approach to outlier analysis, which has been successfully applied to a medical problem and may find interesting applications in other domains such as science, engineering, finance, and economics.
Appears in Collections: Computer Science; Dept of Computer Science Research Papers

Files in This Item:
00991726.pdf (478.4 kB, Adobe PDF)

Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.