Please use this identifier to cite or link to this item:
Title: Distributed data mining in grid computing environments
Authors: Luo, P
Lü, K
Shi, Z
He, Q
Keywords: Distributed data mining;Directed acyclic graph;InterGrid;IntraGrid;Multi-agent system environment
Issue Date: 2007
Publisher: Elsevier
Citation: Future Generation Computer Systems 23(1): 84-91, Jan 2007
Abstract: The computing-intensive data mining for inherently Internet-wide distributed data, referred to as Distributed Data Mining (DDM), calls for the support of a powerful Grid with an effective scheduling framework. DDM often shares the computing paradigm of local processing and global synthesizing. It involves every phase of Data Mining (DM) processes, which makes the workflow of DDM very complex and can be modelled only by a Directed Acyclic Graph (DAG) with multiple data entries. Motivated by the need for a practical solution of the Grid scheduling problem for the DDM workflow, this paper proposes a novel two-phase scheduling framework, including External Scheduling and Internal Scheduling, on a two-level Grid architecture (InterGrid, IntraGrid). Currently a DM IntraGrid, named DMGCE (Data Mining Grid Computing Environment), has been developed with a dynamic scheduling framework for competitive DAGs in a heterogeneous computing environment. This system is implemented in an established Multi-Agent System (MAS) environment, in which the reuse of existing DM algorithms is achieved by encapsulating them into agents. Practical classification problems from oil well logging analysis are used to measure the system performance. The detailed experiment procedure and result analysis are also discussed in this paper.
Description: The official published version of this article can be found at the link below.
ISSN: 0167-739X
Appears in Collections:Business and Management
Brunel Business School Research Papers

Files in This Item:
File Description SizeFormat 
Fulltext.pdf542.57 kBAdobe PDFView/Open

Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.