G. Tsoumakas, L. Angelis, I. Vlahavas, “Clustering Classifiers for Knowledge Discovery from Physically Distributed Databases”, Data and Knowledge Engineering, Elsevier, 49(3), pp. 223-242, 2004.
Author(s): Grigorios Tsoumakas, L. Angelis, I. Vlahavas
Keywords: Multi DBs, Knowledge discovery, Machine learning.
Abstract: Most distributed classification approaches view data distribution as a technical issue and combine local models aiming at a single global model. This however, is unsuitable for inherently distributed databases, which are often described by more than one classification models that might differ conceptually. In this paper we present an approach for clustering distributed classifiers in order to discover groups of similar classifiers and thus similar databases with respect to a specific classification task. We also show that clustering distributed classifiers as a pre-processing step for classifier combination enhances the achieved predictive performance of the ensemble.