Paper Details

  • Title:

    Effective and Efficient Multilabel Classification in Domains with Large Number of Labels

  • Author(s):

    Grigorios Tsoumakas, I. Katakis, I. Vlahavas

  • Keywords: -
  • Abstract:

    This paper contributes a novel algorithm for effective and computationally efficient multilabel classification in domains with large label sets L. The HOMER algorithm constructs a Hierarchy Of Multilabel classifiERs, each one dealing with a much smaller set of labels compared to L and a more balanced example distribution. This leads to improved predictive performance along with linear training and logarithmic testing complexities with respect to |L|. Label distribution from parent to children nodes is achieved via a new balanced clustering algorithm, called balanced k means.

  • Category: Conference Papers
  • Tags: 2008 Tsoumakas Katakis Vlahavas