Vanwinckelen, Gitte; Tragante do O, Vinicius; Fierens, Daan; Blockeel, Hendrik

doi:10.1007/s10618-015-0416-z

Back to matches

Your institution may have rights to this item. Sign in to continue.

Title: Instance-level accuracy versus bag-level accuracy in multi-instance learning.
Authors: Vanwinckelen, Gitte; Tragante do O, Vinicius; Fierens, Daan; Blockeel, Hendrik
Abstract: In multi-instance learning, instances are organized into bags, and a bag is labeled positive if it contains at least one positive instance, and negative otherwise; the labels of the individual instances are not given. The task is to learn a classifier from this limited information. While the original task description involved learning an instance classifier, in the literature the task is often interpreted as learning a bag classifier. Depending on which of these two interpretations is used, it is more natural to evaluate classifiers according to how well they predict, respectively, instance labels or bag labels. In the literature, however, the two interpretations are often mixed, or the intended interpretation is left implicit. In this paper, we investigate the difference between bag-level and instance-level accuracy, both analytically and empirically. We show that there is a substantial difference between these two, and better performance on one does not necessarily imply better performance on the other. It is therefore useful to clearly distinguish the two settings, and always use the evaluation criterion most relevant for the task at hand. We show experimentally that the same conclusions hold for area under the ROC curve.
Subjects: MACHINE learning; INFORMATION theory; PROBABILITY theory; BIG data; COMPUTER science
Publication: Data Mining & Knowledge Discovery, 2016, Vol 30, Issue 2, p313
ISSN: 1384-5810
Publication type: Article
DOI: 10.1007/s10618-015-0416-z

We found a match

Instance-level accuracy versus bag-level accuracy in multi-instance learning.

Vanwinckelen, Gitte; Tragante do O, Vinicius; Fierens, Daan; Blockeel, Hendrik

MACHINE learning; INFORMATION theory; PROBABILITY theory; BIG data; COMPUTER science

Data Mining & Knowledge Discovery, 2016, Vol 30, Issue 2, p313

1384-5810

Article

10.1007/s10618-015-0416-z