ISAAC TRIGUERO VELAZQUEZ I.TrigueroVelazquez@nottingham.ac.uk
Associate Professor
Labelling strategies for hierarchical multi-label classification techniques
Triguero, Isaac; Vens, Celine
Authors
Celine Vens
Abstract
© 2016 Elsevier Ltd Many hierarchical multi-label classification systems predict a real valued score for every (instance, class) couple, with a higher score reflecting more confidence that the instance belongs to that class. These classifiers leave the conversion of these scores to an actual label set to the user, who applies a cut-off value to the scores. The predictive performance of these classifiers is usually evaluated using threshold independent measures like precision-recall curves. However, several applications require actual label sets, and thus an automatic labelling strategy. In this paper, we present and evaluate different alternatives to perform the actual labelling in hierarchical multi-label classification. We investigate the selection of both single and multiple thresholds. Despite the existence of multiple threshold selection strategies in non-hierarchical multi-label classification, they cannot be applied directly to the hierarchical context. The proposed strategies are implemented within two main approaches: optimisation of a certain performance measure of interest (such as F-measure or hierarchical loss), and simulating training set properties (such as class distribution or label cardinality) in the predictions. We assess the performance of the proposed labelling schemes on 10 datasets from different application domains. Our results show that selecting multiple thresholds may result in an efficient and effective solution for hierarchical multi-label problems.
Citation
Triguero, I., & Vens, C. (2016). Labelling strategies for hierarchical multi-label classification techniques. Pattern Recognition, 56, 170-183. https://doi.org/10.1016/j.patcog.2016.02.017
Journal Article Type | Article |
---|---|
Acceptance Date | Feb 24, 2016 |
Online Publication Date | Mar 4, 2016 |
Publication Date | 2016-08 |
Deposit Date | Jun 8, 2016 |
Publicly Available Date | Jun 8, 2016 |
Journal | Pattern Recognition |
Print ISSN | 0031-3203 |
Electronic ISSN | 0031-3203 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 56 |
Pages | 170-183 |
DOI | https://doi.org/10.1016/j.patcog.2016.02.017 |
Keywords | Hierarchical multi-label classification; Threshold optimisation; Hierarchical loss; HMC-loss; F-measure |
Public URL | https://nottingham-repository.worktribe.com/output/781467 |
Publisher URL | http://www.sciencedirect.com/science/article/pii/S0031320316000881 |
Contract Date | Jun 8, 2016 |
Files
HMC_labelling_revision.pdf
(963 Kb)
PDF
Copyright Statement
Copyright information regarding this work can be found at the following address: http://creativecommons.org/licenses/by-nc-nd/4.0
You might also like
MRPR: A MapReduce solution for prototype reduction in big data classification
(2014)
Journal Article
kNN-IS: an iterative spark-based design of the k-nearest neighbors classifier for big data
(2016)
Journal Article
Evolutionary undersampling for extremely imbalanced big data classification under apache spark
(2016)
Presentation / Conference Contribution
From Big data to Smart Data with the K-Nearest Neighbours algorithm
(2016)
Presentation / Conference Contribution
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search