Utkarsh Agrawal
Combining clustering and classification ensembles: A novel pipeline to identify breast cancer profiles
Agrawal, Utkarsh; Soria, Daniele; Wagner, Christian; Garibaldi, Jonathan; Ellis, Ian O.; Bartlett, John M. S.; Cameron, David; Rakha, Emad A.; Green, Andrew R.
Authors
Daniele Soria
CHRISTIAN WAGNER Christian.Wagner@nottingham.ac.uk
Professor of Computer Science
Prof. JONATHAN GARIBALDI JON.GARIBALDI@NOTTINGHAM.AC.UK
Provost and Pvc Unnc
Ian O. Ellis
John M. S. Bartlett
David Cameron
EMAD RAKHA Emad.Rakha@nottingham.ac.uk
Professor of Breast Cancer Pathology
ANDREW GREEN ANDREW.GREEN@NOTTINGHAM.AC.UK
Associate Professor
Abstract
Breast Cancer is one of the most common causes of cancer death in women, representing a very complex disease with varied molecular alterations. To assist breast cancer prognosis, the classification of patients into biological groups is of great significance for treatment strategies. Recent studies have used an ensemble of multiple clustering algorithms to elucidate the most characteristic biological groups of breast cancer. However, the combination of various clustering methods resulted in a number of patients remaining unclustered. Therefore, a framework still needs to be developed which can assign as many unclustered (i.e. biologically diverse) patients to one of the identified groups in order to improve classification. Therefore, in this paper we develop a novel classification framework which introduces a new ensemble classification stage after the ensemble clustering stage to target the unclustered patients. Thus, a step-by-step pipeline is introduced which couples ensemble clustering with ensemble classification for the identification of core groups, data distribution in them and improvement in final classification results by targeting the unclustered data. The proposed pipeline is employed on a novel real world breast cancer dataset and subsequently its robustness and stability are examined by testing it on standard datasets. The results show that by using the presented framework, an improved classification is obtained. Finally, the results have been verified using statistical tests, visualisation techniques, cluster quality assessment and interpretation from clinical experts.
Citation
Agrawal, U., Soria, D., Wagner, C., Garibaldi, J., Ellis, I. O., Bartlett, J. M. S., …Green, A. R. (2019). Combining clustering and classification ensembles: A novel pipeline to identify breast cancer profiles. Artificial Intelligence in Medicine, 97, 27-37. https://doi.org/10.1016/j.artmed.2019.05.002
Journal Article Type | Article |
---|---|
Acceptance Date | May 8, 2019 |
Online Publication Date | May 13, 2019 |
Publication Date | 2019-06 |
Deposit Date | Jun 17, 2019 |
Publicly Available Date | May 14, 2020 |
Journal | Artificial Intelligence in Medicine |
Print ISSN | 0933-3657 |
Electronic ISSN | 1873-2860 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 97 |
Pages | 27-37 |
DOI | https://doi.org/10.1016/j.artmed.2019.05.002 |
Keywords | Ensemble Clustering; Ensemble Classification; Class level fusion; Refining cluster results; Breast Cancer; Pipeline |
Public URL | https://nottingham-repository.worktribe.com/output/2197043 |
Publisher URL | https://www.sciencedirect.com/science/article/pii/S0933365717303913 |
Contract Date | Jun 17, 2019 |
Files
Combining Clustering And Classification Ensembles A Novel Pipeline To Id..
(568 Kb)
PDF
You might also like
The role of the ALKBH5 RNA demethylase in invasive breast cancer
(2024)
Journal Article
Stromal lymphocytes are associated with upgrade of B3 breast lesions
(2024)
Journal Article
The characteristics and prognostic significance of histone H1 expression in breast cancer
(2024)
Journal Article
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search