Skip to main content

Research Repository

Advanced Search

A novel framework to elucidate core classes in a dataset

Soria, Daniele; Garibaldi, Jonathan M.

Authors

Daniele Soria

Jonathan M. Garibaldi



Abstract

In this paper we present an original framework to extract representative groups from a dataset, and we validate it
over a novel case study. The framework specifies the application of different clustering algorithms, then several statistical and visualisation techniques are used to characterise the results, and core classes are defined by consensus clustering. Classes may be verified using supervised classification algorithms to obtain a set of rules which may be useful for new data points in the future. This framework is validated over a novel set of histone markers for breast cancer patients. From a technical perspective, the resultant classes are well separated and characterised by low, medium and high levels of biological markers. Clinically, the groups appear to distinguish patients with poor overall survival from those with low grading score and better survival. Overall, this framework offers a promising methodology for elucidating core consensus groups from data.

Citation

Soria, D., & Garibaldi, J. M. (2010). A novel framework to elucidate core classes in a dataset.

Conference Name IEEE Congress on Evolutionary Computation (CEC) 2010
End Date Jul 23, 2010
Publication Date Jan 1, 2010
Deposit Date Feb 26, 2015
Publicly Available Date Feb 26, 2015
Peer Reviewed Peer Reviewed
Public URL https://nottingham-repository.worktribe.com/output/1013224
Publisher URL http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=5586331
Additional Information Published in: IEEE Congress on Evolutionary Computation (CEC) 2010, IEEE, 2010, ISBN 978-1-4244-8126-2, pp. 1-8
doi: 10.1109/CEC.2010.5586331

Files





You might also like



Downloadable Citations