Diman Hassan
Comparison of distance metrics for hierarchical data in medical databases
Hassan, Diman; Aickelin, Uwe; Wagner, Christian
Authors
Uwe Aickelin
Professor CHRISTIAN WAGNER Christian.Wagner@nottingham.ac.uk
PROFESSOR OF COMPUTER SCIENCE
Abstract
Distance metrics are broadly used in different research areas and applications, such as bio-informatics, data mining and many other fields. However, there are some metrics, like pg-gram and Edit Distance used specifically for data with a hierarchical structure. Other metrics used for non-hierarchical data are the geometric and Hamming metrics. We have applied these metrics to The Health Improvement Network (THIN) database which has some hierarchical data. The THIN data has to be converted into a tree-like structure for the first group of metrics. For the second group of metrics, the data are converted into a frequency table or matrix, then for all metrics, all distances are found and normalised. Based on this particular data set, our research question: which of these metrics is useful for THIN data?. This paper compares the metrics, particularly the pogram metric on finding the similarities of patients' data. It also investigates the similar patients who have the same close distances as well as the metrics suitability for clustering the whole patient population. Our results show that the two groups of metrics perform differently as they represent different structures of the data. Nevertheless, all the metrics could represent some similar data of patients as well as discriminate sufficiently well in clustering the patient population using k-means clustering algorithm.
Citation
Hassan, D., Aickelin, U., & Wagner, C. (2014, July). Comparison of distance metrics for hierarchical data in medical databases. Presented at 2014 International Joint Conference on Neural Networks (IJCNN)
Conference Name | 2014 International Joint Conference on Neural Networks (IJCNN) |
---|---|
Start Date | Jul 6, 2014 |
End Date | Jul 11, 2014 |
Acceptance Date | Mar 15, 2014 |
Publication Date | Sep 4, 2014 |
Deposit Date | Sep 30, 2014 |
Publicly Available Date | Sep 30, 2014 |
Peer Reviewed | Peer Reviewed |
Keywords | Biomedical Informatics, Data Mining, Machine Learning |
Public URL | https://nottingham-repository.worktribe.com/output/737014 |
Publisher URL | http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6889554 |
Additional Information | Published in: Proceedings of the 2014 International Joint Conference on Neural Networks : July 6-11, 2014, Beijing, China. Piscataway, N.J. : IEEE, 2014. ISBN: 978-1-4799-1484-5. pp. 3636-3643, doi: 10.1109/IJCNN.2014.6889554 ©2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
Contract Date | Sep 30, 2014 |
Files
WCCI-conference-paper-2014.pdf
(218 Kb)
PDF
You might also like
Explain the world – Using causality to facilitate better rules for fuzzy systems
(2024)
Journal Article
Gradient-based Fuzzy System Optimisation via Automatic Differentiation – FuzzyR as a Use Case
(2024)
Preprint / Working Paper
Explaining time series classifiers through meaningful perturbation and optimisation
(2023)
Journal Article
Feature Importance Identification for Time Series Classifiers
(2022)
Presentation / Conference Contribution
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search