Professor GILES FOODY giles.foody@nottingham.ac.uk
PROFESSOR OF GEOGRAPHICAL INFORMATION
Impacts of sample design for validation data on the accuracy of feedforward neural network classification
Foody, Giles
Authors
Abstract
Validation data are often used to evaluate the performance of a trained neural network and used in the selection of a network deemed optimal for the task at-hand. Optimality is commonly assessed with a measure, such as overall classification accuracy. The latter is often calculated directly from a confusion matrix showing the counts of cases in the validation set with particular labelling properties. The sample design used to form the validation set can, however, influence the estimated magnitude of the accuracy. Commonly, the validation set is formed with a stratified sample to give balanced classes, but also via random sampling, which reflects class abundance. It is suggested that if the ultimate aim is to accurately classify a dataset in which the classes do vary in abundance, a validation set formed via random, rather than stratified, sampling is preferred. This is illustrated with the classification of simulated and remotely-sensed datasets. With both datasets, statistically significant differences in the accuracy with which the data could be classified arose from the use of validation sets formed via random and stratified sampling (z = 2.7 and 1.9 for the simulated and real datasets respectively, for both p < 0.05%). The accuracy of the classifications that used a stratified sample in validation were smaller, a result of cases of an abundant class being commissioned into a rarer class. Simple means to address the issue are suggested.
Citation
Foody, G. (2017). Impacts of sample design for validation data on the accuracy of feedforward neural network classification. Applied Sciences, 7(9), Article 888. https://doi.org/10.3390/app7090888
Journal Article Type | Article |
---|---|
Acceptance Date | Aug 21, 2017 |
Publication Date | Aug 30, 2017 |
Deposit Date | Sep 11, 2017 |
Publicly Available Date | Sep 11, 2017 |
Journal | Applied Sciences |
Electronic ISSN | 2076-3417 |
Publisher | MDPI |
Peer Reviewed | Peer Reviewed |
Volume | 7 |
Issue | 9 |
Article Number | 888 |
DOI | https://doi.org/10.3390/app7090888 |
Keywords | cross-validation; multi-layer perceptron; remote sensing; classification error; sample design; machine learning |
Public URL | https://nottingham-repository.worktribe.com/output/879474 |
Publisher URL | http://www.mdpi.com/2076-3417/7/9/888 |
Contract Date | Sep 11, 2017 |
Files
applsci-07-00888 Foody.pdf
(705 Kb)
PDF
Copyright Statement
Copyright information regarding this work can be found at the following address: http://creativecommons.org/licenses/by/4.0
You might also like
Spatial–Temporal Analysis of Greenness and Its Relationship with Poverty in China
(2024)
Journal Article
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search