Patrice E. Carbonneau
Adopting deep learning methods for airborne RGB fluvial scene classification
Carbonneau, Patrice E.; Dugdale, Stephen J.; Breckon, Toby P.; Dietrich, James T.; Fonstad, Mark A.; Miyamoto, Hitoshi; Woodget, Amy S.
Authors
Dr Stephen Dugdale (stephen.dugdale@nottingham.ac.uk)
Associate Professor
Toby P. Breckon
James T. Dietrich
Mark A. Fonstad
Hitoshi Miyamoto
Amy S. Woodget
Abstract
Rivers are among the world's most threatened ecosystems. Enabled by the rapid development of drone technology, hyperspatial resolution (<10 cm) images of fluvial environments are now a common data source used to better understand these sensitive habitats. However, the task of image classification remains challenging for this type of imagery and the application of traditional classification algorithms such as maximum likelihood, still in common use among the river remote sensing community, yields unsatisfactory results. We explore the possibility that a classifier of river imagery based on deep learning methods can provide a significant improvement in our ability to classify fluvial scenes. We assemble a dataset composed of RGB images from 11 rivers in Canada, Italy, Japan, the United Kingdom, and Costa Rica. The images were labelled into 5 land-cover classes: water, dry exposed sediment, green vegetation, senescent vegetation and roads. In total, >5 billion pixels were labelled and partitioned for the tasks of training (1 billion pixels) and validation (4 billion pixels). We develop a novel supervised learning workflow based on the NASNet convolutional neural network (CNN) called ‘CNN-Supervised Classification’ (CSC). First, we compare the classification performance of maximum likelihood, a multilayer perceptron, a random forest, and CSC. Results show median F1 scores (a commonly used quality metric in machine learning) of 71%, 78%, 72% and 95%, respectively. Second, we train our classifier using data for 5 of 11 rivers. We then predict the validation data for all 11 rivers. For the 5 rivers that were used in model training, median F1 scores reach 98%. For the 6 rivers not used in model training, median F1 scores are 90%. We reach two conclusions. First, in the traditional workflow where images are classified one at a time, CSC delivers an unprecedented mix of labour savings and classification F1 scores above 95%. Second, deep learning can predict land-cover classifications (F1 = 90%) for rivers not used in training. This demonstrates the potential to train a generalised open-source deep learning model for airborne river surveys suitable for most rivers ‘out of the box’. Research efforts should now focus on further development of a new generation of deep learning classification tools that will encode human image interpretation abilities and allow for fully automated, potentially real-time, interpretation of riverine landscape images.
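For readers unfamiliar with the F1 metric quoted in the abstract, the following is a minimal sketch of how a per-class F1 score can be computed from true and predicted pixel labels, using F1 = 2TP / (2TP + FP + FN). The class names, the toy labels, and the function name `f1_per_class` are illustrative assumptions and are not taken from the paper. The abstract does not state how the median is aggregated, so taking the median over classes here is also an assumption.

```python
from statistics import median

# Illustrative names for the five land-cover classes listed in the abstract.
CLASSES = ["water", "sediment", "green_veg", "senescent_veg", "road"]

def f1_per_class(y_true, y_pred, n_classes):
    """Return one F1 score per class from two equal-length label sequences."""
    tp = [0] * n_classes  # true positives per class
    fp = [0] * n_classes  # false positives per class
    fn = [0] * n_classes  # false negatives per class
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but the pixel was not p
            fn[t] += 1  # true class t was missed
    scores = []
    for c in range(n_classes):
        denom = 2 * tp[c] + fp[c] + fn[c]
        # A class with no true or predicted pixels gets 0.0 by convention here.
        scores.append(2 * tp[c] / denom if denom else 0.0)
    return scores

# Toy labels (hypothetical, not data from the study).
y_true = [0, 0, 1, 1, 2, 2, 3, 3, 4, 4]
y_pred = [0, 0, 1, 2, 2, 2, 3, 0, 4, 4]

scores = f1_per_class(y_true, y_pred, len(CLASSES))
median_f1 = median(scores)  # assumption: median taken across classes

for name, score in zip(CLASSES, scores):
    print(f"{name}: {score:.3f}")
print(f"median F1: {median_f1:.3f}")
```

In practice the same per-class scores could be obtained from an established library routine; the hand-written loop is used here only to keep the definition visible.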
Field | Value |
---|---|
Journal Article Type | Article |
Acceptance Date | Sep 16, 2020 |
Online Publication Date | Sep 25, 2020 |
Publication Date | 2020-12 |
Deposit Date | Sep 28, 2020 |
Publicly Available Date | Sep 26, 2021 |
Journal | Remote Sensing of Environment |
Print ISSN | 0034-4257 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 251 |
Article Number | 112107 |
DOI | https://doi.org/10.1016/j.rse.2020.112107 |
Keywords | Computers in Earth Sciences; Soil Science; Geology |
Public URL | https://nottingham-repository.worktribe.com/output/4931849 |
Publisher URL | https://www.sciencedirect.com/science/article/abs/pii/S0034425720304806 |
Files
Carbonneau Etal RSE 2020 Final
(12.1 Mb)
PDF