Skip to main content

Research Repository

Advanced Search

Resolving data gaps in global surface water monthly records through a self-supervised deep learning strategy

Hao, Zhen; Cai, Xiaobin; Ge, Yong; Foody, Giles; Li, Xinyan; Yin, Zhixiang; Du, Yun; Ling, Feng

Authors

Zhen Hao

Xiaobin Cai

Yong Ge

GILES FOODY giles.foody@nottingham.ac.uk
Professor of Geographical Information

Xinyan Li

Zhixiang Yin

Yun Du

Feng Ling



Abstract

The distribution of land surface water bodies is constantly changing. Monitoring these changes is critical for both humanity and the ecological system. The Joint Research Centre Global Surface Water (GSW) dataset is crucial in monitoring global water resources. However, about a third of this dataset suffers from gaps and invalid observations, diminishing its practical effectiveness. We first highlight the challenge of filling data gaps in seasonal water areas, a task significantly more complex than for permanent water bodies. The distinction is vital, as most water bodies are permanent or exhibit high occurrence probability, simplifying gap-filling. Focusing on addressing this, we introduce a self-supervised learning approach tailored to address data gaps in seasonal water areas. The model, trained using simulated gaps from the GSW dataset, adeptly identifies water body seasonal fluctuation patterns. Tested against 639 cloud-free Sentinel-2 images and simulated labels, the model demonstrates high accuracy, achieving F1 score of 0.83 for water and 0.75 for land in seasonal water areas. To improve dataset reliability, we implemented a quality scoring system for each filled segment, distinguishing between high and low-quality filled data and substantially mitigating uncertainty. This study stresses the need for distinct validations for seasonal waters to circumvent biases inherent in methods that do not differentiate between water body types (permanent vs. seasonal). The gap-filled dataset enables a more precise and comprehensive analysis of global water resource trends, highlighting the transformative impact of deep learning in improving the utility and reliability of large-scale environmental datasets.

Citation

Hao, Z., Cai, X., Ge, Y., Foody, G., Li, X., Yin, Z., Du, Y., & Ling, F. (2024). Resolving data gaps in global surface water monthly records through a self-supervised deep learning strategy. Journal of Hydrology, 640, Article 131673. https://doi.org/10.1016/j.jhydrol.2024.131673

Journal Article Type Article
Acceptance Date Jun 29, 2024
Online Publication Date Jul 24, 2024
Publication Date Aug 1, 2024
Deposit Date Jul 25, 2024
Publicly Available Date Jul 25, 2026
Journal Journal of Hydrology
Print ISSN 0022-1694
Electronic ISSN 1879-2707
Publisher Elsevier
Peer Reviewed Peer Reviewed
Volume 640
Article Number 131673
DOI https://doi.org/10.1016/j.jhydrol.2024.131673
Keywords JRC GSW; Deep learning; Water area mapping; Gap filling; Seasonal water
Public URL https://nottingham-repository.worktribe.com/output/37600103