Abstract
The unavoidable presence of clouds and their shadows in optical satellite imagery hinders the true spectral response of the Earth underlying surface. Accurate cloud and cloud shadow detection is therefore a crucial preprocessing step for optical satellite images and any downstream analysis. Various methods have been developed to address this critical task and can be broadly categorized in physical rule-based methods and learning based methods. In recent years, machine learning based methods, particularly deep learning frameworks, have proven to outperform physical rule-based models. However, these approaches are mostly fully supervised and require a large amount of pixel-level annotations whose obtention is costly and time consuming. In this work, we propose to deal with cloud and cloud shadow detection in optical satellite images using self-supervised representation learning, a machine learning paradigm that focuses on extracting relevant representations from unlabeled data, which can then be used as an effective starting point to fine-tune models with few labeled data in a supervised fashion. These approaches were shown to perform competitively with fully supervised methods without the requirement of large annotation datasets. Particularly, we assessed two self-supervised representation learning methods that use different philosophies about self-supervision: Momentum Contrast (MoCo), based on contrastive learning and DeepCluster, based on clustering. Using two publicly available Sentinel-2 cloud datasets, namely WHUS2-CD+ and CloudSEN12, we show that MoCo and DeepCluster, trained with only 25% of the annotated data, can perform better than physical rule-based methods such as FMask and Sen2Cor, weakly supervised methods and even several fully supervised methods. These results point out the strong applicability of self-supervised representation learning methods to the task of cloud and cloud shadow detection with self-supervised pretraining leading to fine-tuned models that outperform industry standards and achieve near state-of-the-art performances with a fraction of the data.
| Original language | English |
|---|---|
| Article number | 115205 |
| Journal | Remote Sensing of Environment |
| Volume | 334 |
| DOIs | |
| Publication status | Published - 1 Mar 2026 |
| MoE publication type | A1 Journal article-refereed |
Funding
This work was part of the RepreSent project, funded by the European Space Agency (ESA) Contract No: 4000137253/22/I-DT – CCN3.
Fingerprint
Dive into the research topics of 'Self-supervised representation learning for cloud detection using Sentinel-2 images'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver