Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 93% Match Research Paper Ecologists,Environmental Scientists,Remote Sensing Specialists,Geospatial AI Researchers,Data Scientists 2 weeks ago

SSL4Eco: A Global Seasonal Dataset for Geospatial Foundation Models in Ecology

computer-vision › scene-understanding
📄 Abstract

Abstract: With the exacerbation of the biodiversity and climate crises, macroecological pursuits such as global biodiversity mapping become more urgent. Remote sensing offers a wealth of Earth observation data for ecological studies, but the scarcity of labeled datasets remains a major challenge. Recently, self-supervised learning has enabled learning representations from unlabeled data, triggering the development of pretrained geospatial models with generalizable features. However, these models are often trained on datasets biased toward areas of high human activity, leaving entire ecological regions underrepresented. Additionally, while some datasets attempt to address seasonality through multi-date imagery, they typically follow calendar seasons rather than local phenological cycles. To better capture vegetation seasonality at a global scale, we propose a simple phenology-informed sampling strategy and introduce corresponding SSL4Eco, a multi-date Sentinel-2 dataset, on which we train an existing model with a season-contrastive objective. We compare representations learned from SSL4Eco against other datasets on diverse ecological downstream tasks and demonstrate that our straightforward sampling method consistently improves representation quality, highlighting the importance of dataset construction. The model pretrained on SSL4Eco reaches state of the art performance on 7 out of 8 downstream tasks spanning (multi-label) classification and regression. We release our code, data, and model weights to support macroecological and computer vision research at https://github.com/PlekhanovaElena/ssl4eco.
Authors (7)
Elena Plekhanova
Damien Robert
Johannes Dollinger
Emilia Arens
Philipp Brun
Jan Dirk Wegner
+1 more
Submitted
April 25, 2025
arXiv Category
cs.CV
arXiv PDF

Key Contributions

Introduces SSL4Eco, a global, multi-date Sentinel-2 dataset designed for training geospatial foundation models in ecology. It addresses data scarcity and representation bias by using a phenology-informed sampling strategy to better capture vegetation seasonality at a global scale, enabling more generalizable ecological models.

Business Value

Supports critical environmental monitoring and conservation efforts by providing essential data infrastructure for advanced AI models, aiding in climate change mitigation and biodiversity preservation.