Increasing amounts of large scale georeferenced data produced by Earth observation missions present new challenges for training and testing machine-learned predictive models. Most of this data is spatially auto-correlated, which violates the classical i.i.d. assumption (identically and independently distributed data) commonly used in machine learning. One of the largest challenges in relation to spatial auto-correlation is how to generate testing sets that are sufficiently independent of the training data. In the geoscience and ecological literature, spatially stratified cross-validation is increasingly used as an alternative to standard random cross-validation. Spatial cross-validation, however, is not yet widely studied in the machine lea...
Spatial and spatiotemporal machine-learning models require a suitable framework for their model asse...
Random forest and similar Machine Learning techniques are already used to generate spatial predictio...
Applications of machine-learning-based approaches in the geosciences have witnessed a substantial in...
For decades scientists have produced maps of biological, ecological and environmental variables. The...
International audienceSpatial autocorrelation is inherent to remotely sensed data. Nearby pixels are...
High spatial resolution (1–5 m) remotely sensed datasets are increasingly being used to map la...
Highlights • Our data split method handles spatial autocorrelation and imposes prediction fairnes...
Several spatial and non-spatial Cross-Validation (CV) methods have been used to perform map validati...
Ecological data often show temporal, spatial, hierarchical (random effects), or phylogenetic structu...
International audienceMapping aboveground forest biomass is central for assessing the global carbon ...
Ecological data often show temporal, spatial, hierarchical (random effects), or phylogenetic structu...
Mapping of environmental variables often relies on map accuracy assessment through cross-validation ...
Data-driven machine learning algorithms have initiated a paradigm shift in hedonic house price and r...
Machine learning spatial modeling is used for mapping the distribution of deep-sea polymetallic nodu...
Spatial and spatiotemporal machine-learning models require a suitable framework for their model asse...
Random forest and similar Machine Learning techniques are already used to generate spatial predictio...
Applications of machine-learning-based approaches in the geosciences have witnessed a substantial in...
For decades scientists have produced maps of biological, ecological and environmental variables. The...
International audienceSpatial autocorrelation is inherent to remotely sensed data. Nearby pixels are...
High spatial resolution (1–5 m) remotely sensed datasets are increasingly being used to map la...
Highlights • Our data split method handles spatial autocorrelation and imposes prediction fairnes...
Several spatial and non-spatial Cross-Validation (CV) methods have been used to perform map validati...
Ecological data often show temporal, spatial, hierarchical (random effects), or phylogenetic structu...
International audienceMapping aboveground forest biomass is central for assessing the global carbon ...
Ecological data often show temporal, spatial, hierarchical (random effects), or phylogenetic structu...
Mapping of environmental variables often relies on map accuracy assessment through cross-validation ...
Data-driven machine learning algorithms have initiated a paradigm shift in hedonic house price and r...
Machine learning spatial modeling is used for mapping the distribution of deep-sea polymetallic nodu...
Spatial and spatiotemporal machine-learning models require a suitable framework for their model asse...
Random forest and similar Machine Learning techniques are already used to generate spatial predictio...
Applications of machine-learning-based approaches in the geosciences have witnessed a substantial in...