377
Views
1
CrossRef citations to date
0
Altmetric
Research Article

Keeping walls straight: data model and training set size matter for deep learning in building generalization

ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Pages 130-145 | Received 14 Jan 2023, Accepted 20 Sep 2023, Published online: 14 Nov 2023
 

ABSTRACT

Deep learning-backed models have shown their potential of conducting map generalization tasks. However, pioneering studies for raster-based building generalization encountered a common “wabbly-wall effect” that makes the predicted building shapes unrealistic. This effect was identified as a critical challenge in the existing studies. This work proposes a layered data representation model that separately stores a building for generalization and its context buildings in different channels. Incorporating adjustments to training sample generation and prediction tasks, we show how even without using more complex deep learning architectures, the widely used Residual U-Net can already produce straight walls for the generalized buildings and maintains rectangularity and parallelism of the buildings very well for building simplification and aggregation in the scale transition from 1:5,000 to 1:10,000 and 1:5,000 to 1:15,000, respectively. Experiments with visual evaluation and quantitative indicators such as Intersection over Union (IoU), fractality, and roughness index show that using a larger input tensor size is an easy but effective solution to improve prediction. Balancing samples with data augmentation and introducing an attention mechanism to increase network learning capacity can help in certain experiment settings but have obvious tradeoffs. In addition, we find that the defects observed in previous studies may be due to a lack of enough training samples. We thus conclude that the wabbly-wall challenge can be solved, paving the way for further studies of applying raster-based deep learning models on map generalization.

POLICY HIGHLIGHTS

  • Demonstrates the effectiveness of the proposed data structure with multiple evaluation indicators

  • Identifies a “wabbly-wall effect” a challenge in deep-learning backed image based map generalization

  • Proposes a layered data structure that separates a target building and its surrounding buildings to ease the learning task in training deep learning models for raster-based map generalization.

Acknowledgments

The authors also appreciate the comments of four anonymous reviewers which helped improve the paper.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The raw maps that support the findings are available by request to Dr Yu Feng ([email protected]). The codes for U-Net and its variants are from third-party authors who are not affiliated with this manuscript. The codes for data preprocessing and the models adapted from U-Net models are available here: https://doi.org/10.6084/m9.figshare.21901086.v1.

Supplemental data

Supplemental data for this article can be accessed online at https://doi.org/10.1080/15230406.2023.2264757

Notes

Additional information

Funding

This research was supported by the Swiss National Science Foundation through project number [200021_204081] DeepGeneralization.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.