Vectorizing planar roof structure from very high resolution remote sensing images using transformers

Wufan Zhaoa Geomatics Section, Department of Civil Engineering, Faculty of Engineering Technology, KU Leuven, Ghent, Belgium;b Department of Earth observation science, Faculty of Geo-information Science and Earth Observation (ITC), University of Twente, Enschede, The NetherlandView further author information

Claudio Persellob Department of Earth observation science, Faculty of Geo-information Science and Earth Observation (ITC), University of Twente, Enschede, The NetherlandView further author information

Xianwei Lvb Department of Earth observation science, Faculty of Geo-information Science and Earth Observation (ITC), University of Twente, Enschede, The Netherland;c School of Computer and Communication Engineering, Northeastern University at Qinhuangdao, Qinhuangdao, People's Republic of ChinaCorrespondence[email protected]
View further author information

Alfred Steinb Department of Earth observation science, Faculty of Geo-information Science and Earth Observation (ITC), University of Twente, Enschede, The NetherlandView further author information

Maarten Vergauwena Geomatics Section, Department of Civil Engineering, Faculty of Engineering Technology, KU Leuven, Ghent, BelgiumView further author information

ABSTRACT

Accurately predicting the geometric structure of a building's roof as a vectorized representation from a raster image is a challenging task in building reconstruction. In this paper, we propose an efficient and precise parsing method called Roof-Former, based on a vision Transformer. Our method involves three steps: (1) Image encoder and edge node initialization, (2) Image feature fusion with an enhanced segmentation refinement branch, and (3) Edge filtering and structural reasoning. Our method outperforms previous works on the vectorizing world building dataset and the Enschede dataset, with vertex and edge heat map F1-scores increasing from $87.1 %$ , $76.2 %$ to $89.1 %$ , $78.1 %$ , and from $69.7 %$ , $68.8 %$ to $71.2 %$ , $69.5 %$ , respectively. Furthermore, our method demonstrates superior performance compared to the current state-of-the-art based on qualitative evaluations, indicating its effectiveness in extracting global image information while maintaining the consistency and topological validity of the roof structure.

KEYWORDS:

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The experiments conducted in this paper are based on two publicly available datasets, which can be accessed at Nauata and Furukawa (Citation2020) and Zhao, Persello, and Stein (Citation2022). Any inquiries regarding the datasets should be directed to the original authors.

Notes

1 Key Register Addresses and Buildings https://www.pdok.nl

Additional information

Funding

This work was supported by Foundation of Anhui Province Key Laboratory of Physical Geographic Environment, P.R. China [grant number 2022PGE012].

Vectorizing planar roof structure from very high resolution remote sensing images using transformers

Information for

Open access

Opportunities

Help and information

Vectorizing planar roof structure from very high resolution remote sensing images using transformers

ABSTRACT

Disclosure statement

Data availability statement

Notes

Additional information

Funding

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature