Cumincad : CUMINCAD Papers : Paper caadria2021_117:Can a Generative Adversarial Network Remove Thin Clouds in Aerial Photographs? - Toward Improving the Accuracy of Generating Horizontal Building Mask Images for Deep Learning in Urban Planning and Design

caadria2021_117

authors

Ikeno, Kazunosuke, Fukuda, Tomohiro and Yabuki, Nobuyoshi

year

2021

title

Can a Generative Adversarial Network Remove Thin Clouds in Aerial Photographs? - Toward Improving the Accuracy of Generating Horizontal Building Mask Images for Deep Learning in Urban Planning and Design

doi

https://doi.org/10.52842/conf.caadria.2021.2.377

source

A. Globa, J. van Ameijde, A. Fingrut, N. Kim, T.T.S. Lo (eds.), PROJECTIONS - Proceedings of the 26th CAADRIA Conference - Volume 2, The Chinese University of Hong Kong and Online, Hong Kong, 29 March - 1 April 2021, pp. 377-386

summary

Information extracted from aerial photographs is widely used in the fields of urban planning and architecture. An effective method for detecting buildings in aerial photographs is to use deep learning to understand the current state of a target region. However, the building mask images used to train the deep learning model must be manually generated in many cases. To overcome this challenge, a method has been proposed for automatically generating mask images by using textured 3D virtual models with aerial photographs. Some aerial photographs include thin clouds, which degrade image quality. In this research, the thin clouds in these aerial photographs are removed by using a generative adversarial network, which leads to improvements in training accuracy. Therefore, the objective of this research is to propose a method for automatically generating building mask images by using 3D virtual models with textured aerial photographs to enable the removable of thin clouds so that the image can be used for deep learning. A model trained on datasets generated by the proposed method was able to detect buildings in aerial photographs with an accuracy of IoU = 0.651.

keywords

Urban planning and design; Deep learning; Generative Adversarial Network (GAN); Semantic segmentation; Mask image

series

CAADRIA

full text

file.pdf (12,879,845 bytes)

references

Content-type: text/plain

Details	Citation	Select
	Bittner, K, Adam, F, Cui, S, Körner, M and Reinartz, P (2018) Building Footprint Extraction From VHR Remote Sensing Images Combined With Normalized DSMs Using Fused Fully Convolutional Networks , IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 11, p. 2615-2629

	Cao, R, Fukuda, T and Yabuki, N (2019) Quantifying Visual Environment by Semantic Segmentation Using Deep Learning , Proceedings of the 24th International Conference on Computer-Aided Architectural Design Research in Asia (CAADRIA 2019), p. 623-632

	Everingham, M, Eslami, S. M. A, Gool, L. V, Williams, C. K. I, Winn, J and Zisserman, A (2015) The PASCAL Visual Object Classes Challenge: A Retrospective , International Journal of Computer Vision, 111, p. 98-136

	Fukuda, T, Novak, M, Fujii, H and Pencreach, Y (2020) Virtual reality rendering methods for training deep learning, analysing landscapes, and preventing virtual reality sickness , International Journal of Architectural Computing, 16th September 2020, p. https://doi.org/10.1177/1478077120957544

	Goodfellow, I, Pouget-Abadie, J, Mirza, M, Xu, B, Warde-Farley, D, Ozair, S, Courville, A and Bengio, Y (2014) Generative adversarial nets , Proceedings of the 27th International Conference on Neural Information Processing Systems (NIPS), p. 2672-2680

	Ikeno, K, Fukuda, T and Yabuki, N (2020) Automatic Generation of Horizontal Building Mask Images by Using a 3D Model with Aerial Photographs for Deep Learning , Proceedings of eCAADe 2020, p. 271-278

	Jabbar, A, Farrawell, L, Fountain, J and Chalup, S. K (2017) Training Deep Neural Networks for Detecting Drinking Glasses Using Synthetic Images , Neural Information Processing, p. 354-363

	Krizhevsky, A, Sutskever, I and Hinton, G. E (2012) ImageNet classification with deep convolutional neural networks , Proceedings of the 25th International Conference on Neural Information Processing Systems (NIPS 2012), p. 1097-1105

	Long, J, Shelhamer, E and Darrell, T (2015) Fully Convolutional Networks for Semantic Segmentation , The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p. 3431-3440

	Redmon, J, Divvala, S, Girshick, R and Farhadi, A (2016) You Only Look Once: Unified, Real-Time Object Detection , The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p. 779-788

	Ronneberger, O, Fischer, P and Brox, T (2015) Unet: Convolutional networks for biomedical image segmentation , International Conference on Medical Image Computing and Computer-Assisted Intervention, p. 234-241

	Wang, T, Yang, X, Xu, K, Chen, S, Zhang, Q and Lau, R (2019) Spatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset , The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p. 12262-12271

	Zhang, Q, Wang, Y, Liu, Q, Liu, X and Wang, W (2016) CNN based suburban building detection using monocular high resolution Google Earth images , Proc. IEEE Int. Geosci. Remote Sens. Symp. (IGARSS), p. 661-664

	Zhao, K, Kang, J, Jung, J and Sohn, G (2018) Building Extraction from Satellite Images Using Mask R-CNN with Building Boundary Regularization , Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), p. 247-251

	Zhou, K, Chen, Y, Smal, I and Lindenbergh, R (2019) Building segmentation from Airborne VHR Images Using mask R-CNN , Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., p. 155-161

last changed

2022/06/07 07:50