id |
ecaade2023_68 |
authors |
Mugita, Yuki, Fukuda, Tomohiro and Yabuki, Nobuyoshi |
year |
2023 |
title |
Future Landscape Visualization by Generating Images Using a Diffusion Model and Instance Segmentation |
doi |
https://doi.org/10.52842/conf.ecaade.2023.2.549
|
source |
Dokonal, W, Hirschberg, U and Wurzer, G (eds.), Digital Design Reconsidered - Proceedings of the 41st Conference on Education and Research in Computer Aided Architectural Design in Europe (eCAADe 2023) - Volume 2, Graz, 20-22 September 2023, pp. 549–558 |
summary |
When designing a new landscape, such as when demolishing buildings and building new ones, visual methods are effective in sharing a common image. It is possible to visualize future landscapes by making sketches and models, but this requires a great deal of skill and effort on the part of the creator. One method for visualizing future landscapes without the need for specialized skills or labor is image generation using deep learning, and a method has been proposed of using deep learning to generate landscape images after demolishing current buildings. However, there are two problems: the inability to remove arbitrary buildings and the inability to generate a landscape after reconstruction. Therefore, this study proposes a future landscape visualization method that integrates instance segmentation and a diffusion model. The proposed method can generate both post-removal images of existing buildings and post-reconstruction images based on text input, without the need for specialized technology or labor. Verification results confirmed that the post-removal image was more than 90% accurate when the building was removed and replaced with the sky. And the post-reconstruction image matched the text content with a best accuracy of more than 90%. This research will contribute to the realization of urban planning in which all project stakeholders, both professionals and the public, can directly participate by visualizing their own design proposals for future landscapes. |
keywords |
landscape visualization, deep learning, diffusion model, instance segmentation, text input, text-to-image model, inpainting |
series |
eCAADe |
email |
|
full text |
file.pdf (3,205,154 bytes) |
references |
Content-type: text/plain
|
last changed |
2023/12/10 10:49 |
|