CumInCAD is a Cumulative Index about publications in Computer Aided Architectural Design
supported by the sibling associations ACADIA, CAADRIA, eCAADe, SIGraDi, ASCAAD and CAAD futures

id ijac202321201
authors Steinfeld, Kyle
year 2023
title Clever little tricks: A socio-technical history of text-to-image generative models
source International Journal of Architectural Computing 2023, Vol. 21 - no. 2, 211–241
summary The emergence of text-to-image generative models (e.g., Midjourney, DALL-E 2, Stable Diffusion) in the summer of 2022 impacted architectural visual culture suddenly, severely, and seemingly out of nowhere. To contextualize this phenomenon, this text offers a socio-technical history of text-to-image generative systems. Three moments in time, or “scenes,” are presented here: the first at the advent of AI in the middle of the last century; the second at the “reawakening” of a specific approach to machine learning at the turn of this century; the third that documents a rapid sequence of innovations, dubbed “clever little tricks,” that occurred across just 18 months. This final scene is the crux, and represents the first formal documentation of the recent history of a specific set of informal innovations. These innovations were produced by non-affiliated researchers and communities of creative contributors, and directly led to the technologies that so compellingly captured the architectural imagination in the summer of 2022. Across these scenes, we examine the technologies, application domains, infrastructures, social contexts, and practices that drive technical research and shape creative practice in this space.
keywords Machine learning, text-to-image, socio-technical study, generative AI
series journal
references
Bobrow DG (1964) Natural language input for a computer problem solving system. Thesis, Cambridge, MA, USA: Massachusetts Institute of Technology, 1964.

Brants T, Popat AC, Xu P, et al (2007) Large language models in machine translation. Proceedings of the 2007 joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CoNLL).

Brown TB, Mann B, Ryder N, et al (2020) Language models are few-shot learners. Epub ahead of print July 2020. DOI: 10.48550/arXiv.2005.14165

Buchanan BG (2005) A (Very) Brief History of Artificial Intelligence. AI Magazine 2005; 26: 53–53.

Carpo M (2015) The new science of form-searching. Archit Design 2015; 85: 22–27.

Humby C (2006) Data is the new oil. Presented at the Association of National Advertisers (ANA) Senior marketer’s summit, Evanston, Illinois: Kellogg School of Management, Nov 2006.

Coons SA (1966) Computer, art and architecture. Art Education 1966; 19: 9–11.

Crevier D (1993) AI: The tumultuous history of the search for artificial intelligence. USA: Basic Books, Inc.

Deng J, Dong W, Socher R, et al (2009) Imagenet: A large-scale hierarchical image database. 2009 IEEE conference on computer vision and pattern recognition, Miami, FL, USA, June 20–25, 2009 (IEEE) 2009: 248–255.

Dhariwal P and Nichol A (2021) Diffusion models beat GANs on image synthesis. Epub ahead of print June 2021. DOI: 10.48550/arXiv.2105.05233

Ernst GW and Newell A (1969) GPS: A case study in generality and problem solving. New York, NY, USA: Academic Press, 1969.

Esser P, Rombach R and Ommer B (2020) Taming transformers for high-resolution image synthesis. https://arxiv.org/abs/2012.09841 (2020).

Gallinari P, Lecun Y, Thiria S, et al (1987) Memoires associatives distribuees: Une comparaison (Distributed associative memories: A comparison). Proceedings of COGNITIVA 87, Paris, La Villette, May 1987 (Cesta-Afcet) 1987.

Gao L, Biderman S, Black S, et al (2020) The Pile: An 800GB dataset of diverse text for language modeling. arXiv preprint arXiv:2101.00027, https://arxiv.org/abs/2101.00027 2020.

Ginsparg P (2021) Lessons from arXiv’s 30 years of information sharing. Nat Rev Phys 2021; 3: 602–603.

Gregor K, Danihelka I, Graves A, et al (2015) DRAW: A recurrent neural network for image generation. Epub ahead of print May 2015. DOI: 10.48550/arXiv.1502.04623

Ho J, Jain A and Abbeel P (2020) Denoising diffusion probabilistic models. Epub ahead of print December 2020. DOI: 10.48550/arXiv.2006.11239

Hochreiter S and Schmidhuber J (1997) Long short-term memory. Neural Comput 1997; 9: 1735–1780.

Hopfield JJ (1982) Neural networks and physical systems with emergent collective computational abilities. Proc Natl Acad Sci U S A 1982; 79: 2554–2558.

Ilharco G, Wortsman M, Wightman R, et al (2021) OpenCLIP. Epub ahead of print July 2021. DOI: 10.48550/arXiv.1908.09203

last changed 2024/04/17 14:30