【Stable Diffusion Prompt build】What is Stable Diffusion?

July 28th, 2023

Stable Diffusion is a text-to-image AI model. It was trained on millions of image and text description pairs found on the internet. Because it has seen so many examples, the model has learned which text descriptions are associated with which kinds of images.

As a result, if you put in a prompt like “A Photo of a cat sitting on top of a building”, it will give you images like these:

You may be thinking: what’s the big deal? Couldn’t we get millions of them from a Google search? What’s intriguing about this technology is that you can prompt the model to generate high-quality images that did not exist before. For example, you can ask for a portrait painting of Emma Watson by the 19th-century American painter John Singer Sargent:

It is incredible that such images can be produced from keyword-pixel correlations! What’s mind-boggling is that it gets the artistic style, the faces (where our brains are very unforgiving of tiny mistakes) and the shadows right, and blends them all together in an aesthetically pleasing manner. The wonder of large numbers is beyond the comprehension of human minds.
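
To make the prompt-to-image step concrete, here is a minimal sketch of how the two example prompts above could be sent to the model from Python. It assumes the Hugging Face `diffusers` library (with `transformers` and `torch`) is installed. The helper names `build_prompt` and `generate_image` and the default checkpoint name are illustrative assumptions, not something this article prescribes.

```python
def build_prompt(subject: str, medium: str = "", artist: str = "") -> str:
    """Join the parts of a text description into one prompt string.

    Hypothetical helper for illustration; a prompt is just plain text.
    """
    prompt = f"{medium} of {subject}" if medium else subject
    if artist:
        prompt += f" by {artist}"
    return prompt


def generate_image(prompt: str, model_id: str = "CompVis/stable-diffusion-v1-4"):
    """Generate one image for a prompt and return it as a PIL image.

    Assumption: `model_id` names a Stable Diffusion checkpoint you have
    access to; the weights are downloaded the first time this runs.
    """
    # Imported here because diffusers is a heavy, optional dependency.
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained(model_id)
    return pipe(prompt).images[0]


# Example usage (downloads several GB of weights; a GPU is strongly advised):
#   cat = generate_image("A Photo of a cat sitting on top of a building")
#   cat.save("cat.png")
#   portrait = generate_image(
#       build_prompt("Emma Watson", "portrait painting", "John Singer Sargent")
#   )
#   portrait.save("portrait.png")
```

Each call starts from different random noise unless a seed is fixed, so running the same prompt twice will usually produce two different images.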

