Introduction

Stable Diffusion is a latent text-to-image diffusion model capable of generating photo-realistic images given any text input.

  • Lexica.art is a good resource to see images with prompts other people have tried.

V1

V2

Running locally on Mac

Running on the cloud

How does it work?

Some really interesting applications of the model

Animation with Stable diffusion and Unreal Engine 5

Reactive audio based on the generated image

Image-to-image + voice and video synth

Text-to-video editing

Morphing stable diffusion images with Unreal engine 5 models

VFX “makeup”

Stable diffusion animation