Text input required to guide the image generation.

Text input that will not guide the image generation.

0 to 1. Indicates how much to transform the reference image. When strength is 1, initial image will be ignored. Technically, strength parameter indicates how much noise add to the image.

1 to 100. The number of denoising steps. More steps usually can produce higher quality images, but take more time to generate. Number of steps is modulated by strength

0 to 20. Guidance scale as defined in Classifier-Free Diffusion Guidance. Higer guidance forces the model to better follow the prompt, but result in lower quality output.