AICurious Logo

What is: Guided Language to Image Diffusion for Generation and Editing?

SourceGLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Year2000
Data SourceCC BY-SA - https://paperswithcode.com

GLIDE is a generative model based on text-guided diffusion models for more photorealistic image generation. Guided diffusion is applied to text-conditional image synthesis and the model is able to handle free-form prompts. The diffusion model uses a text encoder to condition on natural language descriptions. The model is provided with editing capabilities in addition to zero-shot generation, allowing for iterative improvement of model samples to match more complex prompts. The model is fine-tuned to perform image inpainting.