Imagen combines large, frozen, pretrained text encoders with a new thresholding diffusion sampler to generate high-quality images, achieving a zero-shot COCO FID score of 7.27, which was state of the art at the time of publication.
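Imagen's text encoder is pretrained on text-only data and kept frozen during training; the Imagen paper reports that scaling the text encoder improves image quality and text alignment more than scaling the diffusion model does.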
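Imagen's new thresholding diffusion sampler (called dynamic thresholding in the paper) allows very large classifier-free guidance weights to be used without degrading image quality.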
Imagen introduces an Efficient U-Net architecture that uses less compute and memory and converges faster than the U-Net variants used in earlier diffusion models.
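Imagen uses cascaded diffusion models to generate high-resolution images: a base model produces a 64×64 image, and two super-resolution models upsample it to 256×256 and then 1024×1024.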
Imagen was developed with responsible AI practices in mind, and its authors raise concerns about social bias and potential misuse.
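The thresholding sampler listed above can be illustrated with a small sketch. It assumes the sampler is the dynamic thresholding step described in the Imagen paper: at each sampling step, the predicted clean image is clipped to [-s, s], where s is a percentile of the absolute pixel values, and then divided by s when s exceeds 1. The function names, the flat list representation of the image, and the default percentile of 99.5 are illustrative assumptions, not values taken from the source.

```python
import math


def percentile(values, p):
    """Linearly interpolated p-th percentile (p in [0, 100]) of a non-empty sequence."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * p / 100.0
    lo, hi = math.floor(rank), math.ceil(rank)
    frac = rank - lo
    return ordered[lo] * (1.0 - frac) + ordered[hi] * frac


def dynamic_threshold(x0_pred, p=99.5):
    """Clip a predicted clean image to [-s, s], then rescale it by s.

    x0_pred is a flat list of pixel values nominally in [-1, 1].
    s is the p-th percentile of the absolute pixel values, floored at 1.0,
    so a prediction that already lies within [-1, 1] is returned unchanged.
    The default p is a hypothetical choice for illustration only.
    """
    s = max(1.0, percentile([abs(v) for v in x0_pred], p))
    return [max(-s, min(s, v)) / s for v in x0_pred]
```

For example, dynamic_threshold([4.0, -2.0, 1.0, 0.0], p=100) returns [1.0, -0.5, 0.25, 0.0]: the saturated prediction is pulled back into [-1, 1] while the relative contrast between pixels is kept. Plain clipping to [-1, 1] would instead map the first three pixels to 1.0, -1.0, and 1.0.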
Generate high-quality images from text prompts.
Use Imagen for artistic purposes, such as creating realistic images.
Apply Imagen to real-world problems, such as generating images for medical diagnosis or education.
Sign up for access to Imagen.
Enter a text prompt to generate an image.
Adjust parameters to fine-tune the image generation process.
Evaluate the generated image for quality and realism.
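What happens between entering a prompt and receiving an image can be outlined from the cascaded design mentioned earlier. The sketch below runs no model; it only lists the stages a prompt would pass through. The resolutions (64, 256, 1024) are those reported in the Imagen paper, while the stage names, the Stage class, and the describe function are hypothetical and do not correspond to any published Imagen API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    name: str      # illustrative label, not an official component name
    out_size: int  # output side length in pixels


# Resolutions as reported in the Imagen paper; everything else is assumed.
CASCADE = (
    Stage("base text-to-image diffusion", 64),
    Stage("super-resolution diffusion 1", 256),
    Stage("super-resolution diffusion 2", 1024),
)


def upscale_factors(stages):
    """Return the upscaling factor between each pair of consecutive stages."""
    return [b.out_size // a.out_size for a, b in zip(stages, stages[1:])]


def describe(prompt, stages=CASCADE):
    """List the steps a prompt would pass through, in order."""
    steps = [f"encode prompt with frozen text encoder: {prompt!r}"]
    prev = None
    for stage in stages:
        size = f"{stage.out_size}x{stage.out_size}"
        cond = "text embedding"
        if prev is not None:
            # Super-resolution stages are also conditioned on the smaller image.
            cond += f" and the {prev}x{prev} image"
        steps.append(f"{stage.name}: generate {size}, conditioned on {cond}")
        prev = stage.out_size
    return steps


if __name__ == "__main__":
    for step in describe("a corgi riding a bicycle"):
        print(step)
```

Each super-resolution stage upsamples by a factor of 4, so upscale_factors(CASCADE) returns [4, 4].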