Imagen (text-to-image model)

Imagen
Developer(s): Google DeepMind
Stable release: Imagen 3 / 13 August 2024
Type: Text-to-image model
Website: deepmind.google/technologies/imagen-3/

Imagen, Imagen 2, and Imagen 3 are text-to-image models developed by Google DeepMind. They were developed by Google Brain until that division merged with DeepMind in April 2023.[1] Imagen is primarily used to generate images from text prompts, similar to Stability AI's Stable Diffusion, OpenAI's DALL-E, and Midjourney.

The original version of the model was first described in a paper published in May 2022.[2] Imagen generates high-quality images from text prompts and is available to users with a Google account through services including Gemini, ImageFX, and Vertex AI.[3]

  1. ^ Roth, Emma; Peters, Jay (April 20, 2023). "Google's big AI push will combine Brain and DeepMind into one team". The Verge. Archived from the original on April 20, 2023. Retrieved March 18, 2025.
  2. ^ Saharia, Chitwan; Chan, William; Saxena, Saurabh; Li, Lala; Whang, Jay; Denton, Emily; Ghasemipour, Seyed Kamyar Seyed; Ayan, Burcu Karagol; Mahdavi, S. Sara; Lopes, Rapha Gontijo; Salimans, Tim; Ho, Jonathan; Fleet, David J.; Norouzi, Mohammad (2022). "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding". arXiv:2205.11487 [cs.CV].
  3. ^ Cite error: The named reference :2 was invoked but never defined (see the help page).
