BPOI Banner
Google Unleashes Imagen 3, Heating Up the AI Image Generator Race Google Unleashes Imagen 3, Heating Up the AI Image Generator Race

Google Unleashes Imagen 3, Heating Up the AI Image Generator Race

Google is putting the icing on the cake for a busy week in the generative AI space with the launch of Imagen 3, its brand-new text-to-image model. This release builds upon the success of Imagen 2, introduced in December 2023, which already rivaled industry heavyweights like Dall-E 3 and MidJourney v5.

Imagen 3, originally announced in May, boasts enhanced capabilities in understanding and executing complex prompts, generating images with improved details, and better prompt adherence compared to its predecessor. It is pretty versatile, producing good results that range from photorealism to art and 3D compositions.

“Imagen 3 is our highest quality text-to-image model, capable of generating images with even better detail, richer lighting, and fewer distracting artifacts than our previous models,” Google said in its official announcement.

Imagen 3’s prompt improvements allow users to describe desired images in natural language without complex prompt engineering. The model’s training also incorporated richer image captions, enabling it to capture nuanced details like specific camera angles or compositions and long text prompts when needed.

The tech giant has placed particular emphasis on Imagen 3’s enhanced text rendering capabilities. Although noticeably improved, our initial tests show that its capabilities are not quite on par with other models like Dall-E 3, Auraflow, or Flux.

Generations by Imagen 3 and Grok 2 using the same prompt

Google has also stressed its commitment to safety and responsibility in the development and deployment of Imagen 3. The company implemented what it described as “extensive filtering and data labeling” processes to minimize harmful content in the model’s training datasets. Additionally, Google said it conducted thorough evaluations, including red team exercises, to identify and fix potential vulnerabilities.

It is also important to note that Imagen 3 integrates SynthID, Google’s watermarking tool. SynthID embeds a digital signature directly into the pixels of generated images. This watermark is imperceptible to the human eye but detectable by specialized software, providing a means to identify AI-generated content.

Currently, Imagen 3 is available through Google’s ImageFX platform and Vertex AI. Looking ahead, Google plans to introduce popular editing features from Imagen 2, such as inpainting (editing elements in the image) and outpainting (expanding it), to Imagen 3 in the coming months. The company has also announced intentions to expand Imagen 3’s availability across its broader product ecosystem, including integration into the Gemini app, Google Workspace, and Google Ads.

This release is part of a broader Google strategy that aims to put Gemini and AI technology in basically all of its services and hardware. This week, the company introduced its new Pixel 9 lineup, which was designed with AI capabilities at its core. The new Pixel phones can handle certain generative AI tasks locally, including text-based tasks and small image generations.

The release of Imagen 3 comes amid a flurry of activity in the AI image generation space. Elon Musk’s xAI recently unveiled Grok 2, featuring the Flux.1 image generator, which has gained attention for its ability to produce highly realistic, uncensored images alongside strong text generation capabilities.

Meanwhile, MidJourney, another key player in the field, announced an imminent v6.2 update to its model. The company also teased the development of MidJourney v7, slated for release in the coming months. Ideogram, another contender in the AI image generation arena, has also hinted at a forthcoming update to its model. Finally. the Open Model Initiative has chosen Flux.1 as the foundation for developing its state-of-the-art open-source image generation model.

Edited by Ryan Ozawa.

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.

Source link

Jose Antonio Lanz

https://decrypt.co/245121/google-imagen-3-release-imagefx

2024-08-16 20:03:57

bitcoin
Bitcoin (BTC) $ 67,750.15 1.41%
ethereum
Ethereum (ETH) $ 2,620.26 0.91%
tether
Tether (USDT) $ 0.999698 0.08%
bnb
BNB (BNB) $ 602.68 1.67%
solana
Solana (SOL) $ 154.80 0.94%
usd-coin
USDC (USDC) $ 0.999687 0.09%
xrp
XRP (XRP) $ 0.549368 1.57%
staked-ether
Lido Staked Ether (STETH) $ 2,619.65 0.91%
dogecoin
Dogecoin (DOGE) $ 0.126618 9.96%
tron
TRON (TRX) $ 0.159959 0.73%
the-open-network
Toncoin (TON) $ 5.25 0.94%
cardano
Cardano (ADA) $ 0.355495 0.11%
avalanche-2
Avalanche (AVAX) $ 28.12 0.35%
shiba-inu
Shiba Inu (SHIB) $ 0.000019 3.66%
wrapped-steth
Wrapped stETH (WSTETH) $ 3,094.62 0.80%
wrapped-bitcoin
Wrapped Bitcoin (WBTC) $ 67,601.11 1.27%
weth
WETH (WETH) $ 2,619.60 0.92%
bitcoin-cash
Bitcoin Cash (BCH) $ 365.23 3.61%
chainlink
Chainlink (LINK) $ 11.32 0.21%
polkadot
Polkadot (DOT) $ 4.34 1.00%
near
NEAR Protocol (NEAR) $ 5.00 0.42%
dai
Dai (DAI) $ 0.999591 0.06%
sui
Sui (SUI) $ 2.11 4.17%
uniswap
Uniswap (UNI) $ 7.62 2.63%
leo-token
LEO Token (LEO) $ 6.06 0.61%
litecoin
Litecoin (LTC) $ 70.11 0.16%
aptos
Aptos (APT) $ 10.12 3.36%
pepe
Pepe (PEPE) $ 0.000011 3.12%
wrapped-eeth
Wrapped eETH (WEETH) $ 2,751.00 0.86%
bittensor
Bittensor (TAO) $ 586.08 0.89%
fetch-ai
Artificial Superintelligence Alliance (FET) $ 1.44 0.85%
internet-computer
Internet Computer (ICP) $ 7.96 0.87%
kaspa
Kaspa (KAS) $ 0.131401 1.99%
ethereum-classic
Ethereum Classic (ETC) $ 19.47 0.90%
first-digital-usd
First Digital USD (FDUSD) $ 0.999845 0.11%
monero
Monero (XMR) $ 156.33 1.73%
stellar
Stellar (XLM) $ 0.095174 2.53%
polygon-ecosystem-token
POL (ex-MATIC) (POL) $ 0.372019 0.90%
blockstack
Stacks (STX) $ 1.85 3.06%
dogwifcoin
dogwifhat (WIF) $ 2.65 2.80%
immutable-x
Immutable (IMX) $ 1.53 1.17%
okb
OKB (OKB) $ 41.15 0.43%
ethena-usde
Ethena USDe (USDE) $ 0.998925 0.01%
whitebit
WhiteBIT Coin (WBT) $ 16.37 0.19%
aave
Aave (AAVE) $ 157.33 0.36%
filecoin
Filecoin (FIL) $ 3.78 0.36%
optimism
Optimism (OP) $ 1.78 3.68%
render-token
Render (RENDER) $ 5.43 0.24%
crypto-com-chain
Cronos (CRO) $ 0.079023 2.02%
mantle
Mantle (MNT) $ 0.624936 1.00%