BPOI Banner
Google Unleashes Imagen 3, Heating Up the AI Image Generator Race Google Unleashes Imagen 3, Heating Up the AI Image Generator Race

Google Unleashes Imagen 3, Heating Up the AI Image Generator Race

Google is putting the icing on the cake for a busy week in the generative AI space with the launch of Imagen 3, its brand-new text-to-image model. This release builds upon the success of Imagen 2, introduced in December 2023, which already rivaled industry heavyweights like Dall-E 3 and MidJourney v5.

Imagen 3, originally announced in May, boasts enhanced capabilities in understanding and executing complex prompts, generating images with improved details, and better prompt adherence compared to its predecessor. It is pretty versatile, producing good results that range from photorealism to art and 3D compositions.

“Imagen 3 is our highest quality text-to-image model, capable of generating images with even better detail, richer lighting, and fewer distracting artifacts than our previous models,” Google said in its official announcement.

Imagen 3’s prompt improvements allow users to describe desired images in natural language without complex prompt engineering. The model’s training also incorporated richer image captions, enabling it to capture nuanced details like specific camera angles or compositions and long text prompts when needed.

The tech giant has placed particular emphasis on Imagen 3’s enhanced text rendering capabilities. Although noticeably improved, our initial tests show that its capabilities are not quite on par with other models like Dall-E 3, Auraflow, or Flux.

Generations by Imagen 3 and Grok 2 using the same prompt

Google has also stressed its commitment to safety and responsibility in the development and deployment of Imagen 3. The company implemented what it described as “extensive filtering and data labeling” processes to minimize harmful content in the model’s training datasets. Additionally, Google said it conducted thorough evaluations, including red team exercises, to identify and fix potential vulnerabilities.

It is also important to note that Imagen 3 integrates SynthID, Google’s watermarking tool. SynthID embeds a digital signature directly into the pixels of generated images. This watermark is imperceptible to the human eye but detectable by specialized software, providing a means to identify AI-generated content.

Currently, Imagen 3 is available through Google’s ImageFX platform and Vertex AI. Looking ahead, Google plans to introduce popular editing features from Imagen 2, such as inpainting (editing elements in the image) and outpainting (expanding it), to Imagen 3 in the coming months. The company has also announced intentions to expand Imagen 3’s availability across its broader product ecosystem, including integration into the Gemini app, Google Workspace, and Google Ads.

This release is part of a broader Google strategy that aims to put Gemini and AI technology in basically all of its services and hardware. This week, the company introduced its new Pixel 9 lineup, which was designed with AI capabilities at its core. The new Pixel phones can handle certain generative AI tasks locally, including text-based tasks and small image generations.

The release of Imagen 3 comes amid a flurry of activity in the AI image generation space. Elon Musk’s xAI recently unveiled Grok 2, featuring the Flux.1 image generator, which has gained attention for its ability to produce highly realistic, uncensored images alongside strong text generation capabilities.

Meanwhile, MidJourney, another key player in the field, announced an imminent v6.2 update to its model. The company also teased the development of MidJourney v7, slated for release in the coming months. Ideogram, another contender in the AI image generation arena, has also hinted at a forthcoming update to its model. Finally. the Open Model Initiative has chosen Flux.1 as the foundation for developing its state-of-the-art open-source image generation model.

Edited by Ryan Ozawa.

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.

Source link

Jose Antonio Lanz

https://decrypt.co/245121/google-imagen-3-release-imagefx

2024-08-16 20:03:57

bitcoin
Bitcoin (BTC) $ 91,379.48 2.60%
ethereum
Ethereum (ETH) $ 3,123.10 0.86%
tether
Tether (USDT) $ 1.00 0.10%
solana
Solana (SOL) $ 218.84 2.70%
bnb
BNB (BNB) $ 623.21 0.18%
xrp
XRP (XRP) $ 1.02 26.17%
dogecoin
Dogecoin (DOGE) $ 0.375355 0.16%
usd-coin
USDC (USDC) $ 1.00 0.09%
staked-ether
Lido Staked Ether (STETH) $ 3,121.73 0.82%
cardano
Cardano (ADA) $ 0.741201 20.76%
tron
TRON (TRX) $ 0.193017 6.68%
shiba-inu
Shiba Inu (SHIB) $ 0.000025 4.79%
avalanche-2
Avalanche (AVAX) $ 34.86 7.94%
the-open-network
Toncoin (TON) $ 5.50 3.87%
wrapped-bitcoin
Wrapped Bitcoin (WBTC) $ 91,109.41 2.42%
wrapped-steth
Wrapped stETH (WSTETH) $ 3,697.80 0.83%
sui
Sui (SUI) $ 3.90 15.35%
pepe
Pepe (PEPE) $ 0.000022 3.26%
weth
WETH (WETH) $ 3,123.34 0.99%
chainlink
Chainlink (LINK) $ 14.11 4.92%
bitcoin-cash
Bitcoin Cash (BCH) $ 439.93 3.95%
polkadot
Polkadot (DOT) $ 5.33 8.05%
near
NEAR Protocol (NEAR) $ 6.25 12.09%
leo-token
LEO Token (LEO) $ 7.73 3.71%
litecoin
Litecoin (LTC) $ 88.78 7.79%
aptos
Aptos (APT) $ 12.39 3.80%
wrapped-eeth
Wrapped eETH (WEETH) $ 3,286.40 0.87%
uniswap
Uniswap (UNI) $ 8.85 6.80%
usds
USDS (USDS) $ 0.995206 0.82%
stellar
Stellar (XLM) $ 0.157051 19.04%
crypto-com-chain
Cronos (CRO) $ 0.16672 3.18%
internet-computer
Internet Computer (ICP) $ 9.18 13.09%
bittensor
Bittensor (TAO) $ 526.05 2.97%
dogwifcoin
dogwifhat (WIF) $ 3.75 1.55%
kaspa
Kaspa (KAS) $ 0.14745 12.85%
ethereum-classic
Ethereum Classic (ETC) $ 23.99 7.69%
fetch-ai
Artificial Superintelligence Alliance (FET) $ 1.30 3.80%
hedera-hashgraph
Hedera (HBAR) $ 0.086746 24.58%
dai
Dai (DAI) $ 1.00 0.01%
whitebit
WhiteBIT Coin (WBT) $ 22.29 0.06%
ethena-usde
Ethena USDe (USDE) $ 1.00 0.00%
polygon-ecosystem-token
POL (ex-MATIC) (POL) $ 0.398013 9.23%
blockstack
Stacks (STX) $ 1.94 5.62%
bonk
Bonk (BONK) $ 0.000041 7.26%
render-token
Render (RENDER) $ 7.11 5.17%
monero
Monero (XMR) $ 149.04 2.56%
okb
OKB (OKB) $ 44.19 2.08%
first-digital-usd
First Digital USD (FDUSD) $ 1.00 0.11%
filecoin
Filecoin (FIL) $ 4.27 8.10%
aave
Aave (AAVE) $ 168.61 4.07%