OpenAI Launches Advanced Voice Mode, Minus the Scarlett Johansson Drama

OpenAI has begun rolling out its much-anticipated Advanced Voice Mode for ChatGPT Plus and Team users, marking another step toward more human-like AI interaction.

The feature allows for real-time, fluid conversations powered by GPT-4o, OpenAI’s latest model, which combines text, vision, and audio to deliver faster responses.

“Advanced Voice is rolling out to all Plus and Team users in the ChatGPT app over the course of the week,” OpenAI said in an official tweet. “It can also say ‘Sorry I’m late’ in over 50 languages,” it added, a nod to the long delay the project went through.

However, one notable element is still missing: the flirty and definitely too human-like “Sky” voice, which caused a stir for its uncanny resemblance to actress Scarlett Johansson. After her legal team sent letters to OpenAI CEO Sam Altman, OpenAI put the Sky voice on hold, maintaining that any resemblance between Johansson’s distinctive voice and Sky was purely coincidental.

Instead, OpenAI introduced five new voices: Arbor, Maple, Sol, Spruce, and Vale, which are available in both Standard and Advanced Voice Mode. These join the previously available Breeze, Juniper, Cove, and Ember. (For some reason, the company seems to be naming them after soap fragrances.) Users in the Plus and Team tiers will gradually gain access to these new voices, designed to make conversations more natural, with emotional responsiveness and the ability to interrupt and switch topics on the fly.

Additionally, OpenAI is adding compatibility with custom instructions and “memories” so users can further personalize their ChatGPT experience and tailor interactions to their preferences. Just as the text-based chatbot learns from your instructions (e.g., your name, occupation, and probably the type of answers you like to read), the new voices will try to learn from your conversations, making them more natural, familiar, and attuned to your preferences.

Users in the EU, UK, Switzerland, Iceland, Norway, and Liechtenstein will have to wait, as the feature has not yet rolled out in those regions. Enterprise and Edu users can expect access starting next week, according to OpenAI’s timeline. The rollout is slow, and not all users, even from supported regions, have the feature available.

OpenAI also refined accents in popular foreign languages and improved conversational speed and smoothness. The design has been updated as well: an animated blue sphere now visually represents the voice interaction as it happens, and it is more aesthetically pleasing than the minimalist black dot the app previously showed.

Image: OpenAI

While OpenAI continues to refine its voice AI offerings, competition in the space has been heating up.

Google’s NotebookLM currently sets the bar with some of the most human-like AI voices available, able to simulate entire debates between AI-generated speakers with remarkable realism.

Google’s AI tool can process up to one million tokens of data and lets users interact with it, Decrypt previously reported. Once users upload a group of documents containing different types of information, NotebookLM can generate up to 10 minutes of audio in which two AI voices discuss that material. The result is strikingly realistic.

Besides Google, Meta has also entered the fray with its own live assistant, Meta AI, though it is not yet widely available. The assistant is also capable of holding natural conversations with users and processing commands fluently. Its voice is more natural than the typically robotic voice of most AI assistants, but it still has some giveaways, like the speech cadence and speed, that make it identifiable as AI-generated. That said, Reuters has reported that Meta’s upcoming chatbot will have the personas of Judi Dench and Michael Cerna. It’s not Scarlett Johansson, but nor is it chopped liver.

Edited by Josh Quittner and Sebastian Sinclair


Jose Antonio Lanz

https://decrypt.co/251069/openai-launches-advanced-voice-mode-minus-scarjo-drama

2024-09-25 00:41:42
