BPOI Banner
OpenAI Twitter Accounts Link to Crypto Scam After Another Hack OpenAI Twitter Accounts Link to Crypto Scam After Another Hack

OpenAI Launches Advanced Voice Mode, Minus the Scarlett Johansson Drama

OpenAI has begun rolling out its much-anticipated Advanced Voice Mode for ChatGPT Plus and Teams users, marking another step towards a more human-like AI interaction.

The feature allows for real-time, fluid conversations powered by GPT-4o, OpenAI’s latest model, which combines text, vision, and audio to deliver faster responses.

“Advanced Voice is rolling out to all Plus and Team users in the ChatGPT app over the course of the week,” OpenAI said in an official tweet, “It can also say “Sorry I’m late” in over 50 languages,” it added—addressing the long delay this project went through.

Needless to say, one notable element is still missing: the flirty and definitely too human-like “Sky” voice, which caused a stir for its uncanny resemblance to actress Scarlett Johansson. After her legal team sent letters to OpenAI’s CEO Sam Altman, OpenAI put the Sky voice on hold, maintaining that any resemblance between Johansson’s distinctive voice and Sky was purely coincidental.

Instead, OpenAI introduced five new voices: Arbor, Maple, Sol, Spruce, and Vale, which are available in both Standard and Advanced Voice Mode. These join the previously available Breeze, Juniper, Cove, and Ember. (For some reason, the company seems to be naming them after soap fragrances.) Users in the Plus and Team tiers will gradually gain access to these new voices, designed to make conversations more natural, with emotional responsiveness and the ability to interrupt and switch topics on the fly.

Additionally, OpenAI is adding compatibility with custom instructions and “memories” to allow users to personalize their ChatGPT experience further, tailoring interactions to their preferences. Just as the text-based chatbot learns from your instructions (i.e., your name, occupation, and probably the type of answers you like to read), the new voices will try to learn from your conversations, making them more natural, familiar, and used to your preferences.

Users in the EU, UK, Switzerland, Iceland, Norway, and Liechtenstein will have to wait, as the feature has not yet rolled out in those regions. Enterprise and Edu users can expect access starting next week, according to OpenAI’s timeline. The rollout is slow, and not all users, even from supported regions, have the feature available.

OpenAI also refined accents in popular foreign languages and enhanced conversational speed and smoothness. The design has also been updated, with an animated blue sphere that visually represents the voice interaction as it happens and is more aesthetically pleasing than the minimalist black dot they used to show.

Image: OpenAI

While OpenAI continues to refine its voice AI offerings, competition in the space has been heating up.

Google’s NotebookLM currently sets the bar with some of the most human-like AI voices available, able to simulate entire debates between AI-generated speakers with remarkable realism.

Google’s AI tool can process up to one million data tokens and let users interact with it, Decrypt previously reported. Once users upload a specific group of documents with different types of information, Notebook LM can generate up to 10 minutes of audio with two AIs talking about that specific information. The result is almost extremely realistic.

Besides Google, Meta has also entered the fray with its own live assistant, Meta AI, though it is not yet widely available. The assistant is also capable of having natural conversations with users, processing commands fluently. The voice is more natural than the typically robotic voice we see in most AI assistants, but it still has some giveaways—like the speech cadence and speed—that make it identifiable as AI-generated. That said, Reuters has reported that Meta’s upcoming chatbot will have the personas of Judy Dench and Michael Cerna. It’s not Scarlet Johansson, but nor is it chopped liver.

Edited by Josh Quittner and Sebastian Sinclair

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.



Source link

Jose Antonio Lanz

https://decrypt.co/251069/openai-launches-advanced-voice-mode-minus-scarjo-drama

2024-09-25 00:41:42

bitcoin
Bitcoin (BTC) $ 94,847.41 1.95%
ethereum
Ethereum (ETH) $ 3,283.72 1.68%
tether
Tether (USDT) $ 0.998625 0.14%
xrp
XRP (XRP) $ 2.18 1.82%
bnb
BNB (BNB) $ 650.67 1.68%
solana
Solana (SOL) $ 181.23 0.01%
dogecoin
Dogecoin (DOGE) $ 0.31096 1.67%
usd-coin
USDC (USDC) $ 1.00 0.11%
cardano
Cardano (ADA) $ 0.881306 1.27%
staked-ether
Lido Staked Ether (STETH) $ 3,264.82 2.24%
tron
TRON (TRX) $ 0.243883 0.46%
avalanche-2
Avalanche (AVAX) $ 36.50 1.79%
chainlink
Chainlink (LINK) $ 22.13 0.47%
the-open-network
Toncoin (TON) $ 5.39 2.03%
wrapped-steth
Wrapped stETH (WSTETH) $ 3,893.20 1.67%
sui
Sui (SUI) $ 4.37 0.05%
wrapped-bitcoin
Wrapped Bitcoin (WBTC) $ 94,616.35 1.92%
shiba-inu
Shiba Inu (SHIB) $ 0.000021 0.45%
stellar
Stellar (XLM) $ 0.354942 0.94%
polkadot
Polkadot (DOT) $ 6.82 1.46%
hedera-hashgraph
Hedera (HBAR) $ 0.262035 4.34%
hyperliquid
Hyperliquid (HYPE) $ 28.63 14.96%
weth
WETH (WETH) $ 3,289.15 1.64%
bitcoin-cash
Bitcoin Cash (BCH) $ 443.22 1.73%
leo-token
LEO Token (LEO) $ 9.31 0.02%
uniswap
Uniswap (UNI) $ 13.72 1.83%
litecoin
Litecoin (LTC) $ 100.29 0.44%
pepe
Pepe (PEPE) $ 0.000018 3.06%
wrapped-eeth
Wrapped eETH (WEETH) $ 3,469.00 1.66%
near
NEAR Protocol (NEAR) $ 5.03 0.76%
ethena-usde
Ethena USDe (USDE) $ 0.999834 0.08%
bitget-token
Bitget Token (BGB) $ 4.08 2.45%
usds
USDS (USDS) $ 0.999189 0.40%
aptos
Aptos (APT) $ 9.10 2.90%
aave
Aave (AAVE) $ 320.93 7.47%
internet-computer
Internet Computer (ICP) $ 9.88 0.71%
crypto-com-chain
Cronos (CRO) $ 0.153513 1.50%
polygon-ecosystem-token
POL (ex-MATIC) (POL) $ 0.472152 0.07%
mantle
Mantle (MNT) $ 1.17 0.01%
ethereum-classic
Ethereum Classic (ETC) $ 25.99 0.13%
vechain
VeChain (VET) $ 0.045524 0.93%
render-token
Render (RENDER) $ 7.00 0.86%
monero
Monero (XMR) $ 191.08 3.02%
whitebit
WhiteBIT Coin (WBT) $ 24.36 0.61%
mantra-dao
MANTRA (OM) $ 3.69 2.14%
dai
Dai (DAI) $ 1.00 0.43%
bittensor
Bittensor (TAO) $ 449.46 0.76%
fetch-ai
Artificial Superintelligence Alliance (FET) $ 1.25 0.17%
arbitrum
Arbitrum (ARB) $ 0.744995 0.16%
ethena
Ethena (ENA) $ 1.04 1.58%