BPOI Banner
AI Startup Hugging Face is Building Small LMs for 'Next Stage Robotics' AI Startup Hugging Face is Building Small LMs for 'Next Stage Robotics'

AI Startup Hugging Face is Building Small LMs for ‘Next Stage Robotics’

AI startup Hugging Face envisions that small—not large—language models will be used for applications including “next stage robotics,” its Co-Founder and Chief Science Officer Thomas Wolf said.

“We want to deploy models in robots that are smarter, so we can start having robots that are not only on assembly lines, but also in the wild,” Wolf said while speaking at Web Summit in Lisbon today.  But that goal, he said, requires low latency. “You cannot wait two seconds so that your robots understand what’s happening, and the only way we can do that is through a small language model,” Wolf added.

Small language models “can do a lot of the tasks we thought only large models could do,” Wolf said, adding that they can also be deployed on-device. “If you think about this kind of game changer, you can have them running on your laptop,” he said. “You can have them running even on your smartphone in the future.”

Ultimately, he envisions small language models running “in almost every tool or appliance that we have, just like today, our fridge is connected to the internet.”

The firm released its SmolLM language model earlier this year. “We are not the only one,” said Wolf, adding that, “Almost every open source company has been releasing smaller and smaller models this year.”

He explained that, “For a lot of very interesting tasks that we need that we could automate with AI, we don’t need to have a model that can solve the Riemann conjecture or general relativity.” Instead, simple tasks such as data wrangling, image processing and speech can be performed using small language models, with corresponding benefits in speed.

The performance of Hugging Face’s LLaMA 1b model to 1 billion parameters this year is “equivalent, if not better than, the performance of a 10 billion parameters model of last year,” he said. “So you have a 10 times smaller model that can reach roughly similar performance.”

“A lot of the knowledge we discovered for our large language model can actually be translated to smaller models,” Wolf said. He explained that the firm trains them on “very specific data sets” that are “slightly simpler, with some form of adaptation that’s tailored for this model.”

Those adaptations include “very tiny, tiny neural nets that you put inside the small model,” he said. “And you have an even smaller model that you add into it and that specializes,” a process he likened to “putting a hat for a specific task that you’re gonna do. I put my cooking hat on, and I’m a cook.”

In the future, Wolf said, the AI space will split across two main trends.

“On the one hand, we’ll have this huge frontier model that will keep getting bigger, because the ultimate goal is to do things that human cannot do, like new scientific discoveries,” using LLMs, he said. The long tail of AI applications will see the technology “embedded a bit everywhere, like we have today with the internet.”

Edited by Stacy Elliott.

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.

Source link

Stephen Graves

https://decrypt.co/291237/ai-startup-hugging-face-small-lms-robots

2024-11-12 13:50:14

bitcoin
Bitcoin (BTC) $ 96,965.98 0.87%
ethereum
Ethereum (ETH) $ 3,376.80 1.35%
tether
Tether (USDT) $ 0.999189 0.15%
xrp
XRP (XRP) $ 2.26 1.21%
bnb
BNB (BNB) $ 665.68 1.08%
solana
Solana (SOL) $ 186.22 2.85%
dogecoin
Dogecoin (DOGE) $ 0.321545 3.53%
usd-coin
USDC (USDC) $ 0.999684 0.13%
staked-ether
Lido Staked Ether (STETH) $ 3,369.62 1.32%
cardano
Cardano (ADA) $ 0.912281 4.01%
tron
TRON (TRX) $ 0.247913 0.84%
avalanche-2
Avalanche (AVAX) $ 37.98 5.54%
chainlink
Chainlink (LINK) $ 22.46 4.90%
wrapped-steth
Wrapped stETH (WSTETH) $ 4,012.90 1.62%
the-open-network
Toncoin (TON) $ 5.41 0.56%
sui
Sui (SUI) $ 4.47 5.04%
shiba-inu
Shiba Inu (SHIB) $ 0.000022 4.10%
wrapped-bitcoin
Wrapped Bitcoin (WBTC) $ 96,817.94 0.91%
hyperliquid
Hyperliquid (HYPE) $ 33.89 2.32%
stellar
Stellar (XLM) $ 0.366178 1.95%
polkadot
Polkadot (DOT) $ 7.10 3.79%
hedera-hashgraph
Hedera (HBAR) $ 0.262056 2.01%
weth
WETH (WETH) $ 3,377.15 1.39%
bitcoin-cash
Bitcoin Cash (BCH) $ 455.73 2.00%
leo-token
LEO Token (LEO) $ 9.30 0.43%
uniswap
Uniswap (UNI) $ 14.17 3.16%
litecoin
Litecoin (LTC) $ 103.25 0.44%
pepe
Pepe (PEPE) $ 0.000018 3.87%
wrapped-eeth
Wrapped eETH (WEETH) $ 3,565.38 1.89%
near
NEAR Protocol (NEAR) $ 5.09 4.69%
ethena-usde
Ethena USDe (USDE) $ 0.999056 0.06%
bitget-token
Bitget Token (BGB) $ 4.18 1.94%
aptos
Aptos (APT) $ 9.50 9.67%
usds
USDS (USDS) $ 0.996873 0.16%
internet-computer
Internet Computer (ICP) $ 10.14 6.10%
aave
Aave (AAVE) $ 307.54 3.64%
crypto-com-chain
Cronos (CRO) $ 0.159886 4.35%
polygon-ecosystem-token
POL (ex-MATIC) (POL) $ 0.485887 2.83%
mantle
Mantle (MNT) $ 1.18 2.59%
ethereum-classic
Ethereum Classic (ETC) $ 26.37 3.29%
render-token
Render (RENDER) $ 7.26 4.12%
vechain
VeChain (VET) $ 0.046192 3.89%
mantra-dao
MANTRA (OM) $ 3.78 4.05%
monero
Monero (XMR) $ 190.67 0.58%
whitebit
WhiteBIT Coin (WBT) $ 24.36 0.39%
bittensor
Bittensor (TAO) $ 465.10 3.49%
dai
Dai (DAI) $ 0.999773 0.07%
fetch-ai
Artificial Superintelligence Alliance (FET) $ 1.29 4.06%
arbitrum
Arbitrum (ARB) $ 0.759703 4.85%
ethena
Ethena (ENA) $ 1.07 7.86%