Newsportal USENET - What does "o1" mean in recent Models (Was: Ilya Sutskever: The Next Oppenheimer)

Sujet : What does "o1" mean in recent Models (Was: Ilya Sutskever: The Next Oppenheimer)
De : janburse (at) *nospam* fastmail.fm (Mild Shock)
Groupes : sci.logic
Date : 19. Dec 2024, 10:39:12

Autres entêtes

Message-ID : <vk0pjv$12ke2$1@solani.org>
References : 1
User-Agent : Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Firefox/91.0 SeaMonkey/2.53.19

Hi,
Could it be that "o1" likely refers to "Optimizer 1".
And what could this include?
- Compressing weights or activations into
fewer bits can significantly reduce computation,
especially in hardware, mimicking O(1)-like
efficiency for certain operations.
- Removing redundant connections in the neural
network leads to fewer computations. Sparse matrix
operations can optimize dense workloads, making specific
inference tasks faster.
- Large models are distilled into smaller ones
with similar capabilities, reducing computational
costs during inference. If the optimized paths are
cleverly structured, their complexity might be
closer to O(1) for lookup-style tasks.
So maybe Ilya Sutskever wants to tell us, in his
recent talk when he refers to the 700g brain line:
Look we did the same as biological evolution,
we found a way to construct more compact brains.
Bye
Mild Shock schrieb:

Hi,
I liked some videos on YouTube:
Ilya Sutskever: The Next Oppenheimer
https://www.youtube.com/watch?v=jryDWOKikys
Ilya Sutskever: Sequence to Sequence Learning
https://www.youtube.com/watch?v=WQQdd6qGxNs
Bye

Date	Sujet	#	Auteur
18 Dec 24	Ilya Sutskever: The Next Oppenheimer	2	Mild Shock
19 Dec 24	What does "o1" mean in recent Models (Was: Ilya Sutskever: The Next Oppenheimer)	1	Mild Shock