What does "o1" mean in recent Models (Was: Ilya Sutskever: The Next Oppenheimer)

Subject : What does "o1" mean in recent Models (Was: Ilya Sutskever: The Next Oppenheimer)
From : janburse (at) *nospam* fastmail.fm (Mild Shock)
Newsgroups : sci.logic
Date : 19. Dec 2024, 10:39:12
Message-ID : <vk0pjv$12ke2$1@solani.org>
References : 1
User-Agent : Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Firefox/91.0 SeaMonkey/2.53.19
Hi,
Could it be that "o1" refers to "Optimizer 1"?
And what could such an optimizer include?
- Quantization: compressing weights or activations into
fewer bits can significantly reduce computation,
especially in hardware, mimicking O(1)-like
efficiency for certain operations.
- Pruning: removing redundant connections in the neural
network leads to fewer computations. Sparse matrix
operations can then replace dense ones, making specific
inference tasks faster.
- Distillation: large models are distilled into smaller ones
with similar capabilities, reducing computational
costs during inference. If the optimized paths are
cleverly structured, their complexity might be
closer to O(1) for lookup-style tasks.
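
To make the first two bullets concrete, here is a minimal sketch in
plain Python, assuming symmetric 8-bit quantization and simple
magnitude pruning. The function names and the toy weight vector are
made up for illustration; nothing here is known to be what "o1"
actually does.

```python
def quantize_int8(weights):
    """Symmetric quantization: map floats to integers in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    return [round(w / scale) for w in weights], scale


def dequantize(q, scale):
    """Recover approximate float weights from the integer codes."""
    return [v * scale for v in q]


def prune_by_magnitude(weights, keep_fraction):
    """Zero the smallest-magnitude weights, keeping about keep_fraction."""
    k = max(1, int(len(weights) * keep_fraction))
    threshold = sorted((abs(w) for w in weights), reverse=True)[k - 1]
    return [w if abs(w) >= threshold else 0.0 for w in weights]


def sparse_dot(weights, inputs):
    """Dot product that skips the connections removed by pruning."""
    return sum(w * x for w, x in zip(weights, inputs) if w != 0.0)


# Hypothetical toy weights, chosen only for illustration.
weights = [0.50, -0.02, 0.31, 0.01, -1.27, 0.04]

q, scale = quantize_int8(weights)        # small integers plus one float scale
restored = dequantize(q, scale)          # close to the original weights
pruned = prune_by_magnitude(weights, keep_fraction=0.5)
```

Note that neither step changes the asymptotic cost: the dot product is
still linear in the number of remaining weights. Quantization makes
each multiplication cheaper, and pruning reduces how many are done, so
the gain is in the constant factor rather than a true O(1).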
So maybe what Ilya Sutskever wants to tell us in his
recent talk, when he refers to the 700g brain line, is:
Look, we did the same as biological evolution;
we found a way to construct more compact brains.
Bye
Mild Shock wrote:
Hi,
 I liked some videos on YouTube:
 Ilya Sutskever: The Next Oppenheimer
https://www.youtube.com/watch?v=jryDWOKikys
 Ilya Sutskever: Sequence to Sequence Learning
https://www.youtube.com/watch?v=WQQdd6qGxNs
 Bye

