Sujet : Some modern heros of DeepSeek (Re: the asteroid that kills tech dinosaurs)
De : janburse (at) *nospam* fastmail.fm (Mild Shock)
Groupes : sci.physics.relativityDate : 31. Jan 2025, 23:58:26
Autres entêtes
Message-ID : <vnjkii$p4aq$4@solani.org>
References : 1 2
User-Agent : Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:128.0) Gecko/20100101 Firefox/128.0 SeaMonkey/2.53.20
Hi,
Please meet Luo Fuli:
The 29-Year-Old Genius Behind DeepSeek’s AI Revolution
https://www.youtube.com/watch?v=B2fxh4aoQ8QI find this paper interesting, finally
some say about fine tuning during pretraing:
Raise a Child in Large Language Model
13 Sep 2021 - Fuli Luo et al.
https://arxiv.org/pdf/2109.05687Bye
Mild Shock schrieb:
Hi,
So how its going? DeepSeek embraced by many cloud
providers, even by NVIDIA NIM itself.
DeepSeek-R1 Now Live With NVIDIA NIM
https://blogs.nvidia.com/blog/deepseek-r1-nim-microservice/
What what are these models doing and how are they
trained. Is Geoffrey Hinton our only AI God? There
seems to be another slightly disputed AI God,
S. Hochreiter, J. Schmidhuber. Long Short-Term Memory. Neural Computation, 9(8):1735-1780, 1997.
https://people.idsia.ch/~juergen/deep-learning-history.html
Bye
P.S.: It allows a mechanistic view on our linguistic
brain if the latent space is some semantic vectors?
So that learning is a kind of control mechanism:
Machine Learning Approach to Model Order Reduction
of Nonlinear Systems via Autoencoder and LSTM Networks
Thomas Simpson - 23 Sep 2021
https://arxiv.org/abs/2109.11213
Mild Shock schrieb:
Hi,
>
Wait till USA figures out there is a second
competitor besides DeepSeek, its called Yi-Lightning:
>
Yi-Lightning Technical Report
https://arxiv.org/abs/2412.01253
>
It was already discussed 2 months ago:
>
Eric Schmidt DROPS BOMBSHELL: China DOMINATES AI!
https://www.youtube.com/watch?v=ddWuEUjo4u4
>
Bye