Re: ongoing infrastructure changes with AI in the USA

Liste des GroupesRevenir à ras written 
Sujet : Re: ongoing infrastructure changes with AI in the USA
De : wollman (at) *nospam* hergotha.csail.mit.edu (Garrett Wollman)
Groupes : rec.arts.sf.written
Date : 15. Nov 2024, 21:23:10
Autres entêtes
Organisation : MIT Computer Science & Artificial Intelligence Lab
Message-ID : <vh8aje$e4l$2@usenet.csail.mit.edu>
References : 1 2 3
User-Agent : trn 4.0-test77 (Sep 1, 2010)
In article <cc5134e0-99b3-ae4b-3943-765a891ee7c9@example.net>,
D  <nospam@example.net> wrote:
On Thu, 14 Nov 2024, Scott Lurndal wrote:
Lynn McGuire <lynnmcguire5@gmail.com> writes:
I am on the periphery of the ongoing blanketing of the USA with AI
servers.  I have a few facts that might just blow you away.
>
The expected number of AI servers in the USA alone is presently a
million (SWAG).  The current cost for a single AI server is $500,000 US.
   1,000,000 x $500,000 = $500 billion US of capital.
>
First, What is your source for this data?  Be specific.
>
Second, define precisely what an "AI server" is.
>
My guess would be a server stuffed with GPU:s.

As someone who is currently involved with such things, while I'm not
at liberty to comment on pricing, a typical "state of the art" compute
node for a machine learning cluster might include:

- multiple CPUs with many cores each
- a lot of RAM
- a lot of very fast solid-state disk
- 8 H200 GPUs (also many cores each)
- 8 ports of 400G Infiniband
- 2 ports of 100G or 200G Ethernet
- about 30 kW in power supplies
- enough cooling (fans and/or liquid cooling systems) to dissipate 30
  kW of waste heat

Because most ML work ("AI" or "training") happens on the GPUs, there
is typically only enough CPU to handle the I/O load.  The Infiniband
is used exclusively for low-latency GPU-to-GPU communication across
the cluster; the regular ingress and egress happen over Ethernet.

While I can't comment on specific costs, I will say that the retail
cost is far higher than the BOM cost, and most of that profit stays in
the pockets of Nvidia.

-GAWollman

--
Garrett A. Wollman    | "Act to avoid constraining the future; if you can,
wollman@bimajority.org| act to remove constraint from the future.  This is
Opinions not shared by| a thing you can do, are able to do, to do together."
my employers.         | - Graydon Saunders, _A Succession of Bad Days_ (2015)

Date Sujet#  Auteur
14 Nov 24 * ongoing infrastructure changes with AI in the USA38Lynn McGuire
14 Nov 24 +* Re: ongoing infrastructure changes with AI in the USA2Paul S Person
15 Nov 24 i`- Re: ongoing infrastructure changes with AI in the USA1Paul S Person
14 Nov 24 +* Re: ongoing infrastructure changes with AI in the USA23D
15 Nov 24 i+* Re: ongoing infrastructure changes with AI in the USA21Lynn McGuire
15 Nov 24 ii`* Re: ongoing infrastructure changes with AI in the USA20D
15 Nov 24 ii `* Re: ongoing infrastructure changes with AI in the USA19Lynn McGuire
15 Nov 24 ii  `* Re: ongoing infrastructure changes with AI in the USA18D
16 Nov 24 ii   `* Re: ongoing infrastructure changes with AI in the USA17Paul S Person
17 Nov 24 ii    `* Re: ongoing infrastructure changes with AI in the USA16Gary R. Schmidt
17 Nov 24 ii     `* Re: ongoing infrastructure changes with AI in the USA15Paul S Person
17 Nov 24 ii      +- Re: ongoing infrastructure changes with AI in the USA1Cryptoengineer
18 Nov 24 ii      `* Re: ongoing infrastructure changes with AI in the USA13Jay E. Morris
18 Nov 24 ii       `* Re: ongoing infrastructure changes with AI in the USA12Paul S Person
18 Nov 24 ii        `* Re: ongoing infrastructure changes with AI in the USA11Jay E. Morris
18 Nov 24 ii         `* Re: ongoing infrastructure changes with AI in the USA10Jay E. Morris
19 Nov 24 ii          `* Re: ongoing infrastructure changes with AI in the USA9Paul S Person
19 Nov 24 ii           +- Re: ongoing infrastructure changes with AI in the USA1Jay E. Morris
19 Nov 24 ii           `* Re: ongoing infrastructure changes with AI in the USA7Cryptoengineer
19 Nov 24 ii            `* Re: ongoing infrastructure changes with AI in the USA6Garrett Wollman
20 Nov 24 ii             `* Re: ongoing infrastructure changes with AI in the USA5Jay E. Morris
20 Nov 24 ii              +* Re: ongoing infrastructure changes with AI in the USA3Cryptoengineer
20 Nov 24 ii              i`* Re: ongoing infrastructure changes with AI in the USA2ted@loft.tnolan.com (Ted Nolan
21 Nov 24 ii              i `- Re: ongoing infrastructure changes with AI in the USA1Jay E. Morris
20 Nov 24 ii              `- Re: ongoing infrastructure changes with AI in the USA1Paul S Person
15 Nov 24 i`- Re: ongoing infrastructure changes with AI in the USA1Garrett Wollman
15 Nov 24 +* Re: ongoing infrastructure changes with AI in the USA3Titus G
15 Nov 24 i`* Re: ongoing infrastructure changes with AI in the USA2Lynn McGuire
15 Nov 24 i `- Re: ongoing infrastructure changes with AI in the USA1Titus G
15 Nov 24 +* Re: ongoing infrastructure changes with AI in the USA5William Hyde
15 Nov 24 i`* Re: ongoing infrastructure changes with AI in the USA4Lynn McGuire
16 Nov 24 i +* Re: ongoing infrastructure changes with AI in the USA2William Hyde
16 Nov 24 i i`- Re: ongoing infrastructure changes with AI in the USA1Bobbie Sellers
16 Nov 24 i `- Re: ongoing infrastructure changes with AI in the USA1Paul S Person
17 Nov 24 `* Re: ongoing infrastructure changes with AI in the USA4Lynn McGuire
18 Nov 24  `* Re: ongoing infrastructure changes with AI in the USA3Paul S Person
18 Nov 24   `* Re: ongoing infrastructure changes with AI in the USA2Lynn McGuire
19 Nov 24    `- Re: ongoing infrastructure changes with AI in the USA1D

Haut de la page

Les messages affichés proviennent d'usenet.

NewsPortal