Sujet : Re: Short Vectors Versus Long Vectors
De : cr88192 (at) *nospam* gmail.com (BGB)
Groupes : comp.archDate : 24. Apr 2024, 10:10:25
Autres entêtes
Organisation : A noiseless patient Spider
Message-ID : <v0aia4$282ia$1@dont-email.me>
References : 1 2 3 4 5 6
User-Agent : Mozilla Thunderbird
On 4/24/2024 2:08 AM, Lawrence D'Oliveiro wrote:
On Wed, 24 Apr 2024 00:57:07 -0600, John Savard wrote:
But they've managed to get GPUs to multiply matrices - and they're quite
good at it, which is why we're having all this amazing progress in AI
recently.
Worth noting that this AI stuff requires very low-precision floats: 16-
bit, even 8-bit. And they sacrifice mantissa bits in favour of exponents--
down to something like maybe only a couple of mantissa bits in the 8-bit
format.
Yeah... Something like S.E4.F3 or similar is almost useless, but not entirely useless.
In my case I have a few things to allow for 8-bit formats as storage, but ended up going with Binary16 as the smallest floating-point format used for actual computation.
In some of my own small experiments with NN's (mostly perceptron style), FP8 was an OK format for storing weights and biases, but generally had an adverse effect if used for the accumulators (which need at least slightly higher precision to "actually accumulate stuff" effectively).
Something like FP12 (S.E5.F6) would probably be OK though, and in most other areas one can "throw precision to the wind" with this stuff.
But, this was my own limited experience.
Probably, if doing something like, say, multiplying two FP8 values and then accumulating as a 16-bit fixed-point value or similar, converting the result back to FP8 as part of the activation function, this could probably also work for specialized hardware.
Or, maybe even go smaller for the impulses, say, 3-bit:
000: 0.0 001: 0.5 010: 1.0 011: 1.5+
100: -0.0 101: -0.5 110: -1.0 111: -1.5+
Where, say, weights could be FP8 but impulses would be mostly 3-bit (since, following the activation function, the actual values are no longer particularly relevant).
This could be used to further reduce the storage cost, and the cost of multipliers (which are basically a 6-bit lookup and a bit-shift in this case). Though, FP8*FP8 isn't that much more, and would possibly be cheaper if the output is Binary16 (though, Binary16 would be more expensive to accumulate than, say, a 16-bit fixed-point value).
...
Date | Sujet | # | | Auteur |
23 Apr 24 | Short Vectors Versus Long Vectors | 109 | | Lawrence D'Oliveiro |
23 Apr 24 | Re: Short Vectors Versus Long Vectors | 97 | | MitchAlsup1 |
23 Apr 24 | Re: Short Vectors Versus Long Vectors | 21 | | Lawrence D'Oliveiro |
23 Apr 24 | Re: Short Vectors Versus Long Vectors | 15 | | Anton Ertl |
23 Apr 24 | Re: Short Vectors Versus Long Vectors | 3 | | Lawrence D'Oliveiro |
23 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | Anton Ertl |
23 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | MitchAlsup1 |
23 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | MitchAlsup1 |
23 Apr 24 | Re: Short Vectors Versus Long Vectors | 10 | | BGB |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 9 | | MitchAlsup1 |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 8 | | BGB |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 6 | | Lawrence D'Oliveiro |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 5 | | MitchAlsup1 |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 2 | | BGB |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | Lawrence D'Oliveiro |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 2 | | Terje Mathisen |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | MitchAlsup1 |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | MitchAlsup1 |
23 Apr 24 | Re: Short Vectors Versus Long Vectors | 5 | | MitchAlsup1 |
23 Apr 24 | Re: Short Vectors Versus Long Vectors | 4 | | Lawrence D'Oliveiro |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 3 | | MitchAlsup1 |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 2 | | Lawrence D'Oliveiro |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | MitchAlsup1 |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 75 | | John Savard |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 26 | | MitchAlsup1 |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 23 | | John Savard |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 2 | | Lawrence D'Oliveiro |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | BGB |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 19 | | Anton Ertl |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 18 | | Lawrence D'Oliveiro |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 2 | | Michael S |
27 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | Lawrence D'Oliveiro |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 15 | | John Levine |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 14 | | MitchAlsup1 |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 9 | | Michael S |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 8 | | MitchAlsup1 |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 7 | | Michael S |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 6 | | BGB |
27 Apr 24 | Re: Short Vectors Versus Long Vectors | 5 | | Thomas Koenig |
27 Apr 24 | Re: Short Vectors Versus Long Vectors | 3 | | John Levine |
28 Apr 24 | Re: Short Vectors Versus Long Vectors | 2 | | Lawrence D'Oliveiro |
28 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | John Levine |
28 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | Tim Rentsch |
26 Apr 24 | Re: Short Vectors Versus Long Vectors | 4 | | Lawrence D'Oliveiro |
26 Apr 24 | Re: Short Vectors Versus Long Vectors | 3 | | MitchAlsup1 |
26 Apr 24 | Re: Short Vectors Versus Long Vectors | 2 | | Lawrence D'Oliveiro |
26 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | MitchAlsup1 |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | David Schultz |
27 Apr 24 | Re: Short Vectors Versus Long Vectors | 2 | | aph |
27 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | MitchAlsup1 |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 2 | | Lawrence D'Oliveiro |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | Anton Ertl |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | Thomas Koenig |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 45 | | Anton Ertl |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 43 | | Lawrence D'Oliveiro |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 42 | | Anton Ertl |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 41 | | Lawrence D'Oliveiro |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 40 | | John Savard |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 39 | | Lawrence D'Oliveiro |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 3 | | Michael S |
26 Apr 24 | Re: Short Vectors Versus Long Vectors | 2 | | Lawrence D'Oliveiro |
26 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | Michael S |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 7 | | John Levine |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 6 | | Thomas Koenig |
29 Apr 24 | Re: Short Vectors Versus Long Vectors | 5 | | George Neuner |
29 Apr 24 | Re: Short Vectors Versus Long Vectors | 3 | | Terje Mathisen |
29 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | BGB |
30 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | George Neuner |
29 Apr 24 | Re: lotsa power, Short Vectors Versus Long Vectors | 1 | | John Levine |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 28 | | John Savard |
25 Apr 24 | Re: Short Vectors Versus Long Vectors | 27 | | Thomas Koenig |
25 Apr 24 | Re: lots of juice, Short Vectors Versus Long Vectors | 3 | | John Levine |
27 Apr 24 | Re: lots of juice, Short Vectors Versus Long Vectors | 2 | | Thomas Koenig |
28 Apr 24 | Re: lots of juice, Short Vectors Versus Long Vectors | 1 | | Lawrence D'Oliveiro |
28 Apr 24 | Re: Short Vectors Versus Long Vectors | 23 | | Tim Rentsch |
30 Apr 24 | Re: Short Vectors Versus Long Vectors | 22 | | Thomas Koenig |
30 Apr 24 | Re: old power, Short Vectors Versus Long Vectors | 19 | | John Levine |
30 Apr 24 | Re: old power, Short Vectors Versus Long Vectors | 18 | | Lawrence D'Oliveiro |
1 May 24 | Re: old power, Short Vectors Versus Long Vectors | 17 | | John Levine |
1 May 24 | Re: old power, Short Vectors Versus Long Vectors | 1 | | MitchAlsup1 |
2 May 24 | Re: old power, Short Vectors Versus Long Vectors | 15 | | BGB |
2 May 24 | Re: old power, Short Vectors Versus Long Vectors | 14 | | MitchAlsup1 |
3 May 24 | Re: old power, Short Vectors Versus Long Vectors | 1 | | BGB |
3 May 24 | Re: old power, Short Vectors Versus Long Vectors | 12 | | Lawrence D'Oliveiro |
3 May 24 | Re: old power, Short Vectors Versus Long Vectors | 11 | | BGB |
4 May 24 | Re: old power, Short Vectors Versus Long Vectors | 10 | | MitchAlsup1 |
5 May 24 | Re: old power, Short Vectors Versus Long Vectors | 4 | | Thomas Koenig |
5 May 24 | Re: old power, Short Vectors Versus Long Vectors | 1 | | BGB |
5 May 24 | Re: old power, Short Vectors Versus Long Vectors | 2 | | MitchAlsup1 |
5 May 24 | Re: old power, Short Vectors Versus Long Vectors | 1 | | BGB |
5 May 24 | Re: old power, Short Vectors Versus Long Vectors | 2 | | BGB |
5 May 24 | Re: old power, Short Vectors Versus Long Vectors | 1 | | MitchAlsup1 |
6 May 24 | Re: not even sort of old power, Short Vectors Versus Long Vectors | 3 | | John Levine |
6 May 24 | Re: not even sort of old power, Short Vectors Versus Long Vectors | 1 | | MitchAlsup1 |
6 May 24 | Re: not even sort of old power, Short Vectors Versus Long Vectors | 1 | | Thomas Koenig |
1 May 24 | Re: Short Vectors Versus Long Vectors | 2 | | Tim Rentsch |
1 May 24 | Re: Short Vectors Versus Long Vectors | 1 | | Thomas Koenig |
24 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | MitchAlsup1 |
30 Apr 24 | Re: Short Vectors Versus Long Vectors | 11 | | MitchAlsup1 |
30 Apr 24 | Re: Short Vectors Versus Long Vectors | 1 | | MitchAlsup1 |
1 May 24 | Re: Short Vectors Versus Long Vectors | 9 | | Lawrence D'Oliveiro |