mhx@iae.nl (mhx) writes:
>On Mon, 14 Jul 2025 7:50:04 +0000, Anton Ertl wrote:
>>C) Perform tree addition
>>
>>a) Using 80-bit addition.  This will be faster than sequential
>>addition because in many cases several additions can run in
>>parallel.  It will also be quite accurate because it uses 80-bit
>>addition, and because the addition chains are reduced to
>>ld(length(vector)).
>>
>This looks very interesting.  I can find Kahan and Neumaier, but
>"tree addition" didn't turn up (there is a suspicious-looking
>reliability paper about the approach which surely is not what
>you meant).  Or is it pairwise addition that I should look for?
Yes, "tree addition" is not a common term, and Wikipedia calls it
pairwise summation.  Except that, unlike suggested in
<https://en.wikipedia.org/wiki/Pairwise_summation>, I would not switch
to a sequential approach for small n, for both accuracy and
performance reasons.
In any case the idea is to turn the evaluation tree from a degenerate
tree into a balanced tree. E.g., if you add up a, b, c, and d, then
the naive evaluation
a b f+ c f+ d f+
has the evaluation tree
a b
\ /
f+ c
\ /
f+ d
\ /
f+
with the three F+ each depending on the previous one, and also
increasing the rounding errors. If you balance the tree
a b c d
\ / \ /
f+ f+
\ /
f+
corresponding to
a b f+ c d f+ f+
the first two f+ can run in parallel (increasing performance), and the
rounding errors tend to be less.
So how to implement this for an arbitrary N? We had an extensive
discussion of a similar problem in the thread on the subject "balanced
REDUCE: a challenge for the brave", and you can find that discussion
at
<https://comp.lang.forth.narkive.com/GIg9V9HK/balanced-reduce-a-challenge-for-the-brave>
But I decided to use a recursive approach (recursive-sum, REC) that
uses the largest 2^k<n as the left child and the rest as the right
child, and as base cases for the recursion use a straight-line
balanced-tree evaluation for 2^k with k<=7 (and combine these for n
that are not 2^k). For systems with tiny FP stacks, I added the
option to save intermediate results on a software stack in the
recursive word. Concerning the straight-line code, it turned out that
the highest k I could use on sf64 and vfx64 is 5 (corresponding to 6
FP stack items); it's not clear to me why; on lxf I can use k=7 (and
it uses the 387 stack, too).
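To make the recursive idea concrete, here is a minimal sketch (my
sketch, not the code from pairwise-sum.4th): it splits the range in
the middle rather than at the largest 2^k<n, and has no straight-line
base cases, but it produces the same kind of balanced evaluation tree:

```forth
\ Sketch only, not the REC code from pairwise-sum.4th: sum u FLOATs
\ at addr with a balanced evaluation tree, splitting in the middle.
: rec-sum ( addr u -- ) ( F: -- r )
  dup 2 u< if
    if f@ else drop 0e then exit \ base cases: one element or none
  then
  dup 2/ >r                  \ R: half
  over r@ recurse            \ F: sum of the first half elements
  swap r@ floats + swap r> - \ advance addr by half, shrink u by half
  recurse f+ ;               \ F: sum of the whole range
```

The FP stack depth it needs is the recursion depth, i.e.,
ld(u) or so, which is why small base cases (or a software stack)
matter on systems with tiny FP stacks.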
I also coded the shift-reduce-sum algorithm (shift-reduce-sum, SR)
described in <https://en.wikipedia.org/wiki/Pairwise_summation> in
Forth, because it can make use of Forth's features (such as the FP
stack) where the C code has to hand-code it. It uses the FP stack
beyond 8 elements if there are more than 128 elements in the array, so
it does not work for the benchmark (with 100_000 elements in the
array) on lxf, sf64, and vfx64. As you will see, this is no loss.
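The shift-reduce idea can be sketched like this (again my sketch, not
the SR code from the file): the partial sums live on the FP stack;
after pushing element i (counting from 1), perform one F+ for every
trailing zero bit of i, and fold whatever partial sums remain at the
end:

```forth
\ Sketch of shift-reduce summation, not the SR code from the file.
\ After pushing element i (1-based), one F+ per trailing 0 bit of i;
\ popcount(u) partial sums remain at the end and are folded with F+.
: sr-sum ( addr u -- ) ( F: -- r )
  dup 0= if 2drop 0e exit then
  dup >r 0 ?do
    dup i floats + f@                                \ push a[i]
    i 1+ begin dup 1 and 0= while f+ 2/ repeat drop  \ merge partials
  loop drop
  r> 0 swap                                          \ count 1 bits of u
  begin dup while dup 1 and if swap 1+ swap then 2/ repeat drop
  1- 0 ?do f+ loop ;                                 \ fold leftovers
```

The maximal FP stack depth is about ld(i)+1 while processing element
i, which is consistent with the >8-deep usage for arrays beyond 128
elements mentioned above.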
I also coded the naive, sequential approach (naive-sum, NAI).
One might argue that the straight-line stuff in REC puts REC at an
advantage, so I also produced an unrolled version of the naive code
(unrolled-sum, UNR) that uses straight-line sequences for adding up to
2^7 elements to the intermediate result.
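The unrolling idea, sketched for a factor-4 unroll (hypothetical
word name; the file unrolls up to 2^7 elements):

```forth
\ Sketch of the UNR idea: add four consecutive FLOATs at addr to the
\ running sum r, sequentially, so each F+ still depends on the
\ previous one (latency-bound, unlike the balanced tree).
: unr4+ ( addr -- ) ( F: r -- r' )
  dup f@ f+  dup float+ f@ f+
  dup 2 floats + f@ f+  3 floats + f@ f+ ;
```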
You can find a file containing all these versions, compatibility
configurations for various Forth systems, and testing and benchmarking
code and data, on
https://www.complang.tuwien.ac.at/forth/programs/pairwise-sum.4th

I did not do any accuracy measurements, but I did performance
measurements on a Ryzen 5800X:
cycles:u
gforth-fast iforth lxf SwiftForth VFX
3_057_979_501 6_482_017_334 6_087_130_593 6_021_777_424 6_034_560_441 NAI
6_601_284_920 6_452_716_125 7_001_806_497 6_606_674_147 6_713_703_069 UNR
3_787_327_724 2_949_273_264 1_641_710_689 7_437_654_901 1_298_257_315 REC
9_150_679_812 14_634_786_781 SR
instructions:u
gforth-fast iforth lxf SwiftForth VFX
13_113_842_702 6_264_132_870 9_011_308_923 11_011_828_048 8_072_637_768 NAI
6_802_702_884 2_553_418_501 4_238_099_417 11_277_658_203 3_244_590_981 UNR
9_370_432_755 4_489_562_792 4_955_679_285 12_283_918_226 3_915_367_813 REC
51_113_853_111 29_264_267_850 SR
The versions used are:
Gforth 0.7.9_20250625
iForth 5.1-mini
lxf 1.7-172-983
SwiftForth x64-Linux 4.0.0-RC89
VFX Forth 64 5.43 [build 0199] 2023-11-09
The ":u" means that I measured what happened at the user-level, not at
the kernel-level.
Each benchmark run performs 1G f@ and f+ operations, and the naive
approach performs 1G iterations of the loop.
The NAIve and UNRolled results show that performance of both is
limited by the latency of the F+: 3 cycles for the DP SSE2 addition in
gforth-fast (1G dependent F+ at 3 cycles each accounts for the ~3G
cycles of NAI there), and 6 cycles for the 80-bit 387 fadd on the
other systems (accounting for their ~6G cycles).  It's unclear to me
why UNR is much slower than NAI on gforth-fast.
The RECursive balanced-tree sum is faster on iForth, lxf and VFX than
the NAIve and UNRolled versions. It is slower on Gforth: My guess is
that, despite all hardware advances, the lack of multi-state stack
caching in Gforth means that the hardware of the Ryzen 5800X does not
just see the real data flow, but a lot of additional dependences; or
it may be related to whatever causes the slowdown for UNRolled.
The SR (shift-reduce) sum looks cute, but performs so many additional
instructions, even on iForth, that it is uncompetitive.  It's unclear
to me what slows it down so much on iForth, however.
I expect that vectorized implementations using AVX will be several
times faster than the fastest scalar stuff we see here.
- anton
--
M. Anton Ertl  http://www.complang.tuwien.ac.at/anton/home.html
comp.lang.forth FAQs: http://www.complang.tuwien.ac.at/forth/faq/toc.html
New standard: https://forth-standard.org/
EuroForth 2025 CFP: http://www.euroforth.org/ef25/cfp.html