Newsportal USENET - Re: Misc: BGBCC targeting RV64G, initial results...

On Fri, 27 Sep 2024 23:53:22 +0000, BGB wrote:

On 9/27/2024 2:40 PM, MitchAlsup1 wrote:
On Fri, 27 Sep 2024 18:26:28 +0000, BGB wrote:
>
But, generally this does still impose limits:
Can't reorder instructions across a label;
Can't move instructions with an associated reloc;

I always did code motion prior to assembler. Code motion only has to
consider:: 1-operand, 2-operand, 3-operand, branch, label, LD, ST.

Can't reorder memory instructions unless they can be proven to not alias
(loads may be freely reordered, but the relative order of loads and
stores may not unless provably non-aliasing);

Same base register different displacement.

The effectiveness of this does depend on how the C code is written
though (works favorably with larger blocks of mostly-independent
expressions).

One of the reasons reservation stations became in vouge.

Most agree it is closer to 30% than 25% {{Unless you clutter up the ISA
such that your typical memref needs a support instruction.
>
>
Cough, RV64...

-----

Which makes that 16% (above) into 48% and renormalizing to::
~ 63% fixed-displacement;
~ 36% register-indexed and support instructions.
>
Yeah.
>
I think there are reasons here why I am generally getting lackluster
performance out of RV64...

-----

Comparably, XG2 has a 16K or 32K reach here (depending on immediate
size), which hits most of the global variables. The fallback Jumbo
encoding hits the rest.
>
I get ±32K with 16-bit displacements
>
>
Baseline has special case 32-bit ops:
   MOV.L (GBR, Disp10u), Rn //4K
   MOV.Q (GBR, Disp10u), Rn //8K
>
But, in XG2, it gains 2 bits:
   MOV.L (GBR, Disp12u), Rn //16K
   MOV.Q (GBR, Disp12u), Rn //32K
>
Jumbo can encode +/- 4GB here (64-bit encoding).
   MOV.L (GBR, Disp33s), Rn //+/- 4GB
   MOV.Q (GBR, Disp33s), Rn //+/- 4GB
>
Mostly because GBR displacements are unscaled.
   Plan for XG3 is that all Disp33s encodings would be unscaled.

The assembler gets to choose based on the memory model::
MEM Rd,[Rb,Ri<<s,DISP]
Assembler (or even linker) can choose 32-bit or 64 bit based on a
variety
of things {flags, memory model, size of linked module,...}

BJX2 can also do (PC, Disp33s) in a single logical instruction...
>
But, RISC-V can't...
>

What is your definition of "single logical instruction". In my parlance,
a single logical instruction can be::
ST #64-bit-const,[Rb,Ri<<s,DISP64]
is 1 instruction occupying 5 words.

>
>
Likewise, no one seems to be bothering with 64-bit ELF FDPIC for RV64
(there does seem to be some interest for ELF FDPIC but limited to 32-bit
RISC-V ...). Ironically, ideas for doing FDPIC in RV aren't too far off
from PBO (namely, using GP for a global section and then chaining the
sections for each binary).
>
How are you going to do dense PIC switch() {...} in RISC-V ??
>
Already implemented...
>
With pseudo-instructions:
SUB Rs, $(MIN), R10
MOV $(MAX-MIN), R11
BGTU R11, R10, Lbl_Dfl
>
MOV   .L0, R6 //AUIPC+ADD
SHAD R10, 2, R10 //SLLI
ADD   R6, R10, R6
JMP   R6    //JALR X0, X6, 0
>
.L0:
BRA Lbl_Case0    //JAL X0, Lbl_Case0
BRA Lbl_Case1
...

Compared to::
// ADD Rt,Rswitch,#-min
JTT Rt,#max
.jttable min, ... , max, default
adder:
The ADD is not necessary if min == 0
The JTT instruction compared Rt with 0 on the low side and max
on the high side. If Ri is out of bounds, default is selected.
The table displacements come in {B,H,W,D} selected in the JTT
(jump through table) instruction. Rt indexes the table, its
signed value is <<2 and added to address which happens to be
address of JTT instruction + #(max+1)<<entry. {{The table is
fetched through the ICache with execute permission}}
Thus, the table is PIC; and generally 1/4 the size of typical
switch tables.
-----

Currently, BGBCC does not use this strategy.
Though, for 64-bit constants it could be more compact and faster.
>
But, better still would be having Jumbo prefixes or similar, or even a
SHORI instruction.
>
Better Still Still is having 32-bit and 64-bit constants available
from the instruction stream and positioned in either operand position.
>
>
Granted...
>
>
Say, 64-bit constant-load in SH-5 or similar:
   xxxxyyyyzzzzwwww
   MOV   ImmX, Rn
   SHORI ImmY, Rn
   SHORI ImmZ, Rn
   SHORI ImmW, Rn
Where, one loads the constant in 16-bit chunks.
>
Yech
>
>
But, 4 is still less than 6.

1 is less than 4, too.

Date	Sujet	#	Auteur
27 Sep 24	Misc: BGBCC targeting RV64G, initial results...	37	BGB
27 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	20	Robert Finch
27 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	19	BGB
27 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	18	MitchAlsup1
28 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	17	BGB
28 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	16	MitchAlsup1
28 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	15	BGB
29 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	14	MitchAlsup1
30 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	13	BGB
30 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	1	MitchAlsup1
1 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	11	Robert Finch
1 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	1	MitchAlsup1
3 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	9	BGB
4 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	2	Robert Finch
4 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	1	BGB
6 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	6	MitchAlsup1
8 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	5	BGB
8 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	4	MitchAlsup1
9 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	3	BGB
9 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	1	Stefan Monnier
9 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	1	MitchAlsup1
27 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	16	MitchAlsup1
27 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	2	BGB
28 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	1	MitchAlsup1
28 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	13	Paul A. Clayton
30 Sep 24	Re: Misc: BGBCC targeting RV64G, initial results...	12	MitchAlsup1
16 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	11	Paul A. Clayton
16 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	9	Stephen Fuld
16 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	1	Thomas Koenig
16 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	7	BGB
17 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	6	MitchAlsup1
17 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	5	BGB
18 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	4	MitchAlsup1
21 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	3	BGB
21 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	2	MitchAlsup1
22 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	1	BGB
16 Oct 24	Re: Misc: BGBCC targeting RV64G, initial results...	1	MitchAlsup1