Liste des Groupes | Revenir à c arch |
Stephen Fuld <sfuld@alumni.cmu.edu.invalid> wrote:Anytime one removes more "MOVs and saves and restore" instructionsOn 8/14/2024 5:54 PM, Brett wrote:>Brett <ggtgp@yahoo.com> wrote:MitchAlsup1 <mitchalsup@aol.com> wrote:>On Mon, 12 Aug 2024 2:23:00 +0000, Brett wrote:>
>BGB <cr88192@gmail.com> wrote:>>>
Another benefit of 64 registers is more inlining removing calls.
>
A call can cause a significant amount of garbage code all around that
call,
as it splits your function and burns registers that would otherwise get
used.
What I see around calls is MOV instructions grabbing arguments from the
preserved registers and putting return values in to the proper preserved
register. Inlining does get rid of these MOVs, but what else ??
For middling functions, I spent my time optimizing heavy code, the 10%
that
matters.
>
The first half of a big function will have some state that has to be
reloaded after a call, or worse yet saved and reloaded.
>
Inlining is limited by register count, with twice the registers the
compiler will generate far larger leaf calls with less call depth. Which
removes more of those MOVs.
>I can understand the reluctance to go to 6 bit register specifiers, it>
burns up your opcode space and makes encoding everything more difficult.
I am on record as stating the proper number of bits in an instruction-
specifier is 34-bits. This is after designing Mc88K ISA, doing 3
generations
of SPARC chips, 7 years of x86-64, and Samsung GPU (and my own efforts)
Making the registers 6-bits would increase that count to 36-bits.
My 66000 hurts less with 6-bits as more constants bits get moved to
extension words, which is almost free by most metrics.
>
Only My 66000 can reasonably be able to implement 6-bits register
specifiers.
The market is yours for the taking.
>
6-bits will make you stand out and get noticed.
>
The only down side I see is a few percent in code density.
Actually due to the removal of MOVs and reloads the code density may be
basically the same.
>When HW is doing the saves, the saves can be performed whileAlso longer context switch times, as more registers to save/restore.>
The save is should be free, as the load from ram is so slow.
If the context is time critical it should be written to use theWhen HW is doing the saves, it does them in a known order and
registers that are reloaded first, first. In which case the code
could start doing work in the same amount of time regardless of
register count. (I doubt the CPU design is actually that smart,
or that the people that program the interrupts are.)
Les messages affichés proviennent d'usenet.