Liste des Groupes | Revenir à c arch |
Scott Lurndal wrote:The advantage is consuming OpCode space at breathtaking speed.EricP <ThatWouldBeTelling@thevillage.com> writes:>Chris M. Thomasson wrote:>On 11/8/2024 2:56 PM, Chris M. Thomasson wrote:Arm A64 has LDXP Load Exclusive Pair of registers andPerhaps sometime tonight. Is seems like optimistic LL/SC instead ofLL/SC vs cmpxchg8b?
pessimistic CAS RMW type of logic?
STXP Store Exclusive Pair of registers looks like it can be
equivalent to cmpxchg16b (aka double-wide compare and swap).
Aarch64 also has CASP, a 128-bit atomic compare and swap
instruction.
Thanks, I missed that.
>
Any idea what is the advantage for them having all these various
LDxxx and STxxx instructions that only seem to combine a LD or ST
with a fence instruction?
Why haveBecause the memory model was not build with the notion of memory order
LDAPR Load-Acquire RCpc Register
LDAR Load-Acquire Register
LDLAR LoadLOAcquire Register
plus all the variations for byte, half, word, and pair,Blame Leslie Lamport for those requirements.
instead of just the standard LDx and a general data fence instruction?
>
The execution time of each is the same, and the main cost is the fence
synchronizing the Load Store Queue with the cache, flushing the cache
comms queue and waiting for all outstanding cache ops to finish.
Les messages affichés proviennent d'usenet.