Liste des Groupes | Revenir à cl c |
On Mon, 25 Mar 2024 18:10:23 +0000There are two aspects to be considered:
bart <bc@freeuk.com> wrote:
On 25/03/2024 16:51, Michael S wrote:Your explanation exactly matches what I was imagining.On Mon, 25 Mar 2024 16:06:24 +0000>
bart <bc@freeuk.com> wrote:
On 25/03/2024 12:26, David Brown wrote:>On 25/03/2024 12:16, Michael S wrote:>On Sun, 24 Mar 2024 23:43:32 +0100>
David Brown <david.brown@hesbynett.no> wrote:>>
I could be wrong here, of course.
It seems, you are.
It happens - and it was not unexpected here, as I said. I don't
have all these compilers installed to test.
>
But it would be helpful if you had a /little/ more information.
If you don't know why some compilers generate binaries that have
memory mapped at 0x400000, and others do not, fair enough. I am
curious, but it's not at all important.
In the PE EXE format, the default image load base is specified in a
special header in the file:
>
Magic: 20B
Link version: 1.0
Code size: 512 200
Idata size: 1024 400
Zdata size: 512
Entry point: 4096 1000 in data:0
Code base: 4096
Image base: 4194304 400000
Section align: 4096
>
By convention it is at 0x40'0000 (I've no idea why).
>
More recently, dynamic loading, regardless of what it says in the
PE header, has become popular with linkers. So, while there is
still a fixed value in the Image Base file, which might be
0x140000000, it gets loaded at some random address, usually in
high memory above 2GB.
>
I don't know what's responsible for that, but presumably the OS
must be in on the act.
>
To make this possible, both for loading above 2GB, and for loading
at an address not known by the linker, the code inside the EXE
must be position-independent, and have relocation info for any
absolute 64-bit static addresses. 32-bit static addresses won't
work.
I don't understand why you say that EXE must be
position-independent. I never learned PE format in depth (and
learned only absolute minimum of elf, just enough to be able to
load images in simple embedded scenario), but my impression always
was that PE EXE contains plenty of relocation info for a loader, so
it (loader) can modify (I think professional argot uses the word
'fix') non-PIC at load time to run at any chosen position.
Am I wrong about it?
>
A PE EXE designed to run only at the image base given won't be
position-independent, so it can't be moved anywwhere else.
>
There isn't enough info to make it possible, especially before
position-independent addressing modes for x64 came along (that is,
using offset to the RIP intruction pointer instead of 32-bit absolute
addresses).
>
Take this C program:
>
int abc;
int* ptr = &abc;
>
int main(void) {
int x;
x = abc;
}
>
Some of the assembly generated is this:
>
abc: resb 4
>
ptr: dq abc
...
mov eax, [abc]
>
That last reference is an absolute 32-bit address, for example it
might have address 0x00403000 when loaded at 0x400000.
>
If the program is instead loaded at 0x78230000, there is no reloc
info to tell it that that particular 32-bit value, plus the 64-bit
field initialising ptr, must be adjusted.
>
RIP-relative addressing (I think sometimes called PIC), can fix that
second reference:
>
mov eax, [rip:abc]
>
But it only works for code, not data; that initialisation is still
absolute.
>
When a DLL is generated instead, those will need to be moved (to
avoid multiple DLLs all based at the same address). In that case,
base-relocation tables are needed: a list of addresses that contain a
field that needs relocating, and what type and size of reloc is
needed.
>
The same info is needed for EXE if it contains flags saying that the
EXE could be loaded at an arbitrary adddress.
>
The technology for relocation of non-PIC code is already here, in file
format definitions and in OS loader code. The linker or the part of
compiler that serves the role of linker can decide to not generate
required tables. Operation in such mode will have small benefits in EXE
size and in quicker load time, but IMHO nowadays it should be used
rarely, only in special situations rather than serve as a default of the
tool.
Les messages affichés proviennent d'usenet.