docs/design/coreclr/botr/readytorun-platform-native-envelope.md
Up through .NET 10, ReadyToRun (R2R) uses the Windows PE format as the native envelope on every platform. Non‑Windows platforms therefore load a PE file with the .NET loader performing the required fixups and code activation.
In .NET 11, we plan to start adding support beyond the PE format. We will target support for:
crossgen2The tentative high-level design is outlined below. As we implement this support, this document should be updated with more details and the ReadyToRun overview and ReadyToRun format should be updated to reflect the changes.
Mach‑O support will only be supported for composite ReadyToRun when the target OS is macOS. It will be opt-in via a new crossgen2 flag:
--obj-format machocrossgen2 will:
RTR_HEADER export for the READYTORUN_HEADER.READYTORUN_FLAG_COMPONENT.READYTORUN_FLAG_PLATFORM_NATIVE_IMAGEcrossgen2 does not produce the final shared library. A separate SDK / build linking step must preserve the RTR_HEADER export in the final dylib.
There's a few cases in the R2R format that are not natively represented in the Mach-O format that have to be emulated. This section will describe some of the design decisions for the Mach-O R2R format.
Data moved out of __TEXT,__text:
__TEXT,__managedcode. __TEXT,__text gets special treatment by the linker and __TEXT,__managedcode matches NativeAOT.__TEXT,__constData that stays in the corresponding locations as the PE envelope:
__DATA,__data__TEXT,__textSymbol ranges are represented differently in Mach-O than other platforms. Apple linkers have issues when multiple symbols are defined at the same location. Additionally, the Mach format natively supports a "subtractor" reloc to represent the distance between two symbols. As a result, we can represent the start of the symbol range as the start symbol of the range. We can represent the size of the range we can represent as "end symbol location - start symbol location + end symbol size".
The R2R format, like the PE format, is heavily based around having RVAs emitted into the image that can be added to the base symbol of the image. The COFF object file format natively supports such a concept, and the PE format uses such a concept in the PE header. However, other formats do not natively support such a concept.
The Apple linker does provide a base symbol for the Mach format, but the base symbol depends on the output type, generally in the form __mh_<output>_header. For dylibs, the symbol is __mh_dylib_header. This symbol is located at the address returned by dlinfo and dladdr for the base address. It also points to the Mach header, which can be used to find the size of the image to bound reads of the R2R data.
As a result, we can emulate this support in the Mach format with ease:
__mh_dylib_header.__mh_dylib_header location".The runtime will be updated to handle platform-native R2R images during assembly load.
READYTORUN_FLAG_PLATFORM_NATIVE_IMAGE flag set:
a. Read OwnerCompositeExecutable value.
b. Invoke host callback with component assembly path and owner composite name.
c. On success, obtain pointer to composite READYTORUN_HEADER and use it for native method lookup / fixups.
d. On failure, fall back to IL/JIT path.The host_runtime_contract will be updated with a new callback for getting native code information.
struct native_code_context
{
size_t size; // size of this struct
const char* assembly_path; // component assembly path
const char* owner_composite_name; // name from component R2R header
};
struct native_code_data
{
size_t size; // size of this struct
void* r2r_header_ptr; // ReadyToRun header
size_t image_size; // size of the image
void* image_base; // base address where the image was loaded
};
bool get_native_code_data(
const struct native_code_context* context,
/*out*/ struct native_code_data* data
);
This leaves it to the host to do the actual load (for example, dlopen of a shared library, using something statically linked into the host itself) of the platform-native image. It is also responsible for any caching desired.