onnxruntime/lora/adapter_format/README.md
This directory contains ONNXRuntime Lora Parameter format schema and the generated C++ header file for the Lora Parameters file format. This file format is defined as a means to deliver Lora parameters so it can read by ONNXRuntime C++ code.
The format is generally designed to house a single Lora adapter with named Lora parameters.
ONNXRuntime Lora Parameter file format schema uses the FlatBuffers serialization library.
Please do not directly modify the generated C++ header file for ONNXRuntime Lora Parameter file format.
Use flatc compiler for the purpose.
e.g.
It is possible to use another flatc as well, e.g., from a separate installation.
To update the flatbuffers schemas and generated files:
Run compile_schema.py to generate the C++ bindings.
python onnxruntime/lora/lora_format/compile_schema.py --flatc <path to flatc>
In lora_format_version.h, see IsLoraParameterslVersionSupported() for the supported versions and
kLoraParametersVersion for the current version.
History begins.
Initial support for FlatBuffers that Lora Parameters support. This includes a definition of Tensor entity so it can be saved in a tensor per file format.