packages/training/scripts/templates/model_card_quant.md
{quant_short_name} weights for the
{base_eliza_repo_id}full-precision Eliza fine-tune.
{quant_blurb}
| field | value |
|---|---|
| Quantization scheme | {quant_scheme_name} |
| Bit-width (weights) | {quant_bits_weights} |
| Bit-width (KV cache) | {quant_bits_kv} |
| Reference paper | {quant_paper} |
| Sibling base repo | {base_eliza_repo_id} |
| Recommended runtime | {quant_runtime} |
| Approximate file size | {quant_file_size} |
| Target hardware | {quant_target_hw} |
| Expected quality regression vs base | {quant_quality_delta} |
{quant_inference_block}
{base_eliza_repo_id}elizaos/eliza-native-v1-sft.elizaos/eliza-1-training
(scripts/quantization/).Evaluated against the base bf16 sibling. Numbers below are absolute scores on the eliza-1 evaluation suite; subtract from the base card's scores for the regression delta.
{eval_table}
Apache-2.0, inherited from the base
checkpoint recorded in the sibling model metadata. Same terms as the
{base_eliza_repo_id} sibling.