qwencoder-eval/instruct/eval-dev-quality/docs/reports/v0.5.0/gemma-2-9b-it/README.md
This report was generated by DevQualityEval benchmark in version 0.5.0.
REMARK: gemma-2-9b-it and gemma-2-27-it were originally evaluated together with the results then being split into separate folders. Therefore some logs might contain entries from "the other" gemma model.
Keep in mind that LLMs are nondeterministic. The following results just reflect a current snapshot.
The results of all models have been divided into the following categories:
The following sections list all models with their categories. The complete log of the evaluation with all outputs can be found here. Detailed scoring can be found here.
Models in this category could not be categorized.