README

qwencoder-eval/instruct/eval-dev-quality/docs/reports/v0.6/README.md

This evaluation ran over several days and was assembled from several sub-evaluations. Because the logs are scattered across those sub-evaluations, we provide only the CSV results for now.

The benchmark consisted of 5 runs, except for the code-repair task, which has only one usable run because of a bug that has since been fixed in later versions.