Object Detection evaluation using the 2014 COCO minival dataset.

This binary evaluates the following parameters of TFLite models trained for the bounding box-based COCO Object Detection task:

Native pre-processing latency
Inference latency
mean Average Precision (mAP) averaged across IoU thresholds from 0.5 to 0.95 (in increments of 0.05) and all object categories.

The binary takes the path to validation images and a ground truth proto file as inputs, along with the model and inference-specific parameters such as delegate and number of threads. It outputs the metrics to std-out as follows:

Num evaluation runs: 8059
Preprocessing latency: avg=16589.9(us), std_dev=0(us)
Inference latency: avg=85169.7(us), std_dev=505(us)
Average Precision [IOU Threshold=0.5]: 0.349581
Average Precision [IOU Threshold=0.55]: 0.330213
Average Precision [IOU Threshold=0.6]: 0.307694
Average Precision [IOU Threshold=0.65]: 0.281025
Average Precision [IOU Threshold=0.7]: 0.248507
Average Precision [IOU Threshold=0.75]: 0.210295
Average Precision [IOU Threshold=0.8]: 0.165011
Average Precision [IOU Threshold=0.85]: 0.116215
Average Precision [IOU Threshold=0.9]: 0.0507883
Average Precision [IOU Threshold=0.95]: 0.0064338
Overall mAP: 0.206576

To run the binary, please follow the Preprocessing section to prepare the data, and then execute the commands in the Running the binary section.

Parameters

The binary takes the following parameters:

model_file : string
Path to the TFlite model file. It should accept images preprocessed in the Inception format, and the output signature should be similar to the SSD MobileNet model:
model_output_labels: string
Path to labels that correspond to output of model. E.g. in case of COCO-trained SSD model, this is the path to a file where each line contains a class detected by the model in correct order, starting from 'background'.

A sample model & label-list combination for COCO can be downloaded from the TFLite Hosted models page.

ground_truth_images_path: string
The path to the directory containing ground truth images.
ground_truth_proto: string
Path to file containing tflite::evaluation::ObjectDetectionGroundTruth proto in text format. If left empty, mAP numbers are not provided.

The above two parameters can be prepared using the preprocess_coco_minival script included in this folder.

output_file_path: string
The final metrics are dumped into output_file_path as a string-serialized instance of tflite::evaluation::EvaluationStageMetrics.

The following optional parameters can be used to modify the inference runtime:

num_interpreter_threads: int (default=1)
This modifies the number of threads used by the TFLite Interpreter for inference.
delegate: string
If provided, tries to use the specified delegate for accuracy evaluation. Valid values: "nnapi", "gpu", "hexagon".

NOTE: Please refer to the Hexagon delegate documentation for instructions on how to set it up for the Hexagon delegate. The tool assumes that libhexagon_interface.so and Qualcomm libraries lie in /data/local/tmp.

This script also supports runtime/delegate arguments introduced by the delegate registrar. If there is any conflict (for example, num_threads vs num_interpreter_threads here), the parameters of this script are given precedence.

When multiple delegates are specified to be used in the commandline flags via the support of delegate registrar, the order of delegates applied to the TfLite runtime will be same as their enabling commandline flag is specified. For example, "--use_xnnpack=true --use_gpu=true" means applying the XNNPACK delegate first, and then the GPU delegate secondly. In comparison, "--use_gpu=true --use_xnnpack=true" means applying the GPU delegate first, and then the XNNPACK delegate secondly.

Note, one could specify --help when launching the binary to see the full list of supported arguments.

Debug Mode

The script also supports a debug mode with the following parameter:

debug_mode: boolean
Whether to enable debug mode. Per-image predictions are written to std-out along with metrics.

Image-wise predictions are output as follows:

======================================================

Image: image_1.jpg

Object [0]
  Score: 0.585938
  Class-ID: 5
  Bounding Box:
    Normalized Top: 0.23103
    Normalized Bottom: 0.388524
    Normalized Left: 0.559144
    Normalized Right: 0.763928
Object [1]
  Score: 0.574219
  Class-ID: 5
  Bounding Box:
    Normalized Top: 0.269571
    Normalized Bottom: 0.373971
    Normalized Left: 0.613175
    Normalized Right: 0.760507
======================================================

Image: image_2.jpg
...

This mode lets you debug the output of an object detection model that isn't necessarily trained on the COCO dataset (by leaving ground_truth_proto empty). The model output signature would still need to follow the convention mentioned above, and you we still need an output labels file.

Preprocessing the minival dataset

To compute mAP in a consistent and interpretable way, we utilize the same 2014 COCO 'minival' dataset that is mentioned in the Tensorflow detection model zoo.

The links to download the components of the validation set are:

2014 COCO Validation Images
2014 COCO Train/Val annotations: Out of the files from this zip, we only require instances_val2014.json.
minival Image IDs : Only applies to the 2014 validation set. You would need to copy the contents into a text file.

Since evaluation has to be performed on-device, we first filter the above data and extract a subset that only contains the images & ground-truth bounding boxes we need.

To do so, we utilize the preprocess_coco_minival Python binary as follows:

bazel run //tensorflow/lite/tools/evaluation/tasks/coco_object_detection:preprocess_coco_minival -- \
  --images_folder=/path/to/val2014 \
  --instances_file=/path/to/instances_val2014.json \
  --allowlist_file=/path/to/minival_allowlist.txt \
  --output_folder=/path/to/output/folder

Optionally, you can specify a --num_images=N argument, to preprocess the first N image files (based on sorted list of filenames).

The script generates the following within the output folder:

images/: the resulting subset of the 2014 COCO Validation images.
ground_truth.pb: a .pb (binary-format proto) file holding tflite::evaluation::ObjectDetectionGroundTruth corresponding to image subset.

Running the binary

On Android

(0) Refer to https://github.com/tensorflow/tensorflow/tree/master/tensorflow/examples/android for configuring NDK and SDK.

(1) Build using the following command:

bazel build -c opt \
  --config=android_arm64 \
  --cxxopt='--std=c++17' \
  //tensorflow/lite/tools/evaluation/tasks/coco_object_detection:run_eval

(2) Connect your phone. Push the binary to your phone with adb push (make the directory if required):

adb push bazel-bin/third_party/tensorflow/lite/tools/evaluation/tasks/coco_object_detection/run_eval /data/local/tmp

(3) Make the binary executable.

adb shell chmod +x /data/local/tmp/run_eval

(4) Push the TFLite model that you need to test:

adb push ssd_mobilenet_v1_float.tflite /data/local/tmp

(5) Push the model labels text file to device.

adb push /path/to/labelmap.txt /data/local/tmp/labelmap.txt

(6) Preprocess the dataset using the instructions given in the Preprocessing section and push the data (folder containing images & ground truth proto) to the device:

adb shell mkdir /data/local/tmp/coco_validation && \
adb push /path/to/output/folder /data/local/tmp/coco_validation

(7) Run the binary.

adb shell /data/local/tmp/run_eval \
  --model_file=/data/local/tmp/ssd_mobilenet_v1_float.tflite \
  --ground_truth_images_path=/data/local/tmp/coco_validation/images \
  --ground_truth_proto=/data/local/tmp/coco_validation/ground_truth.pb \
  --model_output_labels=/data/local/tmp/labelmap.txt \
  --output_file_path=/data/local/tmp/coco_output.txt

Optionally, you could also pass in the --num_interpreter_threads & --delegate arguments to run with different configurations.

On Desktop

(1) Build and run using the following command:

bazel run -c opt \
  -- \
  //tensorflow/lite/tools/evaluation/tasks/coco_object_detection:run_eval \
  --model_file=/path/to/ssd_mobilenet_v1_float.tflite \
  --ground_truth_images_path=/path/to/images \
  --ground_truth_proto=/path/to/ground_truth.pb \
  --model_output_labels=/path/to/labelmap.txt \
  --output_file_path=/path/to/coco_output.txt