MODEL_ZOO.md
This file documents a large collection of baselines trained with detectron2 in Sep-Oct, 2019. All numbers were obtained on Big Basin servers with 8 NVIDIA V100 GPUs & NVLink. The speed numbers are periodically updated with latest PyTorch/CUDA/cuDNN versions. You can access these models from code using detectron2.model_zoo APIs.
In addition to these official baseline models, you can find more models in projects/.
tools/train_net.py with the corresponding yaml config file,
or tools/lazyconfig_train_net.py for python config files.metrics file.
Training speed for multi-machine jobs is not provided.tools/train_net.py --eval-only, or inference_on_dataset(),
with batch size 1 in detectron2 directly.
Measuring it with custom code may introduce other overhead.
Actual deployment in production should in general be faster than the given inference
speed due to more optimizations.metrics for each model.All COCO models were trained on train2017 and evaluated on val2017.
The default settings are not directly comparable with Detectron's standard settings. For example, our default training data augmentation uses scale jittering in addition to horizontal flipping.
To make fair comparisons with Detectron's settings, see Detectron1-Comparisons for accuracy comparison, and benchmarks for speed comparison.
For Faster/Mask R-CNN, we provide baselines based on 3 different backbone combinations:
Most models are trained with the 3x schedule (~37 COCO epochs). Although 1x models are heavily under-trained, we provide some ResNet-50 models with the 1x (~12 COCO epochs) training schedule for comparison when doing quick research iteration.
It's common to initialize from backbone models pre-trained on ImageNet classification tasks. The following backbone models are available:
Note that the above models have different format from those provided in Detectron: we do not fuse BatchNorm into an affine layer. Pretrained models in Detectron's format can still be used. For example:
These models require slightly different settings regarding normalization and architecture. See the model zoo configs for reference.
All models available for download through this document are licensed under the Creative Commons Attribution-ShareAlike 3.0 license.
The following baselines of COCO Instance Segmentation with Mask R-CNN are generated using a longer training schedule and large-scale jitter as described in Google's Simple Copy-Paste Data Augmentation paper. These models are trained from scratch using random initialization. These baselines exceed the previous Mask R-CNN baselines.
In the following table, one epoch consists of training on 118000 COCO images.
<table><tbody> <!-- START TABLE --> <!-- TABLE HEADER --> <th valign="bottom">Name</th> <th valign="bottom">epochs</th> <th valign="bottom">train time (s/im)</th> <th valign="bottom">inference time (s/im)</th> <th valign="bottom">box AP</th> <th valign="bottom">mask AP</th> <th valign="bottom">model id</th> <th valign="bottom">download</th> <!-- TABLE BODY --> <!-- ROW: mask_rcnn_R_50_FPN_100ep_LSJ --> <tr><td align="left"><a href="configs/new_baselines/mask_rcnn_R_50_FPN_100ep_LSJ.py">R50-FPN</a></td> <td align="center">100</td> <td align="center">0.376</td> <td align="center">0.069</td> <td align="center">44.6</td> <td align="center">40.3</td> <td align="center">42047764</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_R_50_FPN_100ep_LSJ/42047764/model_final_bb69de.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_R_50_FPN_100ep_LSJ/42047764/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_R_50_FPN_200ep_LSJ --> <tr><td align="left"><a href="configs/new_baselines/mask_rcnn_R_50_FPN_200ep_LSJ.py">R50-FPN</a></td> <td align="center">200</td> <td align="center">0.376</td> <td align="center">0.069</td> <td align="center">46.3</td> <td align="center">41.7</td> <td align="center">42047638</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_R_50_FPN_200ep_LSJ/42047638/model_final_89a8d3.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_R_50_FPN_200ep_LSJ/42047638/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_R_50_FPN_400ep_LSJ --> <tr><td align="left"><a href="configs/new_baselines/mask_rcnn_R_50_FPN_400ep_LSJ.py">R50-FPN</a></td> <td align="center">400</td> <td align="center">0.376</td> <td align="center">0.069</td> <td align="center">47.4</td> <td align="center">42.5</td> <td align="center">42019571</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_R_50_FPN_400ep_LSJ/42019571/model_final_14d201.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_R_50_FPN_400ep_LSJ/42019571/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_R_101_FPN_100ep_LSJ --> <tr><td align="left"><a href="configs/new_baselines/mask_rcnn_R_101_FPN_100ep_LSJ.py">R101-FPN</a></td> <td align="center">100</td> <td align="center">0.518</td> <td align="center">0.073</td> <td align="center">46.4</td> <td align="center">41.6</td> <td align="center">42025812</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_R_101_FPN_100ep_LSJ/42025812/model_final_4f7b58.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_R_101_FPN_100ep_LSJ/42025812/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_R_101_FPN_200ep_LSJ --> <tr><td align="left"><a href="configs/new_baselines/mask_rcnn_R_101_FPN_200ep_LSJ.py">R101-FPN</a></td> <td align="center">200</td> <td align="center">0.518</td> <td align="center">0.073</td> <td align="center">48.0</td> <td align="center">43.1</td> <td align="center">42131867</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_R_101_FPN_200ep_LSJ/42131867/model_final_0bb7ae.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_R_101_FPN_200ep_LSJ/42131867/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_R_101_FPN_400ep_LSJ --> <tr><td align="left"><a href="configs/new_baselines/mask_rcnn_R_101_FPN_400ep_LSJ.py">R101-FPN</a></td> <td align="center">400</td> <td align="center">0.518</td> <td align="center">0.073</td> <td align="center">48.9</td> <td align="center">43.7</td> <td align="center">42073830</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_R_101_FPN_400ep_LSJ/42073830/model_final_f96b26.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_R_101_FPN_400ep_LSJ/42073830/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_regnetx_4gf_dds_FPN_100ep_LSJ --> <tr><td align="left"><a href="configs/new_baselines/mask_rcnn_regnetx_4gf_dds_FPN_100ep_LSJ.py">regnetx_4gf_dds_FPN</a></td> <td align="center">100</td> <td align="center">0.474</td> <td align="center">0.071</td> <td align="center">46.0</td> <td align="center">41.3</td> <td align="center">42047771</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_regnetx_4gf_dds_FPN_100ep_LSJ/42047771/model_final_b7fbab.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_regnetx_4gf_dds_FPN_100ep_LSJ/42047771/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_regnetx_4gf_dds_FPN_200ep_LSJ --> <tr><td align="left"><a href="configs/new_baselines/mask_rcnn_regnetx_4gf_dds_FPN_200ep_LSJ.py">regnetx_4gf_dds_FPN</a></td> <td align="center">200</td> <td align="center">0.474</td> <td align="center">0.071</td> <td align="center">48.1</td> <td align="center">43.1</td> <td align="center">42132721</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_regnetx_4gf_dds_FPN_200ep_LSJ/42132721/model_final_5d87c1.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_regnetx_4gf_dds_FPN_200ep_LSJ/42132721/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_regnetx_4gf_dds_FPN_400ep_LSJ --> <tr><td align="left"><a href="configs/new_baselines/mask_rcnn_regnetx_4gf_dds_FPN_400ep_LSJ.py">regnetx_4gf_dds_FPN</a></td> <td align="center">400</td> <td align="center">0.474</td> <td align="center">0.071</td> <td align="center">48.6</td> <td align="center">43.5</td> <td align="center">42025447</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_regnetx_4gf_dds_FPN_400ep_LSJ/42025447/model_final_f1362d.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_regnetx_4gf_dds_FPN_400ep_LSJ/42025447/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_regnety_4gf_dds_FPN_100ep_LSJ --> <tr><td align="left"><a href="configs/new_baselines/mask_rcnn_regnety_4gf_dds_FPN_100ep_LSJ.py">regnety_4gf_dds_FPN</a></td> <td align="center">100</td> <td align="center">0.487</td> <td align="center">0.073</td> <td align="center">46.1</td> <td align="center">41.6</td> <td align="center">42047784</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_regnety_4gf_dds_FPN_100ep_LSJ/42047784/model_final_6ba57e.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_regnety_4gf_dds_FPN_100ep_LSJ/42047784/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_regnety_4gf_dds_FPN_200ep_LSJ --> <tr><td align="left"><a href="configs/new_baselines/mask_rcnn_regnety_4gf_dds_FPN_200ep_LSJ.py">regnety_4gf_dds_FPN</a></td> <td align="center">200</td> <td align="center">0.487</td> <td align="center">0.072</td> <td align="center">47.8</td> <td align="center">43.0</td> <td align="center">42047642</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_regnety_4gf_dds_FPN_200ep_LSJ/42047642/model_final_27b9c1.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_regnety_4gf_dds_FPN_200ep_LSJ/42047642/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_regnety_4gf_dds_FPN_400ep_LSJ --> <tr><td align="left"><a href="configs/new_baselines/mask_rcnn_regnety_4gf_dds_FPN_400ep_LSJ.py">regnety_4gf_dds_FPN</a></td> <td align="center">400</td> <td align="center">0.487</td> <td align="center">0.072</td> <td align="center">48.2</td> <td align="center">43.3</td> <td align="center">42045954</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_regnety_4gf_dds_FPN_400ep_LSJ/42045954/model_final_ef3a80.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/new_baselines/mask_rcnn_regnety_4gf_dds_FPN_400ep_LSJ/42045954/metrics.json">metrics</a></td> </tr> </tbody></table>Mask R-CNN baselines on the LVIS dataset, v0.5. These baselines are described in Table 3(c) of the LVIS paper.
NOTE: the 1x schedule here has the same amount of iterations as the COCO 1x baselines. They are roughly 24 epochs of LVISv0.5 data. The final results of these configs have large variance across different runs.
<!-- ./gen_html_table.py --config 'LVISv0.5-InstanceSegmentation/mask*50*' 'LVISv0.5-InstanceSegmentation/mask*101*' --name R50-FPN R101-FPN X101-FPN --fields lr_sched train_speed inference_speed mem box_AP mask_AP --> <table><tbody> <!-- START TABLE --> <!-- TABLE HEADER --> <th valign="bottom">Name</th> <th valign="bottom">lr sched</th> <th valign="bottom">train time (s/iter)</th> <th valign="bottom">inference time (s/im)</th> <th valign="bottom">train mem (GB)</th> <th valign="bottom">box AP</th> <th valign="bottom">mask AP</th> <th valign="bottom">model id</th> <th valign="bottom">download</th> <!-- TABLE BODY --> <!-- ROW: mask_rcnn_R_50_FPN_1x --> <tr><td align="left"><a href="configs/LVISv0.5-InstanceSegmentation/mask_rcnn_R_50_FPN_1x.yaml">R50-FPN</a></td> <td align="center">1x</td> <td align="center">0.292</td> <td align="center">0.107</td> <td align="center">7.1</td> <td align="center">23.6</td> <td align="center">24.4</td> <td align="center">144219072</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/LVISv0.5-InstanceSegmentation/mask_rcnn_R_50_FPN_1x/144219072/model_final_571f7c.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/LVISv0.5-InstanceSegmentation/mask_rcnn_R_50_FPN_1x/144219072/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_R_101_FPN_1x --> <tr><td align="left"><a href="configs/LVISv0.5-InstanceSegmentation/mask_rcnn_R_101_FPN_1x.yaml">R101-FPN</a></td> <td align="center">1x</td> <td align="center">0.371</td> <td align="center">0.114</td> <td align="center">7.8</td> <td align="center">25.6</td> <td align="center">25.9</td> <td align="center">144219035</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/LVISv0.5-InstanceSegmentation/mask_rcnn_R_101_FPN_1x/144219035/model_final_824ab5.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/LVISv0.5-InstanceSegmentation/mask_rcnn_R_101_FPN_1x/144219035/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_X_101_32x8d_FPN_1x --> <tr><td align="left"><a href="configs/LVISv0.5-InstanceSegmentation/mask_rcnn_X_101_32x8d_FPN_1x.yaml">X101-FPN</a></td> <td align="center">1x</td> <td align="center">0.712</td> <td align="center">0.151</td> <td align="center">10.2</td> <td align="center">26.7</td> <td align="center">27.1</td> <td align="center">144219108</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/LVISv0.5-InstanceSegmentation/mask_rcnn_X_101_32x8d_FPN_1x/144219108/model_final_5e3439.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/LVISv0.5-InstanceSegmentation/mask_rcnn_X_101_32x8d_FPN_1x/144219108/metrics.json">metrics</a></td> </tr> </tbody></table>Simple baselines for
Ablations for Deformable Conv and Cascade R-CNN:
<!-- ./gen_html_table.py --config 'COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_1x.yaml' 'Misc/*R_50_FPN_1x_dconv*' 'Misc/cascade*1x.yaml' 'COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x.yaml' 'Misc/*R_50_FPN_3x_dconv*' 'Misc/cascade*3x.yaml' --name "Baseline R50-FPN" "Deformable Conv" "Cascade R-CNN" "Baseline R50-FPN" "Deformable Conv" "Cascade R-CNN" --fields lr_sched train_speed inference_speed mem box_AP mask_AP --> <table><tbody> <!-- START TABLE --> <!-- TABLE HEADER --> <th valign="bottom">Name</th> <th valign="bottom">lr sched</th> <th valign="bottom">train time (s/iter)</th> <th valign="bottom">inference time (s/im)</th> <th valign="bottom">train mem (GB)</th> <th valign="bottom">box AP</th> <th valign="bottom">mask AP</th> <th valign="bottom">model id</th> <th valign="bottom">download</th> <!-- TABLE BODY --> <!-- ROW: mask_rcnn_R_50_FPN_1x --> <tr><td align="left"><a href="configs/COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_1x.yaml">Baseline R50-FPN</a></td> <td align="center">1x</td> <td align="center">0.261</td> <td align="center">0.043</td> <td align="center">3.4</td> <td align="center">38.6</td> <td align="center">35.2</td> <td align="center">137260431</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_1x/137260431/model_final_a54504.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_1x/137260431/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_R_50_FPN_1x_dconv_c3-c5 --> <tr><td align="left"><a href="configs/Misc/mask_rcnn_R_50_FPN_1x_dconv_c3-c5.yaml">Deformable Conv</a></td> <td align="center">1x</td> <td align="center">0.342</td> <td align="center">0.048</td> <td align="center">3.5</td> <td align="center">41.5</td> <td align="center">37.5</td> <td align="center">138602867</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/Misc/mask_rcnn_R_50_FPN_1x_dconv_c3-c5/138602867/model_final_65c703.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/Misc/mask_rcnn_R_50_FPN_1x_dconv_c3-c5/138602867/metrics.json">metrics</a></td> </tr> <!-- ROW: cascade_mask_rcnn_R_50_FPN_1x --> <tr><td align="left"><a href="configs/Misc/cascade_mask_rcnn_R_50_FPN_1x.yaml">Cascade R-CNN</a></td> <td align="center">1x</td> <td align="center">0.317</td> <td align="center">0.052</td> <td align="center">4.0</td> <td align="center">42.1</td> <td align="center">36.4</td> <td align="center">138602847</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/Misc/cascade_mask_rcnn_R_50_FPN_1x/138602847/model_final_e9d89b.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/Misc/cascade_mask_rcnn_R_50_FPN_1x/138602847/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_R_50_FPN_3x --> <tr><td align="left"><a href="configs/COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x.yaml">Baseline R50-FPN</a></td> <td align="center">3x</td> <td align="center">0.261</td> <td align="center">0.043</td> <td align="center">3.4</td> <td align="center">41.0</td> <td align="center">37.2</td> <td align="center">137849600</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x/137849600/model_final_f10217.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x/137849600/metrics.json">metrics</a></td> </tr> <!-- ROW: mask_rcnn_R_50_FPN_3x_dconv_c3-c5 --> <tr><td align="left"><a href="configs/Misc/mask_rcnn_R_50_FPN_3x_dconv_c3-c5.yaml">Deformable Conv</a></td> <td align="center">3x</td> <td align="center">0.349</td> <td align="center">0.047</td> <td align="center">3.5</td> <td align="center">42.7</td> <td align="center">38.5</td> <td align="center">144998336</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/Misc/mask_rcnn_R_50_FPN_3x_dconv_c3-c5/144998336/model_final_821d0b.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/Misc/mask_rcnn_R_50_FPN_3x_dconv_c3-c5/144998336/metrics.json">metrics</a></td> </tr> <!-- ROW: cascade_mask_rcnn_R_50_FPN_3x --> <tr><td align="left"><a href="configs/Misc/cascade_mask_rcnn_R_50_FPN_3x.yaml">Cascade R-CNN</a></td> <td align="center">3x</td> <td align="center">0.328</td> <td align="center">0.053</td> <td align="center">4.0</td> <td align="center">44.3</td> <td align="center">38.5</td> <td align="center">144998488</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/Misc/cascade_mask_rcnn_R_50_FPN_3x/144998488/model_final_480dd8.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/Misc/cascade_mask_rcnn_R_50_FPN_3x/144998488/metrics.json">metrics</a></td> </tr> </tbody></table>Ablations for normalization methods, and a few models trained from scratch following Rethinking ImageNet Pre-training.
(Note: The baseline uses 2fc head while the others use 4conv1fc head)
A few very large models trained for a long time, for demo purposes. They are trained using multiple machines:
<!-- ./gen_html_table.py --config 'Misc/panoptic_*dconv*' 'Misc/cascade_*152*' --name "Panoptic FPN R101" "Mask R-CNN X152" --fields inference_speed mem box_AP mask_AP PQ # manually add TTA results --> <table><tbody> <!-- START TABLE --> <!-- TABLE HEADER --> <th valign="bottom">Name</th> <th valign="bottom">inference time (s/im)</th> <th valign="bottom">train mem (GB)</th> <th valign="bottom">box AP</th> <th valign="bottom">mask AP</th> <th valign="bottom">PQ</th> <th valign="bottom">model id</th> <th valign="bottom">download</th> <!-- TABLE BODY --> <!-- ROW: panoptic_fpn_R_101_dconv_cascade_gn_3x --> <tr><td align="left"><a href="configs/Misc/panoptic_fpn_R_101_dconv_cascade_gn_3x.yaml">Panoptic FPN R101</a></td> <td align="center">0.098</td> <td align="center">11.4</td> <td align="center">47.4</td> <td align="center">41.3</td> <td align="center">46.1</td> <td align="center">139797668</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/Misc/panoptic_fpn_R_101_dconv_cascade_gn_3x/139797668/model_final_be35db.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/Misc/panoptic_fpn_R_101_dconv_cascade_gn_3x/139797668/metrics.json">metrics</a></td> </tr> <!-- ROW: cascade_mask_rcnn_X_152_32x8d_FPN_IN5k_gn_dconv --> <tr><td align="left"><a href="configs/Misc/cascade_mask_rcnn_X_152_32x8d_FPN_IN5k_gn_dconv.yaml">Mask R-CNN X152</a></td> <td align="center">0.234</td> <td align="center">15.1</td> <td align="center">50.2</td> <td align="center">44.0</td> <td align="center"></td> <td align="center">18131413</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/Misc/cascade_mask_rcnn_X_152_32x8d_FPN_IN5k_gn_dconv/18131413/model_0039999_e76410.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/Misc/cascade_mask_rcnn_X_152_32x8d_FPN_IN5k_gn_dconv/18131413/metrics.json">metrics</a></td> </tr> <!-- ROW: TTA cascade_mask_rcnn_X_152_32x8d_FPN_IN5k_gn_dconv --> <tr><td align="left">above + test-time aug.</td> <td align="center"></td> <td align="center"></td> <td align="center">51.9</td> <td align="center">45.9</td> <td align="center"></td> <td align="center"></td> <td align="center"></td> </tr> </tbody></table>