projects/Panoptic-DeepLab/README.md
Bowen Cheng, Maxwell D. Collins, Yukun Zhu, Ting Liu, Thomas S. Huang, Hartwig Adam, Liang-Chieh Chen
[arXiv] [BibTeX] [Reference implementation]
Install Detectron2 following the instructions. To use cityscapes, prepare data follow the tutorial.
To train a model with 8 GPUs run:
cd /path/to/detectron2/projects/Panoptic-DeepLab
python train_net.py --config-file configs/Cityscapes-PanopticSegmentation/panoptic_deeplab_R_52_os16_mg124_poly_90k_bs32_crop_512_1024_dsconv.yaml --num-gpus 8
Model evaluation can be done similarly:
cd /path/to/detectron2/projects/Panoptic-DeepLab
python train_net.py --config-file configs/Cityscapes-PanopticSegmentation/panoptic_deeplab_R_52_os16_mg124_poly_90k_bs32_crop_512_1024_dsconv.yaml --eval-only MODEL.WEIGHTS /path/to/model_checkpoint
If you want to benchmark the network speed without post-processing, you can run the evaluation script with MODEL.PANOPTIC_DEEPLAB.BENCHMARK_NETWORK_SPEED True:
cd /path/to/detectron2/projects/Panoptic-DeepLab
python train_net.py --config-file configs/Cityscapes-PanopticSegmentation/panoptic_deeplab_R_52_os16_mg124_poly_90k_bs32_crop_512_1024_dsconv.yaml --eval-only MODEL.WEIGHTS /path/to/model_checkpoint MODEL.PANOPTIC_DEEPLAB.BENCHMARK_NETWORK_SPEED True
Cityscapes models are trained with ImageNet pretraining.
<table><tbody> <!-- START TABLE --> <!-- TABLE HEADER --> <th valign="bottom">Method</th> <th valign="bottom">Backbone</th> <th valign="bottom">Output resolution</th> <th valign="bottom">PQ</th> <th valign="bottom">SQ</th> <th valign="bottom">RQ</th> <th valign="bottom">mIoU</th> <th valign="bottom">AP</th> <th valign="bottom">Memory (M)</th> <th valign="bottom">model id</th> <th valign="bottom">download</th> <!-- TABLE BODY --> <tr><td align="left">Panoptic-DeepLab</td> <td align="center">R50-DC5</td> <td align="center">1024×2048</td> <td align="center"> 58.6 </td> <td align="center"> 80.9 </td> <td align="center"> 71.2 </td> <td align="center"> 75.9 </td> <td align="center"> 29.8 </td> <td align="center"> 8668 </td> <td align="center"> - </td> <td align="center">model | metrics</td> </tr> <tr><td align="left"><a href="configs/Cityscapes-PanopticSegmentation/panoptic_deeplab_R_52_os16_mg124_poly_90k_bs32_crop_512_1024.yaml">Panoptic-DeepLab</a></td> <td align="center">R52-DC5</td> <td align="center">1024×2048</td> <td align="center"> 60.3 </td> <td align="center"> 81.5 </td> <td align="center"> 72.9 </td> <td align="center"> 78.2 </td> <td align="center"> 33.2 </td> <td align="center"> 9682 </td> <td align="center"> 30841561 </td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/PanopticDeepLab/Cityscapes-PanopticSegmentation/panoptic_deeplab_R_52_os16_mg124_poly_90k_bs32/model_final_bd324a.pkl ">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/PanopticDeepLab/Cityscapes-PanopticSegmentation/panoptic_deeplab_R_52_os16_mg124_poly_90k_bs32/metrics.json ">metrics</a></td> </tr> <tr><td align="left"><a href="configs/Cityscapes-PanopticSegmentation/panoptic_deeplab_R_52_os16_mg124_poly_90k_bs32_crop_512_1024_dsconv.yaml">Panoptic-DeepLab (DSConv)</a></td> <td align="center">R52-DC5</td> <td align="center">1024×2048</td> <td align="center"> 60.3 </td> <td align="center"> 81.0 </td> <td align="center"> 73.2 </td> <td align="center"> 78.7 </td> <td align="center"> 32.1 </td> <td align="center"> 10466 </td> <td align="center"> 33148034 </td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/PanopticDeepLab/Cityscapes-PanopticSegmentation/panoptic_deeplab_R_52_os16_mg124_poly_90k_bs32_crop_512_1024_dsconv/model_final_23d03a.pkl ">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/PanopticDeepLab/Cityscapes-PanopticSegmentation/panoptic_deeplab_R_52_os16_mg124_poly_90k_bs32_crop_512_1024_dsconv/metrics.json ">metrics</a></td> </tr> </tbody></table>Note:
res5.COCO models are trained with ImageNet pretraining on 16 V100s.
<table><tbody> <!-- START TABLE --> <!-- TABLE HEADER --> <th valign="bottom">Method</th> <th valign="bottom">Backbone</th> <th valign="bottom">Output resolution</th> <th valign="bottom">PQ</th> <th valign="bottom">SQ</th> <th valign="bottom">RQ</th> <th valign="bottom">Box AP</th> <th valign="bottom">Mask AP</th> <th valign="bottom">Memory (M)</th> <th valign="bottom">model id</th> <th valign="bottom">download</th> <!-- TABLE BODY --> <tr><td align="left"><a href="configs/COCO-PanopticSegmentation/panoptic_deeplab_R_52_os16_mg124_poly_200k_bs64_crop_640_640_coco_dsconv.yaml">Panoptic-DeepLab (DSConv)</a></td> <td align="center">R52-DC5</td> <td align="center">640×640</td> <td align="center"> 35.5 </td> <td align="center"> 77.3 </td> <td align="center"> 44.7 </td> <td align="center"> 18.6 </td> <td align="center"> 19.7 </td> <td align="center"> </td> <td align="center"> 246448865 </td> <td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/PanopticDeepLab/COCO-PanopticSegmentation/panoptic_deeplab_R_52_os16_mg124_poly_200k_bs64_crop_640_640_coco_dsconv/model_final_5e6da2.pkl ">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/PanopticDeepLab/COCO-PanopticSegmentation/panoptic_deeplab_R_52_os16_mg124_poly_200k_bs64_crop_640_640_coco_dsconv/metrics.json ">metrics</a></td> </tr> </tbody></table>Note:
res5.If you use Panoptic-DeepLab, please use the following BibTeX entry.
@inproceedings{cheng2020panoptic,
title={Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation},
author={Cheng, Bowen and Collins, Maxwell D and Zhu, Yukun and Liu, Ting and Huang, Thomas S and Adam, Hartwig and Chen, Liang-Chieh},
booktitle={CVPR},
year={2020}
}
@inproceedings{cheng2019panoptic,
title={Panoptic-DeepLab},
author={Cheng, Bowen and Collins, Maxwell D and Zhu, Yukun and Liu, Ting and Huang, Thomas S and Adam, Hartwig and Chen, Liang-Chieh},
booktitle={ICCV COCO + Mapillary Joint Recognition Challenge Workshop},
year={2019}
}