Back to Detectron2

Chart-based Dense Pose Estimation for Humans and Animals

projects/DensePose/doc/DENSEPOSE_IUV.md

0.630.1 KB
Original Source

Chart-based Dense Pose Estimation for Humans and Animals

<a name="Overview"></a> Overview

The goal of chart-based DensePose methods is to establish dense correspondences between image pixels and 3D object mesh by splitting the latter into charts and estimating for each pixel the corresponding chart index I and local chart coordinates (U, V).

<div align="center"> </div>

The charts used for human DensePose estimation are shown in Figure 1. The human body is split into 24 parts, each part is parametrized by U and V coordinates, each taking values in [0, 1].

<div align="center"> </div> <p class="image-caption"><b>Figure 1.</b> Partitioning and parametrization of human body surface.</p>

The pipeline uses Faster R-CNN with Feature Pyramid Network meta architecture outlined in Figure 2. For each detected object, the model predicts its coarse segmentation S (2 or 15 channels: foreground / background or background + 14 predefined body parts), fine segmentation I (25 channels: background + 24 predefined body parts) and local chart coordinates U and V.

<div align="center"> </div> <p class="image-caption"><b>Figure 2.</b> DensePose chart-based architecture based on Faster R-CNN with Feature Pyramid Network (FPN).</p>

<a name="Bootstrap"></a> Bootstrapping Chart-Based Models

Sanakoyeu et al., 2020 introduced a pipeline to transfer DensePose models trained on humans to proximal animal classes (chimpanzees), which is summarized in Figure 3. The training proceeds in two stages:

First, a master model is trained on data from source domain (humans with full DensePose annotation S, I, U and V) and supporting domain (animals with segmentation annotation only). Only selected animal classes are chosen from the supporting domain through category filters to guarantee the quality of target domain results. The training is done in class-agnostic manner: all selected categories are mapped to a single category (human).

Second, a student model is trained on data from source and supporting domains, as well as data from target domain obtained by applying the master model, selecting high-confidence detections and sampling the results.

<div align="center"> </div> <p class="image-caption"><b>Figure 3.</b> Domain adaptation: <i>master</i> model is trained on data from source and supporting domains to produce predictions in target domain; <i>student</i> model combines data from source and supporting domains, as well as sampled predictions from the master model on target domain to improve target domain predictions quality.</p>

Examples of pretrained master and student models are available in the Model Zoo. For more details on the bootstrapping pipeline, please see Bootstrapping Pipeline.

Datasets

For more details on datasets used for chart-based model training and validation, please refer to the DensePose Datasets page.

<a name="ModelZoo"></a> Model Zoo and Baselines

Legacy Models

Baselines trained using schedules from Güler et al, 2018

<table><tbody> <!-- START TABLE --> <!-- TABLE HEADER --> <th valign="bottom">Name</th> <th valign="bottom">lr sched</th> <th valign="bottom">train time (s/iter)</th> <th valign="bottom">inference time (s/im)</th> <th valign="bottom">train mem (GB)</th> <th valign="bottom">box AP</th> <th valign="bottom">segm AP</th> <th valign="bottom">dp. AP GPS</th> <th valign="bottom">dp. AP GPSm</th> <th valign="bottom">model id</th> <th valign="bottom">download</th> <!-- TABLE BODY --> <!-- ROW: densepose_rcnn_R_50_FPN_s1x_legacy --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_50_FPN_s1x_legacy.yaml">R_50_FPN_s1x_legacy</a></td> <td align="center">s1x</td> <td align="center">0.307</td> <td align="center">0.051</td> <td align="center">3.2</td> <td align="center">58.1</td> <td align="center">58.2</td> <td align="center">52.1</td> <td align="center">54.9</td> <td align="center">164832157</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_s1x_legacy/164832157/model_final_d366fa.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_s1x_legacy/164832157/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_101_FPN_s1x_legacy --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_101_FPN_s1x_legacy.yaml">R_101_FPN_s1x_legacy</a></td> <td align="center">s1x</td> <td align="center">0.390</td> <td align="center">0.063</td> <td align="center">4.3</td> <td align="center">59.5</td> <td align="center">59.3</td> <td align="center">53.2</td> <td align="center">56.0</td> <td align="center">164832182</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_s1x_legacy/164832182/model_final_10af0e.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_s1x_legacy/164832182/metrics.json">metrics</a></td> </tr> </tbody></table>

Improved Baselines, Original Fully Convolutional Head

These models use an improved training schedule and Panoptic FPN head from Kirillov et al, 2019.

<table><tbody> <!-- START TABLE --> <!-- TABLE HEADER --> <th valign="bottom">Name</th> <th valign="bottom">lr sched</th> <th valign="bottom">train time (s/iter)</th> <th valign="bottom">inference time (s/im)</th> <th valign="bottom">train mem (GB)</th> <th valign="bottom">box AP</th> <th valign="bottom">segm AP</th> <th valign="bottom">dp. AP GPS</th> <th valign="bottom">dp. AP GPSm</th> <th valign="bottom">model id</th> <th valign="bottom">download</th> <!-- TABLE BODY --> <!-- ROW: densepose_rcnn_R_50_FPN_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_50_FPN_s1x.yaml">R_50_FPN_s1x</a></td> <td align="center">s1x</td> <td align="center">0.359</td> <td align="center">0.066</td> <td align="center">4.5</td> <td align="center">61.2</td> <td align="center">67.2</td> <td align="center">63.7</td> <td align="center">65.3</td> <td align="center">165712039</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_s1x/165712039/model_final_162be9.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_s1x/165712039/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_101_FPN_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_101_FPN_s1x.yaml">R_101_FPN_s1x</a></td> <td align="center">s1x</td> <td align="center">0.428</td> <td align="center">0.079</td> <td align="center">5.8</td> <td align="center">62.3</td> <td align="center">67.8</td> <td align="center">64.5</td> <td align="center">66.2</td> <td align="center">165712084</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_s1x/165712084/model_final_c6ab63.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_s1x/165712084/metrics.json">metrics</a></td> </tr> </tbody></table>

<a name="ModelZooDeepLabV3"> Improved Baselines, DeepLabV3 Head

These models use an improved training schedule, Panoptic FPN head from Kirillov et al, 2019 and DeepLabV3 head from Chen et al, 2017.

<table><tbody> <!-- START TABLE --> <!-- TABLE HEADER --> <th valign="bottom">Name</th> <th valign="bottom">lr sched</th> <th valign="bottom">train time (s/iter)</th> <th valign="bottom">inference time (s/im)</th> <th valign="bottom">train mem (GB)</th> <th valign="bottom">box AP</th> <th valign="bottom">segm AP</th> <th valign="bottom">dp. AP GPS</th> <th valign="bottom">dp. AP GPSm</th> <th valign="bottom">model id</th> <th valign="bottom">download</th> <!-- TABLE BODY --> <!-- ROW: densepose_rcnn_R_50_FPN_DL_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_50_FPN_DL_s1x.yaml">R_50_FPN_DL_s1x</a></td> <td align="center">s1x</td> <td align="center">0.392</td> <td align="center">0.070</td> <td align="center">6.7</td> <td align="center">61.1</td> <td align="center">68.3</td> <td align="center">65.6</td> <td align="center">66.7</td> <td align="center">165712097</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_DL_s1x/165712097/model_final_0ed407.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_DL_s1x/165712097/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_101_FPN_DL_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_101_FPN_DL_s1x.yaml">R_101_FPN_DL_s1x</a></td> <td align="center">s1x</td> <td align="center">0.478</td> <td align="center">0.083</td> <td align="center">7.0</td> <td align="center">62.3</td> <td align="center">68.7</td> <td align="center">66.3</td> <td align="center">67.6</td> <td align="center">165712116</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_DL_s1x/165712116/model_final_844d15.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_DL_s1x/165712116/metrics.json">metrics</a></td> </tr> </tbody></table>

<a name="ModelZooConfidence"> Baselines with Confidence Estimation

These models perform additional estimation of confidence in regressed UV coodrinates, along the lines of Neverova et al., 2019.

<table><tbody> <!-- START TABLE --> <!-- TABLE HEADER --> <th valign="bottom">Name</th> <th valign="bottom">lr sched</th> <th valign="bottom">train time (s/iter)</th> <th valign="bottom">inference time (s/im)</th> <th valign="bottom">train mem (GB)</th> <th valign="bottom">box AP</th> <th valign="bottom">segm AP</th> <th valign="bottom">dp. AP GPS</th> <th valign="bottom">dp. AP GPSm</th> <th valign="bottom">model id</th> <th valign="bottom">download</th> <!-- TABLE BODY --> <!-- ROW: densepose_rcnn_R_50_FPN_WC1_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_50_FPN_WC1_s1x.yaml">R_50_FPN_WC1_s1x</a></td> <td align="center">s1x</td> <td align="center">0.353</td> <td align="center">0.064</td> <td align="center">4.6</td> <td align="center">60.5</td> <td align="center">67.0</td> <td align="center">64.2</td> <td align="center">65.4</td> <td align="center">173862049</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_WC1_s1x/173862049/model_final_289019.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_WC1_s1x/173862049/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_50_FPN_WC2_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_50_FPN_WC2_s1x.yaml">R_50_FPN_WC2_s1x</a></td> <td align="center">s1x</td> <td align="center">0.364</td> <td align="center">0.066</td> <td align="center">4.8</td> <td align="center">60.7</td> <td align="center">66.9</td> <td align="center">64.2</td> <td align="center">65.7</td> <td align="center">173861455</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_WC2_s1x/173861455/model_final_3abe14.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_WC2_s1x/173861455/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_50_FPN_DL_WC1_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_50_FPN_DL_WC1_s1x.yaml">R_50_FPN_DL_WC1_s1x</a></td> <td align="center">s1x</td> <td align="center">0.397</td> <td align="center">0.068</td> <td align="center">6.7</td> <td align="center">61.1</td> <td align="center">68.1</td> <td align="center">65.8</td> <td align="center">67.0</td> <td align="center">173067973</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_DL_WC1_s1x/173067973/model_final_b1e525.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_DL_WC1_s1x/173067973/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_50_FPN_DL_WC2_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_50_FPN_DL_WC2_s1x.yaml">R_50_FPN_DL_WC2_s1x</a></td> <td align="center">s1x</td> <td align="center">0.410</td> <td align="center">0.070</td> <td align="center">6.8</td> <td align="center">60.8</td> <td align="center">67.9</td> <td align="center">65.6</td> <td align="center">66.7</td> <td align="center">173859335</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_DL_WC2_s1x/173859335/model_final_60fed4.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_DL_WC2_s1x/173859335/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_101_FPN_WC1_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_101_FPN_WC1_s1x.yaml">R_101_FPN_WC1_s1x</a></td> <td align="center">s1x</td> <td align="center">0.435</td> <td align="center">0.076</td> <td align="center">5.7</td> <td align="center">62.5</td> <td align="center">67.6</td> <td align="center">64.9</td> <td align="center">66.3</td> <td align="center">171402969</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_WC1_s1x/171402969/model_final_9e47f0.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_WC1_s1x/171402969/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_101_FPN_WC2_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_101_FPN_WC2_s1x.yaml">R_101_FPN_WC2_s1x</a></td> <td align="center">s1x</td> <td align="center">0.450</td> <td align="center">0.078</td> <td align="center">5.7</td> <td align="center">62.3</td> <td align="center">67.6</td> <td align="center">64.8</td> <td align="center">66.4</td> <td align="center">173860702</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_WC2_s1x/173860702/model_final_5ea023.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_WC2_s1x/173860702/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_101_FPN_DL_WC1_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_101_FPN_DL_WC1_s1x.yaml">R_101_FPN_DL_WC1_s1x</a></td> <td align="center">s1x</td> <td align="center">0.479</td> <td align="center">0.081</td> <td align="center">7.9</td> <td align="center">62.0</td> <td align="center">68.4</td> <td align="center">66.2</td> <td align="center">67.2</td> <td align="center">173858525</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_DL_WC1_s1x/173858525/model_final_f359f3.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_DL_WC1_s1x/173858525/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_101_FPN_DL_WC2_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_101_FPN_DL_WC2_s1x.yaml">R_101_FPN_DL_WC2_s1x</a></td> <td align="center">s1x</td> <td align="center">0.491</td> <td align="center">0.082</td> <td align="center">7.6</td> <td align="center">61.7</td> <td align="center">68.3</td> <td align="center">65.9</td> <td align="center">67.2</td> <td align="center">173294801</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_DL_WC2_s1x/173294801/model_final_6e1ed1.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_DL_WC2_s1x/173294801/metrics.json">metrics</a></td> </tr> </tbody></table>

Acronyms:

WC1: with confidence estimation model type 1 for U and V

WC2: with confidence estimation model type 2 for U and V

<a name="ModelZooMaskConfidence"> Baselines with Mask Confidence Estimation

Models that perform estimation of confidence in regressed UV coodrinates as well as confidences associated with coarse and fine segmentation, see Sanakoyeu et al., 2020 for details.

<table><tbody> <!-- START TABLE --> <!-- TABLE HEADER --> <th valign="bottom">Name</th> <th valign="bottom">lr sched</th> <th valign="bottom">train time (s/iter)</th> <th valign="bottom">inference time (s/im)</th> <th valign="bottom">train mem (GB)</th> <th valign="bottom">box AP</th> <th valign="bottom">segm AP</th> <th valign="bottom">dp. AP GPS</th> <th valign="bottom">dp. AP GPSm</th> <th valign="bottom">model id</th> <th valign="bottom">download</th> <!-- TABLE BODY --> <!-- ROW: densepose_rcnn_R_50_FPN_WC1M_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_50_FPN_WC1M_s1x.yaml">R_50_FPN_WC1M_s1x</a></td> <td align="center">s1x</td> <td align="center">0.381</td> <td align="center">0.066</td> <td align="center">4.8</td> <td align="center">60.6</td> <td align="center">66.7</td> <td align="center">64.0</td> <td align="center">65.4</td> <td align="center">217144516</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_WC1M_s1x/217144516/model_final_48a9d9.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_WC1M_s1x/217144516/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_50_FPN_WC2M_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_50_FPN_WC2M_s1x.yaml">R_50_FPN_WC2M_s1x</a></td> <td align="center">s1x</td> <td align="center">0.342</td> <td align="center">0.068</td> <td align="center">5.0</td> <td align="center">60.7</td> <td align="center">66.9</td> <td align="center">64.2</td> <td align="center">65.5</td> <td align="center">216245640</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_WC2M_s1x/216245640/model_final_d79ada.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_WC2M_s1x/216245640/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_50_FPN_DL_WC1M_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_50_FPN_DL_WC1M_s1x.yaml">R_50_FPN_DL_WC1M_s1x</a></td> <td align="center">s1x</td> <td align="center">0.371</td> <td align="center">0.068</td> <td align="center">6.0</td> <td align="center">60.7</td> <td align="center">68.0</td> <td align="center">65.2</td> <td align="center">66.7</td> <td align="center">216245703</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_DL_WC1M_s1x/216245703/model_final_61971e.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_DL_WC1M_s1x/216245703/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_50_FPN_DL_WC2M_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_50_FPN_DL_WC2M_s1x.yaml">R_50_FPN_DL_WC2M_s1x</a></td> <td align="center">s1x</td> <td align="center">0.385</td> <td align="center">0.071</td> <td align="center">6.1</td> <td align="center">60.8</td> <td align="center">68.1</td> <td align="center">65.0</td> <td align="center">66.4</td> <td align="center">216245758</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_DL_WC2M_s1x/216245758/model_final_7bfb43.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_50_FPN_DL_WC2M_s1x/216245758/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_101_FPN_WC1M_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_101_FPN_WC1M_s1x.yaml">R_101_FPN_WC1M_s1x</a></td> <td align="center">s1x</td> <td align="center">0.423</td> <td align="center">0.079</td> <td align="center">5.9</td> <td align="center">62.0</td> <td align="center">67.3</td> <td align="center">64.8</td> <td align="center">66.0</td> <td align="center">216453687</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_WC1M_s1x/216453687/model_final_0a7287.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_WC1M_s1x/216453687/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_101_FPN_WC2M_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_101_FPN_WC2M_s1x.yaml">R_101_FPN_WC2M_s1x</a></td> <td align="center">s1x</td> <td align="center">0.436</td> <td align="center">0.080</td> <td align="center">5.9</td> <td align="center">62.5</td> <td align="center">67.4</td> <td align="center">64.5</td> <td align="center">66.0</td> <td align="center">216245682</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_WC2M_s1x/216245682/model_final_e354d9.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_WC2M_s1x/216245682/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_101_FPN_DL_WC1M_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_101_FPN_DL_WC1M_s1x.yaml">R_101_FPN_DL_WC1M_s1x</a></td> <td align="center">s1x</td> <td align="center">0.453</td> <td align="center">0.079</td> <td align="center">6.8</td> <td align="center">62.0</td> <td align="center">68.1</td> <td align="center">66.4</td> <td align="center">67.1</td> <td align="center">216245771</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_DL_WC1M_s1x/216245771/model_final_0ebeb3.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_DL_WC1M_s1x/216245771/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_rcnn_R_101_FPN_DL_WC2M_s1x --> <tr><td align="left"><a href="../configs/densepose_rcnn_R_101_FPN_DL_WC2M_s1x.yaml">R_101_FPN_DL_WC2M_s1x</a></td> <td align="center">s1x</td> <td align="center">0.464</td> <td align="center">0.080</td> <td align="center">6.9</td> <td align="center">61.9</td> <td align="center">68.2</td> <td align="center">66.1</td> <td align="center">67.1</td> <td align="center">216245790</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_DL_WC2M_s1x/216245790/model_final_de6e7a.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/densepose_rcnn_R_101_FPN_DL_WC2M_s1x/216245790/metrics.json">metrics</a></td> </tr> </tbody></table>

Acronyms:

WC1M: with confidence estimation model type 1 for U and V and mask confidence estimation

WC2M: with confidence estimation model type 2 for U and V and mask confidence estimation

<a name="ModelZooBootstrap"></a> Bootstrapping Baselines

Master and student models trained using the bootstrapping pipeline with chimpanzee as the target category, see Sanakoyeu et al., 2020 and Bootstrapping Pipeline for details. Evaluation is performed on DensePose Chimps dataset.

<table><tbody> <!-- START TABLE --> <!-- TABLE HEADER --> <th valign="bottom">Name</th> <th valign="bottom">lr sched</th> <th valign="bottom">train time (s/iter)</th> <th valign="bottom">inference time (s/im)</th> <th valign="bottom">train mem (GB)</th> <th valign="bottom">box AP</th> <th valign="bottom">segm AP</th> <th valign="bottom">dp. APex GPS</th> <th valign="bottom">dp. AP GPS</th> <th valign="bottom">dp. AP GPSm</th> <th valign="bottom">model id</th> <th valign="bottom">download</th> <!-- TABLE BODY --> <!-- ROW: densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA --> <tr><td align="left"><a href="../configs/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA.yaml">R_50_FPN_DL_WC1M_3x_Atop10P_CA</a></td> <td align="center">3x</td> <td align="center">0.522</td> <td align="center">0.073</td> <td align="center">9.7</td> <td align="center">61.3</td> <td align="center">59.1</td> <td align="center">36.2</td> <td align="center">20.0</td> <td align="center">30.2</td> <td align="center">217578784</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA/217578784/model_final_9fe1cc.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10 P_CA/217578784/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_uniform --> <tr><td align="left"><a href="../configs/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_uniform.yaml">R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_uniform</a></td> <td align="center">3x</td> <td align="center">1.939</td> <td align="center">0.072</td> <td align="center">10.1</td> <td align="center">60.9</td> <td align="center">58.5</td> <td align="center">37.2</td> <td align="center">21.5</td> <td align="center">31.0</td> <td align="center">256453729</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_uniform/256453729/model_final_241ff5.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_uniform/256453729/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_uv --> <tr><td align="left"><a href="../configs/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_uv.yaml">R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_uv</a></td> <td align="center">3x</td> <td align="center">1.985</td> <td align="center">0.072</td> <td align="center">9.6</td> <td align="center">61.4</td> <td align="center">58.9</td> <td align="center">38.3</td> <td align="center">22.2</td> <td align="center">32.1</td> <td align="center">256452095</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_uv/256452095/model_final_d689e2.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_uv/256452095/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_finesegm --> <tr><td align="left"><a href="../configs/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_finesegm.yaml">R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_finesegm</a></td> <td align="center">3x</td> <td align="center">2.047</td> <td align="center">0.072</td> <td align="center">10.3</td> <td align="center">60.9</td> <td align="center">58.5</td> <td align="center">36.7</td> <td align="center">20.7</td> <td align="center">30.7</td> <td align="center">256452819</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_finesegm/256452819/model_final_cb4ac6.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_finesegm/256452819/metrics.json">metrics</a></td> </tr> <!-- ROW: densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_coarsesegm --> <tr><td align="left"><a href="../configs/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_coarsesegm.yaml">R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_coarsesegm</a></td> <td align="center">3x</td> <td align="center">1.830</td> <td align="center">0.070</td> <td align="center">9.6</td> <td align="center">61.3</td> <td align="center">59.2</td> <td align="center">37.9</td> <td align="center">21.5</td> <td align="center">31.6</td> <td align="center">256455697</td> <td align="center"><a href="https://dl.fbaipublicfiles.com/densepose/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_coarsesegm/256455697/model_final_a6a4bf.pkl">model</a>&nbsp;|&nbsp;<a href="https://dl.fbaipublicfiles.com/densepose/evolution/densepose_R_50_FPN_DL_WC1M_3x_Atop10P_CA_B_coarsesegm/256455697/metrics.json">metrics</a></td> </tr> </tbody></table>

Acronyms:

WC1M: with confidence estimation model type 1 for U and V and mask confidence estimation

Atop10P: humans and animals from the 10 best suitable categories are used for training

CA: class agnostic training, where all annotated instances are mapped into a single category

B_<...>: schedule with bootstrapping with the specified results sampling strategy

Note:

The relaxed dp. APex GPS metric was used in Sanakoyeu et al., 2020 to evaluate DensePose results. This metric considers matches at thresholds 0.2, 0.3 and 0.4 additionally to the standard ones used in the evaluation protocol. The minimum threshold is controlled by DENSEPOSE_EVALUATION.MIN_IOU_THRESHOLD config option.

License

All models available for download are licensed under the Creative Commons Attribution-ShareAlike 3.0 license

<a name="References"></a> References

If you use chart-based DensePose methods, please take the references from the following BibTeX entries:

DensePose bootstrapping pipeline:

@InProceedings{Sanakoyeu2020TransferringDensePose,
    title = {Transferring Dense Pose to Proximal Animal Classes},
    author = {Artsiom Sanakoyeu and Vasil Khalidov and Maureen S. McCarthy and Andrea Vedaldi and Natalia Neverova},
    journal = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
    year = {2020},
}

DensePose with confidence estimation:

@InProceedings{Neverova2019DensePoseConfidences,
    title = {Correlated Uncertainty for Learning Dense Correspondences from Noisy Labels},
    author = {Neverova, Natalia and Novotny, David and Vedaldi, Andrea},
    journal = {Advances in Neural Information Processing Systems},
    year = {2019},
}

Original DensePose:

@InProceedings{Guler2018DensePose,
  title={DensePose: Dense Human Pose Estimation In The Wild},
  author={R\{i}za Alp G\"uler, Natalia Neverova, Iasonas Kokkinos},
  journal={The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2018}
}