SHHQ is a dataset of high-quality full-body human images at a resolution of 1024 × 512. Because the data must pass a rigorous legal review at our institute, we cannot release all of it at once.
For now, SHHQ-1.0 with 40K images is released! More data will be released in later versions.
Images are collected in two main ways:
The composition of SHHQ-1.0:
We are aware of privacy concerns and take license and privacy issues seriously. All released data will be provided under the CC0 license and is free for research use. Persons in the dataset are anonymised, and no additional private or sensitive metadata is included.
The SHHQ is available for non-commercial research purposes only.
You agree not to reproduce, duplicate, copy, sell, trade, resell or exploit any portion of the images and any portion of the derived data for commercial purposes.
You agree NOT to further copy, publish or distribute any portion of SHHQ to any third party for any purpose. As an exception, copies of the dataset may be made for internal use at a single site within the same organization.
Shanghai AI Lab reserves the right to terminate your access to the SHHQ at any time.
For those interested in our dataset, we provide a preview version with 100 images randomly sampled from SHHQ-1.0: SHHQ-1.0_samples.
In SHHQ-1.0, we provide aligned raw images along with machine-calculated segmentation masks. We plan to release a manually annotated human-parsing version of these 40,000 images later. Please stay tuned.
We also provide the script `bg_white.py`, which whitens the background of a raw image using its segmentation mask.
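The idea behind whitening a background with a mask can be sketched as follows. This is a minimal illustration, not the contents of `bg_white.py`: the function name `whiten_background`, the `threshold` parameter, and the assumption that non-zero mask pixels mark the person are all hypothetical and may differ from the released script and mask format.

```python
import numpy as np


def whiten_background(image: np.ndarray, mask: np.ndarray, threshold: int = 0) -> np.ndarray:
    """Return a copy of `image` with every pixel outside the mask set to white.

    image: H x W x 3 uint8 RGB array.
    mask:  H x W (or H x W x C) array; ASSUMPTION: values above `threshold`
           mark the person, everything else is background.
    """
    if mask.ndim == 3:
        # Collapse a colour-coded mask: a pixel is foreground if any channel is set.
        mask = mask.max(axis=2)
    if mask.shape != image.shape[:2]:
        raise ValueError("image and mask must have the same height and width")

    foreground = mask > threshold
    # Start from an all-white canvas and copy over only the foreground pixels.
    result = np.full_like(image, 255)
    result[foreground] = image[foreground]
    return result
```

With Pillow installed, an image and its mask could be loaded as arrays via `np.asarray(Image.open(path).convert("RGB"))` and the result saved with `Image.fromarray(result).save(out_path)`; the file paths are placeholders to be replaced with the actual dataset layout.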
If you want to access the full SHHQ-1.0, please read the following instructions.
| Structure | 1024x512 | Metric | Scores | 512x256 | Metric | Scores |
|---|---|---|---|---|---|---|
| StyleGAN1 | to be released | - | - | to be released | - | - |
| StyleGAN2 | SHHQ-1.0_sg2_1024.pkl | fid50k_full | 3.56 | SHHQ-1.0_sg2_512.pkl | fid50k_full | 3.68 |
| StyleGAN3 | to be released | - | - | to be released | - | - |
Please download the SHHQ Dataset Release Agreement from link. Read it carefully, complete and sign it appropriately.
Please send the completed form to Jianglin Fu ([email protected]) and Shikai Li ([email protected]), and cc Wayne Wu ([email protected]), using your institutional email address. The email subject should be "SHHQ Dataset Release Agreement". We will verify your request and contact you with the dataset link and the password to unzip the image data.
Note:
We are currently receiving a large number of applications and need to verify every applicant carefully. Please be patient; we will reply as soon as possible.
The signature in the agreement should be hand-written.
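Once the archive and password have been received, extraction can be scripted. The sketch below assumes the data arrives as a standard ZIP file; the function name `extract_archive` and all paths are hypothetical. Note that Python's built-in `zipfile` module can only decrypt archives protected with traditional ZIP encryption, so an AES-encrypted archive would need a different tool.

```python
import zipfile
from pathlib import Path
from typing import List, Optional


def extract_archive(archive_path: str, dest_dir: str, password: Optional[str] = None) -> List[str]:
    """Extract every member of a ZIP archive into dest_dir and return the member names."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as archive:
        # zipfile expects the password as bytes; None means the archive is not encrypted.
        pwd = password.encode("utf-8") if password is not None else None
        archive.extractall(path=dest, pwd=pwd)
        return archive.namelist()
```

A call would look like `extract_archive("path/to/archive.zip", "path/to/output", password="...")`, with the real file name and password taken from the reply email.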