
Pix2Pix

Pix2Pix: Image-to-Image Translation with Conditional Adversarial Networks

Abstract

We investigate conditional adversarial networks as a general-purpose solution to image-to-image translation problems. These networks not only learn the mapping from input image to output image, but also learn a loss function to train this mapping. This makes it possible to apply the same generic approach to problems that traditionally would require very different loss formulations. We demonstrate that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks. Moreover, since the release of the pix2pix software associated with this paper, hundreds of Twitter users have posted their own artistic experiments using our system. As a community, we no longer hand-engineer our mapping functions, and this work suggests we can achieve reasonable results without hand-engineering our loss functions either.

Results and Models

Results from Pix2Pix trained by MMGeneration
We use `FID` and `IS` metrics to evaluate the generation performance of pix2pix (see Note 1).

| Models | Dataset | FID | IS | Config | Download |
| :----: | :-----: | :-: | :-: | :----: | :------: |
| Ours | facades | 124.9773 | 1.620 | config | model / log (see Note 2) |
| Ours | aerial2maps | 122.5856 | 3.137 | config | model |
| Ours | maps2aerial | 88.4635 | 3.310 | config | model |
| Ours | edges2shoes | 84.3750 | 2.815 | config | model |

FID comparison with official:

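| Dataset | facades | aerial2maps | maps2aerial | edges2shoes | average |
| :-----: | :-----: | :---------: | :---------: | :---------: | :-----: |
| official | 119.135 | 149.731 | 102.072 | 75.774 | 111.678 |
| ours | 124.9773 | 122.5856 | 88.4635 | 84.3750 | 105.1003 |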

IS comparison with official:

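| Dataset | facades | aerial2maps | maps2aerial | edges2shoes | average |
| :-----: | :-----: | :---------: | :---------: | :---------: | :-----: |
| official | 1.650 | 2.529 | 3.552 | 2.766 | 2.624 |
| ours | 1.620 | 3.137 | 3.310 | 2.815 | 2.7205 |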
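
The `average` columns in the two comparison tables are unweighted means over the four datasets. The following is a minimal sketch that recomputes them from the per-dataset values listed above, using only the Python standard library. The variable and function names are illustrative and are not part of MMGeneration.

```python
from statistics import mean

# Per-dataset scores copied from the comparison tables above
# (order: facades, aerial2maps, maps2aerial, edges2shoes).
fid = {
    "official": [119.135, 149.731, 102.072, 75.774],
    "ours": [124.9773, 122.5856, 88.4635, 84.3750],
}
inception_score = {
    "official": [1.650, 2.529, 3.552, 2.766],
    "ours": [1.620, 3.137, 3.310, 2.815],
}


def averages(table):
    """Return the unweighted mean over the four datasets for each row."""
    return {name: mean(scores) for name, scores in table.items()}


fid_avg = averages(fid)
is_avg = averages(inception_score)

for name in ("official", "ours"):
    print(f"{name:8s} FID avg = {fid_avg[name]:.4f}   IS avg = {is_avg[name]:.4f}")
```

When reading the results, recall that a lower FID is better while a higher IS is better.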

Note:

  1. We strictly follow the setting in Section 3.3 of the paper: "At inference time, we run the generator net in exactly the same manner as during the training phase. This differs from the usual protocol in that we apply dropout at test time, and we apply batch normalization using the statistics of the test batch, rather than aggregated statistics of the training batch." In other words, inference runs in `model.train()` mode, so the results may differ slightly from run to run.
  2. This is the training log before refactoring. Updated logs will be released soon.
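
To illustrate why Note 1 leads to run-to-run variation, here is a framework-free toy sketch of the two train-mode behaviours it describes: normalizing with the statistics of the current test batch, and keeping dropout active at test time. This is not MMGeneration's or the paper's implementation. `batch_norm`, `dropout`, and `toy_generator` are hypothetical helpers written for this example, and the input values are made up.

```python
import random
from statistics import mean, pstdev


def batch_norm(batch, eps=1e-5):
    """Normalize with the statistics of the current batch (train-mode behaviour)."""
    mu = mean(batch)
    var = pstdev(batch) ** 2
    return [(x - mu) / (var + eps) ** 0.5 for x in batch]


def dropout(values, p, rng):
    """Inverted dropout: zero each value with probability p, rescale the rest."""
    return [0.0 if rng.random() < p else v / (1.0 - p) for v in values]


def toy_generator(batch, rng, p=0.5):
    """Hypothetical stand-in for one generator layer run in train mode."""
    return dropout(batch_norm(batch), p, rng)


rng = random.Random(0)
test_batch = [0.2, 1.5, -0.7, 3.1]  # made-up activations for illustration

# Two passes over the same input: the dropout mask is resampled each time,
# so the outputs are generally not identical.
first = toy_generator(test_batch, rng)
second = toy_generator(test_batch, rng)
print(first)
print(second)
print("identical:", first == second)
```

Because the normalization depends on the batch and the dropout mask is resampled on every call, metrics such as FID and IS computed in this mode can vary slightly between evaluation runs.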

Citation

@inproceedings{isola2017image,
  title={Image-to-image translation with conditional adversarial networks},
  author={Isola, Phillip and Zhu, Jun-Yan and Zhou, Tinghui and Efros, Alexei A},
  booktitle={Proceedings of the IEEE conference on computer vision and pattern recognition},
  pages={1125--1134},
  year={2017},
  url={https://openaccess.thecvf.com/content_cvpr_2017/html/Isola_Image-To-Image_Translation_With_CVPR_2017_paper.html},
}