UNIT series¶

UNIT¶

Unsupervised Image-to-Image Translation Networks (NIPS 2017) by Nvidia
Seperate the generator of GAN into encoder+decoder pairs, just like VAE.
Shared-latent space constraint implies the cycle-consistency constraint.
VAE + GAN + share weight of last layer of E1 and E2; also share the first layer of G1, G2
../_images/UNIT.png
Cycle-consistency is not necessary for this task, however, preformance: proposed(UNIT-shared latent space + cycle-consistency) > cycleGAN > shared latent space (VAE-GAN)
comparing with cycleGAN, it learn shape better.

MUNIT¶

Multimodal Unsupervised Image-to-Image Translation (ECCV 2018)

seperate latent code into content code and style code, learn from swapping attribute code
style is embedded in hidden layer of generator

DRIT¶

Diverse Image-to-Image Translation via Disentangled Representations (ECCV 2018)
Project | Pytorch 0.4.0

concurrent works of MUNIT, style code in MUNIT~attribute code in DRIT
keep weight sharing of UNIT
add content adversarial loss to force content generator produce encoding that could not be distingished, same concept of Transfer Learning/DANN

DRIT++¶

DRIT++: Diverse Image-to-Image Translation via Disentangled Representations (IJCV Journal extension for ECCV 2018)

mode-seeking regularization for improving sample diversity Mode Seeking GAN
add one-hot domain code

FUNIT¶

Few-Shot Unsupervised Image-to-Image Translation (ICCV 2019)
Project page | pyTorch | FUNIT Explained | GANimal Demo Video | Have fun with GANimal

TUNIT¶

Rethinking the Truly Unsupervised Image-to-Image Translation (2020) - Clova AI
pyTorch 1.1 | reddit | Twitter | youtube
Truly unsupervised: no any supervision, neither image-level(paired) or set-level (unpaired, domain)
similar to styleGAN? but styleGAN assume no domain as all while TUNIT assume dataset have dmoain, but it is unknown in training dataset. TUNIT learn to produce domain label via IIC clustering.