Autoencoding beyond pixels using a learned similarity metric

Publication: Contribution to book/anthology/report › Conference contribution in proceedings › Research › peer-reviewed

  • Anders Boesen Lindbo Larsen
  • Søren Kaae Sønderby
  • Hugo Larochelle
  • Ole Winther

We present an autoencoder that leverages learned representations to better measure similarities in data space. By combining a variational autoencoder (VAE) with a generative adversarial network (GAN), we can use the learned feature representations in the GAN discriminator as the basis for the VAE reconstruction objective. Thereby, we replace element-wise errors with feature-wise errors to better capture the data distribution while offering invariance towards, e.g., translation. We apply our method to images of faces and show that it outperforms VAEs with element-wise similarity measures in terms of visual fidelity. Moreover, we show that the method learns an embedding in which high-level abstract visual features (e.g. wearing glasses) can be modified using simple arithmetic.
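The core substitution described in the abstract — scoring reconstructions by their distance in a discriminator's feature space rather than pixel space — can be sketched in a few lines of NumPy. Here `disc_features` is a hypothetical stand-in (a fixed random linear map with a ReLU) for the hidden activations of a trained GAN discriminator; it is not the paper's actual network, only an illustration of where the loss is computed.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in for the l-th hidden layer of a trained GAN discriminator:
# a fixed random linear map followed by a ReLU. (Hypothetical; the paper
# uses the learned activations of the real discriminator here.)
W = rng.normal(size=(16, 64))

def disc_features(x):
    """Map an input to 'discriminator' feature space."""
    return np.maximum(W @ x, 0.0)

def elementwise_loss(x, x_rec):
    """Pixel-space squared error: the usual VAE reconstruction term."""
    return float(np.mean((x - x_rec) ** 2))

def featurewise_loss(x, x_rec):
    """Squared error between discriminator activations, replacing the
    element-wise term in the VAE objective as the paper proposes."""
    return float(np.mean((disc_features(x) - disc_features(x_rec)) ** 2))

# Example: compare a "data point" with a noisy reconstruction of it.
x = rng.normal(size=64)
x_rec = x + rng.normal(scale=0.5, size=64)

pixel_err = elementwise_loss(x, x_rec)
feature_err = featurewise_loss(x, x_rec)
```

In the full method, `featurewise_loss` is what the VAE decoder is trained to minimize, while the discriminator itself keeps being trained adversarially, so the similarity metric is learned jointly with the generative model.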

Original language: English
Title: Proceedings of The 33rd International Conference on Machine Learning
Editors: Maria Florina Balcan, Kilian Q. Weinberger
Number of pages: 9
Publication date: 2016
Pages: 1558–1566
ISBN (electronic): 978-151082900-8
Status: Published - 2016
Event: 33rd International Conference on Machine Learning - New York, USA
Duration: 19 Jun 2016 – 24 Jun 2016
Conference number: 33

Conference

Conference: 33rd International Conference on Machine Learning
Number: 33
Country: USA
City: New York
Period: 19/06/2016 – 24/06/2016
Series: JMLR: Workshop and Conference Proceedings
Volume: 48
