Bounding the bias of contrastive divergence learning

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt

Anja Fischer
Igel, Christian

Optimization based on k-step contrastive divergence (CD) has become a common way to train restricted Boltzmann machines (RBMs). The k-step CD is a biased estimator of the log-likelihood gradient relying on Gibbs sampling. We derive a new upper bound for this bias. Its magnitude depends on k, the number of variables in the RBM, and the maximum change in energy that can be produced by changing a single variable. The last reflects the dependence on the absolute values of the RBM parameters. The magnitude of the bias is also affected by the distance in variation between the modeled distribution and the starting distribution of the Gibbs chain.

Originalsprog	Engelsk
Tidsskrift	Neural Computation
Vol/bind	23
Udgave nummer	3
Sider (fra-til)	664-673
Antal sider	10
ISSN	0899-7667
DOI	https://doi.org/10.1162/NECO_a_00085
Status	Udgivet - 2011

ID: 32089131