Model Architectures.
Data Set . | Architecture . | |
---|---|---|
MNIST/fMNIST//SVHN/3DShapes/sDprites/3Dcars |
Data Set . | Architecture . | |
---|---|---|
MNIST/fMNIST//SVHN/3DShapes/sDprites/3Dcars |
Notes: All convolutions and transposed convolutions are with stride 2 and padding 1. Unless stated otherwise, layers have parametric-RELU () activation functions, except output layers of the preimage maps, which have sigmoid activation functions (since input data are normalized [0, 1]). Adam and Cayley ADAM optimizers have learning rates and , respectively. The preimage map/decoder network is always taken as transposed of the feature map/encoder network. for 3D cars; and for all others. Further, and stride 1 for MNIST, fMNIST, SVHN and 3DShapes; and for others. SVHN and 3DShapes are resized to input dimensions.