chainer.initializers.Orthogonal

class chainer.initializers.Orthogonal(scale=1.1, dtype=None, mode='auto', **kwargs)[source]

Initializes array with an orthogonal system.

This initializer first makes a matrix of the same shape as the array to be initialized whose elements are drawn independently from standard Gaussian distribution. Next, it applies QR decomposition to (the transpose of) the matrix. To make the decomposition (almost surely) unique, we require the diagonal of the triangular matrix R to be non-negative (see e.g. Edelman & Rao, https://web.eecs.umich.edu/~rajnrao/Acta05rmt.pdf). Then, it initializes the array with the (semi-)orthogonal matrix Q. Finally, the array is multiplied by the constant scale.

If the ndim of the input array is more than 2, we consider the array to be a matrix by concatenating all axes except the first one.

The number of vectors consisting of the orthogonal system (i.e. first element of the shape of the array) must be equal to or smaller than the dimension of each vector (i.e. second element of the shape of the array).

Variables
  • ~Orthogonal.scale (float) – A constant to be multiplied by.

  • ~Orthogonal.dtype – Data type specifier.

  • ~Orthogonal.mode (str) – Assertion on the initialized shape. 'auto' (default), 'projection' (before v7), 'embedding', or 'basis'.

  • ~Orthogonal.rng (xp.random.RandomState) – Pseudo-random number generator.

Reference: Saxe et al., https://arxiv.org/abs/1312.6120

Methods

__call__(array)[source]

Initializes given array.

This method destructively changes the value of array. The derived class is required to implement this method. The algorithms used to make the new values depend on the concrete derived classes.

Parameters

array (N-dimensional array) – An array to be initialized by this initializer.

__eq__(value, /)

Return self==value.

__ne__(value, /)

Return self!=value.

__lt__(value, /)

Return self<value.

__le__(value, /)

Return self<=value.

__gt__(value, /)

Return self>value.

__ge__(value, /)

Return self>=value.