chainer.links.ResNet50Layers

class chainer.links.ResNet50Layers(pretrained_model='auto')[source]
A pre-trained CNN model with 50 layers provided by MSRA.
When you specify the path of a pre-trained chainer model serialized as a .npz file in the constructor, this chain model automatically initializes all the parameters with it. This model is useful when you want to extract a semantic feature vector per image, or to fine-tune the model on a different dataset. Note that unlike VGG16Layers, it does not automatically download a pre-trained caffemodel; the caffemodel can be downloaded at GitHub. If you want to manually convert the pre-trained caffemodel to a chainer model that can be specified in the constructor, please use the convert_caffemodel_to_npz classmethod instead.

ResNet50 has 25,557,096 trainable parameters, 43% and 58% fewer than ResNet101 and ResNet152, respectively. On the other hand, its top-5 classification accuracy on the ImageNet dataset drops only 0.7% and 1.1% relative to ResNet101 and ResNet152, respectively. ResNet50 therefore offers perhaps the best balance between accuracy and model size. It is sufficient for many use cases, but some advanced models for object detection or semantic segmentation use the deeper variants as their building blocks, so those deeper ResNets are also provided to make reproduction work easier.
See: K. He et al., Deep Residual Learning for Image Recognition
Parameters:	pretrained_model (str) – the destination of the pre-trained chainer model serialized as a .npz file. If this argument is specified as auto, it automatically loads and converts the caffemodel from $CHAINER_DATASET_ROOT/pfnet/chainer/models/ResNet-50-model.caffemodel, where $CHAINER_DATASET_ROOT is set to $HOME/.chainer/dataset unless you specify another value through the environment variable. Note that in this case the converted chainer model is stored in the same directory and automatically used from the next time. If this argument is specified as None, the parameters are not initialized by the pre-trained model but by the default initializer used in the original paper, i.e., chainer.initializers.HeNormal(scale=1.0).
Variables:	available_layers (list of str) – The list of available layer names used by the __call__ and extract methods.

Methods
__call__(self, x, layers=['prob'])[source]
Computes all the feature maps specified by layers.

Warning: The test argument is not supported anymore since v2. Instead, use chainer.using_config('train', train). See chainer.using_config().

Parameters:	
- x (Variable) – Input variable.
- layers (list of str) – The list of layer names you want to extract.
Returns:	A dictionary mapping each requested layer name to the corresponding feature map variable.
Return type:	Dictionary of ~chainer.Variable
add_link(name, link)[source]
Registers a child link to this chain.

Deprecated since version v2.0.0: Assign the child link directly to an attribute within an initialization scope instead. For example, the following code

chain.add_link('l1', L.Linear(3, 5))

can be replaced by the following line:

with self.init_scope():
    chain.l1 = L.Linear(3, 5)

The latter is easier for IDEs to keep track of the attribute's type.

Parameters:	
- name (str) – Name of the child link. This name is also used as the attribute name.
- link (Link) – The link object to be registered.
add_param(name, shape=None, dtype=<type 'numpy.float32'>, initializer=None)[source]
Registers a parameter to the link.

Deprecated since version v2.0.0: Assign a Parameter object directly to an attribute within an initialization scope instead. For example, the following code

link.add_param('W', shape=(5, 3))

can be replaced by the following assignment:

with self.init_scope():
    link.W = chainer.Parameter(None, (5, 3))

The latter is easier for IDEs to keep track of the attribute's type.

Parameters:	
- name (str) – Name of the parameter. This name is also used as the attribute name.
- shape (int or tuple of ints) – Shape of the parameter array. If it is omitted, the parameter variable is left uninitialized.
- dtype – Data type of the parameter array.
- initializer – If it is not None, the data is initialized with the given initializer. If it is an array, the data is directly initialized by it. If it is callable, it is used as a weight initializer. Note that in these cases, the dtype argument is ignored.
add_persistent(name, value)[source]
Registers a persistent value to the link.
The registered value is saved and loaded on serialization and deserialization. The value is set to an attribute of the link.
Parameters: - name (str) – Name of the persistent value. This name is also used for the attribute name.
- value – Value to be registered.
cleargrads()[source]
Clears all gradient arrays.
This method should be called before the backward computation at every iteration of the optimization.
convert_caffemodel_to_npz(path_caffemodel, path_npz, n_layers=50)[source]
Converts a pre-trained caffemodel to a chainer model.

Parameters:	
- path_caffemodel (str) – Path of the pre-trained caffemodel.
- path_npz (str) – Path of the converted chainer model to be written.
- n_layers (int) – The number of layers of the network (50 for this class).
disable_update()[source]
Disables update rules of all parameters under the link hierarchy.

This method sets the enabled flag of the update rule of each parameter variable to False.
enable_update()[source]
Enables update rules of all parameters under the link hierarchy.

This method sets the enabled flag of the update rule of each parameter variable to True.
extract(self, images, layers=['pool5'], size=(224, 224))[source]
Extracts all the feature maps of given images.

The difference from directly executing __call__ is that it directly accepts images as an input and automatically transforms them to a proper variable. That is, it is also interpreted as a shortcut method that implicitly calls the prepare and __call__ functions.

Warning: The test and volatile arguments are not supported anymore since v2. Instead, use chainer.using_config('train', train) and chainer.using_config('enable_backprop', not volatile), respectively. See chainer.using_config().

Parameters:	
- images (iterable of PIL.Image or numpy.ndarray) – Input images.
- layers (list of str) – The list of layer names you want to extract.
- size (pair of ints) – The resolution of resized images used as an input of CNN. If this argument is None, the given images are not resized, but then the resolutions of all the images must be the same.
Returns:	A dictionary mapping each requested layer name to the corresponding feature map variable.
Return type:	Dictionary of ~chainer.Variable
init_scope(*args, **kwds)[source]
Creates an initialization scope.

This method returns a context manager object that enables registration of parameters (and links for Chain) by an assignment. A Parameter object can be automatically registered by assigning it to an attribute under this context manager.

Example

In most cases, the parameter registration is done in the initializer method. Using the init_scope method, we can simply assign a Parameter object to register it to the link.

class MyLink(chainer.Link):
    def __init__(self):
        super().__init__()
        with self.init_scope():
            self.W = chainer.Parameter(0, (10, 5))
            self.b = chainer.Parameter(0, (5,))
predict(images, oversample=True)[source]
Computes all the probabilities of given images.

Parameters:	
- images (iterable of PIL.Image or numpy.ndarray) – Input images.
- oversample (bool) – If True, it averages results across center, corners, and mirrors. Otherwise, it uses only the center crop.
Returns:	Output that contains the class probabilities of given images.
Return type:	~chainer.Variable
register_persistent(name)[source]
Registers an attribute of a given name as a persistent value.

This is a convenient method to register an existing attribute as a persistent value. If name has already been registered as a parameter, this method removes it from the list of parameter names and re-registers it as a persistent value.

Parameters:	name (str) – Name of the attribute to be registered.
zerograds()[source]
Initializes all gradient arrays by zero.

This method can be used for the same purpose as cleargrads(), but is less efficient. It is left only for backward compatibility.

Deprecated since version v1.15: Use cleargrads() instead.
Attributes
available_layers

functions

update_enabled
True if at least one parameter has an update rule enabled.

within_init_scope
True if the current code is inside of an initialization scope.

See init_scope() for the details of the initialization scope.