CuPy Overview¶

CuPy is an implementation of a NumPy-compatible multi-dimensional array on CUDA. CuPy consists of the core multi-dimensional array class, cupy.ndarray, and many functions that operate on it. It supports a subset of the numpy.ndarray interface that is sufficient for Chainer.

The following is a brief overview of the supported subset of the NumPy interface:

Basic indexing (indexing by ints, slices, newaxes, and Ellipsis)
Element types (dtypes): bool_, (u)int{8, 16, 32, 64}, float{16, 32, 64}
Most of the array creation routines
Reshaping and transposition
All operators with broadcasting
All universal functions (a.k.a. ufuncs) for elementwise operations, except those for complex numbers
Dot product functions (except einsum) using cuBLAS
Reduction along axes (sum, max, argmax, etc.)
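
The items above can be sketched in a few lines. This is a minimal illustration, not a reference for the API: it assumes CuPy is installed and a CUDA device is usable, and otherwise falls back to NumPy (the `xp` alias and the fallback are assumptions added here so the snippet also runs on a CPU-only machine).

```python
import numpy as np

try:
    import cupy as cp
    cp.zeros(1)          # raises if no usable CUDA device is present
    xp = cp
except Exception:
    xp = np              # assumption: fall back to NumPy for CPU-only runs

a = xp.arange(12, dtype=xp.float32).reshape(3, 4)

# Basic indexing: ints, slices, newaxis, and Ellipsis
first_row = a[0]                # shape (4,)
inner = a[1:, ::2]              # shape (2, 2)
column = a[..., 1]              # shape (3,)
expanded = a[:, np.newaxis]     # shape (3, 1, 4)

# Operator with broadcasting: a (4,) vector is added to every row
bias = xp.array([1, 2, 3, 4], dtype=xp.float32)
shifted = a + bias

# Elementwise ufunc
roots = xp.sqrt(a)

# Dot product: (3, 4) x (4, 3) -> (3, 3)
gram = xp.dot(a, a.T)

# Reductions along an axis
col_sums = a.sum(axis=0)
row_argmax = a.argmax(axis=1)
```

Because the calls are the same under both modules, code written against `xp` can usually be moved between CPU and GPU by changing only the import.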

CuPy also includes the following features for performance:

Customizable memory allocator, and a simple memory pool as an example
User-defined elementwise kernels
User-defined reduction kernels
cuDNN utilities
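
As an illustration of a user-defined elementwise kernel, the sketch below computes a squared difference with `cupy.ElementwiseKernel`. The kernel name `squared_diff`, the wrapper function, and the NumPy fallback for machines without a usable GPU are assumptions made for this example.

```python
import numpy as np

try:
    import cupy as cp
    cp.zeros(1)          # raises if no usable CUDA device is present
except Exception:
    cp = None            # assumption: run the CPU fallback below instead


def squared_diff(x, y):
    """Return (x - y)**2 elementwise as a NumPy array."""
    if cp is None:
        return (x - y) ** 2
    # Input params, output params, loop body, and kernel name.
    kernel = cp.ElementwiseKernel(
        'float32 x, float32 y',
        'float32 z',
        'z = (x - y) * (x - y)',
        'squared_diff')
    # The kernel broadcasts its arguments like a ufunc.
    return kernel(cp.asarray(x), cp.asarray(y)).get()


x = np.arange(6, dtype=np.float32).reshape(2, 3)
y = np.arange(3, dtype=np.float32)
result = squared_diff(x, y)
```

In real code the kernel object would normally be created once and reused rather than rebuilt on every call.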

CuPy uses on-the-fly kernel synthesis: when a kernel call is required, it compiles kernel code optimized for the shapes and dtypes of the given arguments, sends it to the GPU device, and executes the kernel. The compiled code is cached in the $(HOME)/.cupy/kernel_cache directory (this cache path can be overridden by setting the CUPY_CACHE_DIR environment variable). Compilation may make the first call of a kernel slower, but the slowdown does not recur on the second and later executions. CuPy also caches the kernel code sent to the GPU device within the process, which reduces the kernel transfer time on further calls.
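
A rough way to observe the caching behaviour is to redirect the cache directory and time the same operation twice. The sketch below is illustrative only: the cache path is a hypothetical temporary directory, the helper `timed_sum_of_squares` is made up for this example, and the NumPy fallback (where no compilation happens at all) is an assumption so the snippet runs without a GPU.

```python
import os
import tempfile
import time

# Set the variable before CuPy compiles anything. The path is hypothetical.
cache_dir = os.path.join(tempfile.gettempdir(), "cupy_kernel_cache_demo")
os.environ["CUPY_CACHE_DIR"] = cache_dir

import numpy as np

try:
    import cupy as cp
    cp.zeros(1)          # raises if no usable CUDA device is present
    xp = cp
except Exception:
    xp = np              # assumption: CPU fallback, no kernel compilation


def timed_sum_of_squares(x):
    """Return (result, elapsed seconds) for sum(x * x)."""
    start = time.perf_counter()
    # float() copies the scalar back to the host, so the GPU work has finished.
    result = float((x * x).sum())
    return result, time.perf_counter() - start


x = xp.arange(1000, dtype=xp.float64)
first_result, first_time = timed_sum_of_squares(x)    # may include compilation
second_result, second_time = timed_sum_of_squares(x)  # reuses cached kernels
print("first call: %.6f s, second call: %.6f s" % (first_time, second_time))
```

On a GPU the first timing is expected to be larger when the cache directory is empty; the exact numbers depend on the device and toolkit.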

A list of supported attributes, properties, and methods of ndarray¶

Memory layout¶

base ctypes itemsize flags nbytes shape size strides

Data type¶

dtype

Other attributes¶

Array conversion¶

tolist() tofile() dump() dumps() astype() copy() view() fill()

Shape manipulation¶

reshape() transpose() swapaxes() ravel() squeeze()

Item selection and manipulation¶

take() diagonal()

Calculation¶

max() argmax() min() argmin() clip() trace() sum() mean() var() std() prod() dot()

Special methods¶

__copy__() __deepcopy__() __reduce__() __array__() __len__() __getitem__() __setitem__() __int__() __long__() __float__() __oct__() __hex__() __repr__() __str__()

Memory transfer¶

get() set()
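
A minimal sketch of how get() and set() move data between host and device. It assumes a usable CUDA device; the `else` branch is an assumption added only so the snippet still runs on a CPU-only machine.

```python
import numpy as np

try:
    import cupy as cp
    cp.zeros(1)          # raises if no usable CUDA device is present
except Exception:
    cp = None            # assumption: CPU-only stand-in below

host = np.arange(6, dtype=np.float32).reshape(2, 3)

if cp is not None:
    device = cp.asarray(host)         # host -> device copy
    doubled = (device * 2).get()      # device -> host, returns numpy.ndarray
    device.set(np.ones_like(host))    # overwrite device data from a host array
    ones_back = device.get()
else:
    doubled = host * 2
    ones_back = np.ones_like(host)
```

The array passed to set() should match the device array in shape and dtype.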

A list of supported routines of the `cupy` module¶

Array creation routines¶

empty() empty_like() eye() identity() ones() ones_like() zeros() zeros_like() full() full_like()

array() asarray() ascontiguousarray() copy()

arange() linspace()

diag() diagflat()

Binary operations¶

bitwise_and bitwise_or bitwise_xor invert left_shift right_shift

Indexing routines¶

take() diagonal()

Input and output¶

load() save() savez() savez_compressed()

array_repr() array_str()

Linear algebra¶

dot() vdot() inner() outer() tensordot()

Logic functions¶

isfinite isinf isnan

logical_and logical_or logical_not logical_xor

greater greater_equal less less_equal equal not_equal

Sorting, searching, and counting¶

argmax() argmin() count_nonzero() nonzero() flatnonzero() where()

Statistics¶

amin() amax()

mean() var() std()

Padding¶

External Functions¶

scatter_add()
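
This page lists scatter_add() by name only. Assuming it accumulates values for repeated indices in the same way as numpy.add.at (an assumption, not something stated above), the NumPy-only sketch below shows how that differs from plain indexed addition. The variable names are illustrative.

```python
import numpy as np

base = np.zeros(4, dtype=np.float32)
indices = np.array([0, 1, 1, 3])
values = np.array([1, 2, 3, 4], dtype=np.float32)

# Plain indexed addition: for the repeated index 1, only the last write survives.
plain = base.copy()
plain[indices] += values

# Accumulating addition: both values for index 1 are added.
accumulated = base.copy()
np.add.at(accumulated, indices, values)
```

Here `plain` ends up with 3 at index 1, while `accumulated` holds 5 there.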

Other¶
