理解 NumPy 的卷积

当计算一个简单的移动平均线时,numpy.convolve似乎可以完成这项工作。

问: 使用 np.convolve(values, weights, 'valid')时如何进行计算?

当医生提到 convolution product is only given for points where the signals overlap completely时,这两个信号指的是什么?

如果任何解释可以包括例子和插图,它将是非常有用的。

window = 10
weights = np.repeat(1.0, window)/window
smas = np.convolve(values, weights, 'valid')
109952 次浏览

Convolution is a mathematical operator primarily used in signal processing. Numpy simply uses this signal processing nomenclature to define it, hence the "signal" references. An array in numpy is a signal. The convolution of two signals is defined as the integral of the first signal, reversed, sweeping over ("convolved onto") the second signal and multiplied (with the scalar product) at each position of overlapping vectors. The first signal is often called the kernel, especially when it is a 2-D matrix in image processing or neural networks, and the reversal becomes a mirroring in 2-D (NOT transpose). It can more clearly be understood using the animations on wikipedia.

Convolutions have multiple definitions depending on the context. Some start the convolution when the overlap begins while others start when the overlap is only partial. In the case of numpy's "valid" mode, the overlap is specified to be always complete. It is called "valid" since every value given in the result is done without data extrapolation.

For instance, if your array X have a length of 2 and your array Y have a length of 4, the convolution of X onto Y in "valid" mode will give you an array of length 3.

First step, for X = [4 3] and Y = [1 1 5 5]:

[3 4]                   (X is reversed from [4 3] to [3 4], see note)
[1 1 5 5]
= 3 * 1 + 4 * 1 = 7

Note: If X was not reversed, the operation would be called a cross-correlation instead of a convolution.

Second Step:

  [3 4]
[1 1 5 5]
= 3 * 1 + 4 * 5 = 23

Third step:

    [3 4]
[1 1 5 5]
= 3 * 5 + 4 * 5 = 35

The result of the convolution for mode "valid" would then be [7 23 35].

If the overlap is be specified as one single data point (as the case in mode "full"), the result would have given you an array of length 5. The first step being:

[3 4]
[1 1 5 5]
= 3 * undefined (extrapolated as 0) + 4 * 1 = 4

And so on. More extrapolation modes exist.

It is notable also that the kernel is "centered" in the sense that indices for the kernel are taken with respect to the centre element of the array. In other words, for arrays with index starting at 0 (as in python), the function B = np.convolve (A, K) computes

B[k] = \sum_i A[i]   K[k - i + m]

where m = (len(K) - 1)//2 (integer division). This is an integer, also when len(K) is even.

The summation is nominally over all values of i from -∞ to ∞, where values of A out of range are assumed equal to zero. The same is true for values of the kernel. For np.convolution2D, you have to option of using the mode, boundary and fillvalue options to specify how values of A out of range are treated.

Thus, for example, you get different answers for np.convolve(A, K) if K = np.array([1, 2, 3]) or K = np.array([1, 2, 3, 0, 0])