首頁猿問 numpy...

numpy 在另一個數組中創建最大連續對的數組

Python

白豬掌柜的 2023-08-08 10:32:18

我有一個 numpy 數組：A = np.array([8, 2, 33, 4, 3, 6])我想要的是創建另一個數組 B，其中每個元素是 A 中 2 個連續對的成對最大值，所以我得到：B = np.array([8, 33, 33, 4, 6])關于如何實施有什么想法嗎？關于如何對兩個以上的元素實現這一點有什么想法嗎？（同樣的事情，但對于連續的 n 個元素）編輯：答案給了我一種解決這個問題的方法，但是對于n大小的窗口情況，是否有一種不需要循環的更有效的方法？編輯2：事實證明，這個問題相當于詢問如何對窗口大小為 n 的列表執行 1d 最大池化。有誰知道如何有效地實施這一點？

查看完整描述

6 回答

Qyouu

TA貢獻1786條經驗獲得超11個贊

成對問題的一種解決方案是使用np.maximum函數和數組切片：

B?=?np.maximum(A[:-1],?A[1:])

反對回復 2023-08-08

白衣非少年

TA貢獻1155條經驗獲得超0個贊

無循環解決方案是max在由以下命令創建的窗口上使用skimage.util.view_as_windows：

list(map(max, view_as_windows(A, (2,))))

[8, 33, 33, 4, 6]

復制/粘貼示例：

import numpy as np

from skimage.util import view_as_windows

A = np.array([8, 2, 33, 4, 3, 6])

list(map(max, view_as_windows(A, (2,))))

反對回復 2023-08-08

蕪湖不蕪

TA貢獻1796條經驗獲得超7個贊

在本次問答中，我們基本上要求滑動最大值。之前已經探討過這一點 -?NumPy 數組中滑動窗口中的 Max。由于我們希望提高效率，因此我們可以看得更遠。其中之一是numba，這里有兩個最終變體，我最終得到了杠桿parallel指令，它比沒有版本提高了性能：

import numpy as np

from numba import njit, prange

@njit(parallel=True)

def numba1(a, W):

? ? L = len(a)-W+1

? ? out = np.empty(L, dtype=a.dtype)

? ? v = np.iinfo(a.dtype).min

? ? for i in prange(L):

? ? ? ? max1 = v

? ? ? ? for j in range(W):

? ? ? ? ? ? cur = a[i + j]

? ? ? ? ? ? if cur>max1:

? ? ? ? ? ? ? ? max1 = cur? ? ? ? ? ? ? ??

? ? ? ? out[i] = max1

? ? return out?

@njit(parallel=True)

def numba2(a, W):

? ? L = len(a)-W+1

? ? out = np.empty(L, dtype=a.dtype)

? ? for i in prange(L):

? ? ? ? for j in range(W):

? ? ? ? ? ? cur = a[i + j]

? ? ? ? ? ? if cur>out[i]:

? ? ? ? ? ? ? ? out[i] = cur? ? ? ? ? ? ? ??

? ? return out?

從之前鏈接的問答中，等效的 SciPy 版本將是 -

from scipy.ndimage.filters import maximum_filter1d

def scipy_max_filter1d(a, W):

? ? L = len(a)-W+1

? ? hW = W//2 # Half window size

? ? return maximum_filter1d(a,size=W)[hW:hW+L]

標桿管理

其他發布的通用窗口 arg 的工作方法：

from skimage.util import view_as_windows

def rolling(a, window):

? ? shape = (a.size - window + 1, window)

? ? strides = (a.itemsize, a.itemsize)

? ? return np.lib.stride_tricks.as_strided(a, shape=shape, strides=strides)

# @mathfux's soln

def npmax_strided(a,n):

? ? return np.max(rolling(a, n), axis=1)

# @Nicolas Gervais's soln

def mapmax_strided(a, W):

? ? return list(map(max, view_as_windows(a,W)))

cummax = np.maximum.accumulate

def pp(a,w):

? ? N = a.size//w

? ? if a.size-w+1 > N*w:

? ? ? ? out = np.empty(a.size-w+1,a.dtype)

? ? ? ? out[:-1] = cummax(a[w*N-1::-1].reshape(N,w),axis=1).ravel()[:w-a.size-1:-1]

? ? ? ? out[-1] = a[w*N:].max()

? ? else:

? ? ? ? out = cummax(a[w*N-1::-1].reshape(N,w),axis=1).ravel()[:w-a.size-2:-1]

? ? out[1:N*w-w+1] = np.maximum(out[1:N*w-w+1],

? ? ? ? ? ? ? ? ? ? ? ? ? ? cummax(a[w:w*N].reshape(N-1,w),axis=1).ravel())

? ? out[N*w-w+1:] = np.maximum(out[N*w-w+1:],cummax(a[N*w:]))

? ? return out

使用benchit包（很少有基準測試工具打包在一起；免責聲明：我是它的作者）對建議的解決方案進行基準測試。

import benchit

funcs = [mapmax_strided, npmax_strided, numba1, numba2, scipy_max_filter1d, pp]

in_ = {(n,W):(np.random.randint(0,100,n),W) for n in 10**np.arange(2,6) for W in [2, 10, 20, 50, 100]}

t = benchit.timings(funcs, in_, multivar=True, input_name=['Array-length', 'Window-length'])

t.plot(logx=True, sp_ncols=1, save='timings.png')

因此，numba 非常適合小于的窗口大小10，此時沒有明顯的贏家，而在較大的窗口大小上，ppSciPy 則排名第二。

反對回復 2023-08-08

紫衣仙女

TA貢獻1839條經驗獲得超15個贊

這是專門為較大窗戶設計的方法。窗口大小為 O(1)，數據大小為 O(n)。

我已經完成了純 numpy 和 pythran 實現。

我們如何在窗口大小上實現 O(1) ？我們使用“鋸齒”技巧：如果 w 是窗口寬度，我們將數據分為很多 w ，對于每個組，我們從左到右和從右到左求累積最大值。任何中間窗口的元素分布在兩組中，并且交集的最大值位于我們之前計算的累積最大值之中。因此，每個數據點總共需要 3 次比較。

benchit w=100；我的函數是 pp (numpy) 和 winmax (pythran)：

對于小窗口尺寸 w=5，圖像更加均勻。有趣的是，即使對于非常小的尺寸，pythran 仍然具有巨大的優勢。他們必須采取正確的措施來最大限度地減少呼叫開銷。

蟒蛇代碼：

cummax = np.maximum.accumulate

def pp(a,w):

? ? N = a.size//w

? ? if a.size-w+1 > N*w:

? ? ? ? out = np.empty(a.size-w+1,a.dtype)

? ? ? ? out[:-1] = cummax(a[w*N-1::-1].reshape(N,w),axis=1).ravel()[:w-a.size-1:-1]

? ? ? ? out[-1] = a[w*N:].max()

? ? else:

? ? ? ? out = cummax(a[w*N-1::-1].reshape(N,w),axis=1).ravel()[:w-a.size-2:-1]

? ? out[1:N*w-w+1] = np.maximum(out[1:N*w-w+1],

? ? ? ? ? ? ? ? ? ? ? ? ? ? cummax(a[w:w*N].reshape(N-1,w),axis=1).ravel())

? ? out[N*w-w+1:] = np.maximum(out[N*w-w+1:],cummax(a[N*w:]))

? ? return out

pythran 版本；編譯用pythran -O3 <filename.py>; 這將創建一個可以導入的已編譯模塊：

import numpy as np

# pythran export winmax(float[:],int)

# pythran export winmax(int[:],int)

def winmax(data,winsz):

? ? N = data.size//winsz

? ? if N < 1:

? ? ? ? raise ValueError

? ? out = np.empty(data.size-winsz+1,data.dtype)

? ? nxt = winsz

? ? for j in range(winsz,data.size):

? ? ? ? if j == nxt:

? ? ? ? ? ? nxt += winsz

? ? ? ? ? ? out[j+1-winsz] = data[j]

? ? ? ? else:

? ? ? ? ? ? out[j+1-winsz] = out[j-winsz] if out[j-winsz]>data[j] else data[j]

? ? running = data[-winsz:N*winsz].max()

? ? nxt -= winsz << (nxt > data.size)

? ? for j in range(data.size-winsz,0,-1):

? ? ? ? if j == nxt:

? ? ? ? ? ? nxt -= winsz

? ? ? ? ? ? running = data[j-1]

? ? ? ? else:

? ? ? ? ? ? running = data[j] if data[j] > running else running

? ? ? ? ? ? out[j] = out[j] if out[j] > running else running

? ? out[0] = data[0] if data[0] > running else running

? ? return out

反對回復 2023-08-08

紅糖糍粑

TA貢獻1815條經驗獲得超6個贊

如果存在連續的n項目，則擴展解決方案需要循環：

np.maximum(*[A[i:len(A)-n+i+1] for i in range(n)])

為了避免這種情況，您可以使用跨步技巧并將其轉換A為n-length 塊的數組：

def rolling(a, window):

shape = (a.size - window + 1, window)

strides = (a.itemsize, a.itemsize)

return np.lib.stride_tricks.as_strided(a, shape=shape, strides=strides)

例如：

>>> rolling(A, 3)

array([[ 8, 2, 8],

[ 2, 8, 33],

[ 8, 33, 33],

[33, 33, 4]])

完成后你可以用殺死它np.max(rolling(A, n), axis=1)。

盡管它很優雅，但這個解決方案和第一個解決方案都不是高效的，因為我們在僅相差兩項的相鄰塊上重復應用最大值。

反對回復 2023-08-08

牛魔王的故事

TA貢獻1830條經驗獲得超3個贊

對于所有 n 的遞歸解決方案

import numpy as np

import sys

def recursive(a: np.ndarray, n: int, b=None, level=2):

if n <= 0 or n > len(a):

raise ValueError(f'len(a):{len(a)} n:{n}')

if n == 1:

return a

if len(a) == n:

return np.max(a)

b = np.maximum(a[:-1], a[1:]) if b is None else np.maximum(a[level - 1:], b)

if n == level:

return b

return recursive(a, n, b[:-1], level + 1)

test_data = np.array([8, 2, 33, 4, 3, 6])

for test_n in range(1, len(test_data) + 2):

try:

print(recursive(test_data, n=test_n))

except ValueError as e:

sys.stderr.write(str(e))

輸出

[ 8 2 33 4 3 6]

[ 8 33 33 4 6]

[33 33 33 6]

[33 33 33]

[33 33]

len(a):6 n:7

關于遞歸函數

你可以觀察下面的數據，然后你就會知道如何寫遞歸函數了。

"""

np.array([8, 2, 33, 4, 3, 6])

n=2: (8, 2), (2, 33), (33, 4), (4, 3), (3, 6) => [8, 33, 33, 4, 6] => B' = [8, 33, 33, 4]

n=3: (8, 2, 33), (2, 33, 4), (33, 4, 3), (4, 3, 6) => B' [33, 4, 3, 6] => np.maximum([8, 33, 33, 4], [33, 4, 3, 6]) => 33, 33, 33, 6

...

"""

反對回復 2023-08-08

6 回答
0 關注
249 瀏覽

關注

添加回答

舉報

0/150

提交

取消

亚洲在线久爱草,狠狠天天香蕉网,天天搞日日干久草,伊人亚洲日本欧美

熱搜

最近搜索清空

numpy 在另一個數組中創建最大連續對的數組

numpy 在另一個數組中創建最大連續對的數組

6 回答

添加回答