陣列分配的CPU和GPU功能在NUMBA

2017-10-21 62 views -1 likes

-1

我想寫一些函數在numba，我可以交換使用不同的目標（cpu，cuda，並行）。我遇到的萬阿英，蔣達清是一個新的數組的分配是CUDA設備代碼，例如，不同：陣列分配的CPU和GPU功能在NUMBA

cuda.local.array(shape, dtype)

對比做了CPU的功能類似，即

np.empty(shape, dtype)

是否有聰明的方式如何處理這個，而不必編寫單獨的功能？

來源

2017-10-21 Philipp Eller

難道你不能在你的函數中測試類型嗎？ – 0TTT0

問題是我不能在我的函數中有任何聲明不起作用，因爲numba編譯代碼並且嚇壞了。否則，我會做一個簡單的if/else或這樣的。處理這種情況的自然方式是C中的預處理器指令，但是沒有這樣的東西可用於python –

回答

我發現問題的一個骯髒的解決方法。這是我能做到的唯一方法。使用@myjit裝飾器代替@jit和@cuda.jit，並將所有陣列分配爲cuda.local.array。

def myjit(f): 
''' 
f : function 
Decorator to assign the right jit for different targets 
In case of non-cuda targets, all instances of `cuda.local.array` 
are replaced by `np.empty`. This is a dirty fix, hopefully in the 
near future numba will support numpy array allocation and this will 
not be necessary anymore 
''' 
if target == 'cuda': 
    return cuda.jit(f, device=True) 
else: 
    source = inspect.getsource(f).splitlines() 
    assert '@myjit' in source[0] 
    source = '\n'.join(source[1:]) + '\n' 
    source = source.replace('cuda.local.array', 'np.empty') 
    exec(source) 
    fun = eval(f.__name__) 
    newfun = jit(fun, nopython=True) 
    # needs to be exported to globals 
    globals()[f.__name__] = newfun 
    return newfun

來源

2017-10-31 16:08:42

相關問題

1. CUDA：在GPU上分配2D陣列
2. Tensorflow：在GPU和CPU
3. ç賽格故障分配通過功能的陣列時和打印陣列
4. 功能輸入的Numba jitclass
5. CPU和GPU與Theano
6. iPhone/iPod touch CPU/GPU性能
7. Numba不加速功能
8. Tensorflow在使用tf.device（'/ cpu：0'）時分配GPU內存
9. numba guvectorize target ='parallel'slow than target ='cpu'
10. 陣列的功能
11. 常見算法的GPU與CPU性能
12. cpu vs gpu - CPU好時
13. 分配功能
14. 符合OpenCL標準的CPU/GPU列表
15. 陣列時功能
16. 的JavaScript - 分配 - 陣列陣列
17. GPU內存分配
18. 在C++分配陣列
19. CPU和GPU相同的代碼庫
20. Keras中CPU和GPU的混合使用
21. Intel HD GPU vs Intel CPU性能比較
22. TensorFlow：圖形優化（GPU vs CPU性能）
23. 函數獲取陣列和功能的元件陣列
24. 不能分配到陣列-vba
25. 列表陣列功能
26. MPI-2 CPU VS GPU
27. GPU + CPU Tensorflow訓練
28. C++ Amp GPU數據在刪除指針陣列GPU（動態分配）的數據後不會自由
29. CPU核心線程分類功能
30. Tensorflow GPU /多GPU如何分配內存？