-
由 Sebastian Ohlmann 创作于
This is recommended by the CUDA best practices guide, see https://docs.nvidia.com/cuda/cuda-c-best-practices-guide/index.html#thread-and-block-heuristics Make sure that the limit of threads per block for the corresponding kernel is respected (though it is unlikely to be smaller than 256).
0eb428c9