该项目从 https://gitlab.com/octopus-code/octopus.git 镜像。
Pull mirroring failed .
由于尝试失败次数过多,仓库镜像已暂停,可以由项目维护者或所有者恢复。
上次成功更新 。
由于尝试失败次数过多,仓库镜像已暂停,可以由项目维护者或所有者恢复。
上次成功更新 。
- 5月 24, 2024
-
-
由 Sebastian Ohlmann 创作于
Print out the kernel name and also some more detail about the dimensions that are problematic.
-
- 4月 16, 2024
-
-
由 Sebastian Ohlmann 创作于
-
由 Sebastian Ohlmann 创作于
-
由 Sebastian Ohlmann 创作于
This does not impact performance, but helps in debugging
-
- 4月 09, 2024
-
-
由 Sebastian Ohlmann 创作于
Co-authored-by:
Cristian Le <cristian.le@mpsd.mpg.de>
-
- 4月 04, 2024
-
-
由 Sebastian Ohlmann 创作于
-
由 Sebastian Ohlmann 创作于
-
由 Sebastian Ohlmann 创作于
This is recommended by the CUDA best practices guide, see https://docs.nvidia.com/cuda/cuda-c-best-practices-guide/index.html#thread-and-block-heuristics Make sure that the limit of threads per block for the corresponding kernel is respected (though it is unlikely to be smaller than 256).
-
由 Nicolas Tancogne-Dejean 创作于
causing an error. This is now properly captured by the code.
-
- 2月 06, 2024
-
-
由 Henri Menke 创作于
-
- 12月 19, 2023
-
-
由 Alex Buccheri 创作于
Replace `kind_oct_m` with `intrinsic :: iso_fortran_env`. Access to the fortran intrinsics is also available through `global_oct_m`.
-
由 Alex Buccheri 创作于
Replace all instances of _i4, _i8, r4 and _r8. Note, this affects routine names as well as precision suffixes. Update OPTION variables to also use standard fortran precision, _int64
-
由 Alex Buccheri 创作于
Replace declarations using kinds aliases with fortran standard variables. Note, formatting has not been preserved, but it's only cosmetic.
-
- 12月 18, 2023
-
-
由 Alex Buccheri 创作于
-
- 5月 10, 2023
-
-
... messages_print_with_emphasis
-
- 3月 08, 2023
-
-
由 NicolasTD 创作于
-
- 3月 01, 2023
-
-
由 NicolasTD 创作于
The use a complex as two doubles required to have the state index running first in the GPU kernel.
-
- 2月 01, 2023
- 1月 31, 2023
- 1月 10, 2023
-
-
由 NicolasTD 创作于
-
- 1月 03, 2023
-
-
由 Nicolas Tancogne-Dejean 创作于
The routine was not working for integers, causing the code to stop is one tries to initialize all GPU buffers to zero.
-
- 12月 13, 2022
-
-
由 Martin Lueders 创作于
-
- 12月 12, 2022
-
-
由 Martin Lueders 创作于
-
- 12月 08, 2022
-
-
由 Martin Lueders 创作于
-
- 12月 07, 2022
-
-
由 Martin Lueders 创作于
-
由 Martin Lueders 创作于
-
- 11月 24, 2022
-
-
由 Martin Lueders 创作于
-
由 Martin Lueders 创作于
-
- 10月 11, 2022
-
-
由 Sebastian Ohlmann 创作于
Also show the host name. This allows easier identification of ranks and devices to hosts. Also get rid of the leading zeros in this output.
-
- 9月 28, 2022
-
-
由 Nicolas Tancogne-Dejean 创作于
Batchifying the calculation of the commutator of the position operator and the DFT+U(+V) term. This allows to compute the current fully batchified.
-
- 9月 15, 2022
-
-
由 Nicolas Tancogne-Dejean 创作于
-
由 Nicolas Tancogne-Dejean 创作于
-
- 8月 17, 2022
-
-
由 Sebastian Ohlmann 创作于
-
由 Sebastian Ohlmann 创作于
Avoid potential integer overflows when computing the product of pack_size along its dimensions or with other, potentially large numbers. Add some more low-level routines to allow calling with i4 and i8 integers.
-
由 Sebastian Ohlmann 创作于
For large batch sizes, the multiplication of the number of grid points with the number of states in a batch could overflow. Change the integers involved to 8-byte.
-