该项目从 https://github.com/microsoft/DeepSpeed 镜像。
Pull mirroring failed .
由于尝试失败次数过多,仓库镜像已暂停,可以由项目维护者或所有者恢复。
上次成功更新 。
由于尝试失败次数过多,仓库镜像已暂停,可以由项目维护者或所有者恢复。
上次成功更新 。
- 9月 16, 2021
-
-
由 Stas Bekman 创作于
Co-authored-by:
Olatunji Ruwase <olruwase@microsoft.com>
-
由 Stas Bekman 创作于
* [zero Init] fix regression * clean up the warning
-
- 9月 15, 2021
-
-
由 Sean Naren 创作于
-
- 9月 14, 2021
-
-
由 Jeff Rasley 创作于
-
- 9月 13, 2021
-
-
由 Anurag Kumar 创作于
updated classifiers
-
- 9月 10, 2021
-
-
由 eltonzheng 创作于
-
由 Ammar Ahmad Awan 创作于
Co-authored-by:
Olatunji Ruwase <olruwase@microsoft.com>
-
由 Jeff Rasley 创作于
* pass GAS boundary state from PP -> ZeRO * formatting Co-authored-by:
Olatunji Ruwase <olruwase@microsoft.com>
-
由 Jeff Rasley 创作于
-
由 Jeff Rasley 创作于
-
由 Hyunwoong Ko 创作于
-
- 9月 09, 2021
-
-
由 Aswin John Mathews 创作于
* Added 4-byte alignment on NCCL/RCCL * pre-commit formatting fixes * Fix for checkpoint loading with optimizer partitioning * Better assert print * Added unit tests for nccl/rccl 4-byte alignment * bug * Updated alignment to implicit Co-authored-by:
Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com>
-
由 Jeff Rasley 创作于
-
- 9月 08, 2021
-
-
由 Jeff Rasley 创作于
-
- 9月 02, 2021
-
-
由 Jeff Rasley 创作于
-
由 Jeff Rasley 创作于
-
由 Jeff Rasley 创作于
-
- 9月 01, 2021
-
-
由 Jeff Rasley 创作于
-
由 Olatunji Ruwase 创作于
-
由 Olatunji Ruwase 创作于
-
由 Hari Prasad 创作于
* Added drop_last to DeepSpeedDataLoader This solves issue #326 * Updated drop_last in engine.py added drop_last as a ds_config as mentioned by @tjruwase * Update engine.py * Update engine.py * updated config.py and constants.py * Update constants.py * added dataloader_ prefix * Update dataloader.py * corrected yapf test errors * Update test_data.py Added dataloader_drop_last unit test * Corrected yapf and formatting issues * updated simple_model.py and test_data.py * Update simple_model.py * pre-commit fix * corrected issues * Update test_data.py * Update test_data.py * Update test_data.py * Update test_data.py * removed batch_size from test_data.py * Update simple_model.py * Update test_data.py * Update test_data.py * Fix unit test issues * Use fp32 to make things work Co-authored-by:
Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com>
-
- 8月 31, 2021
-
-
由 Ammar Ahmad Awan 创作于
* Remove the wrong function with duplicate name * fix format. * add mpu check. fix tests.
-
- 8月 30, 2021
-
-
由 Stas Bekman 创作于
Co-authored-by:
Jeff Rasley <jerasley@microsoft.com>
-
- 8月 28, 2021
-
-
由 Olatunji Ruwase 创作于
-
- 8月 27, 2021
-
-
由 Olatunji Ruwase 创作于
* Rename PA_TO_cpu * Code cleanup * Revert accidental change
-
由 Reza Yazdani 创作于
Co-authored-by:
Olatunji Ruwase <olruwase@microsoft.com>
-
由 Reza Yazdani 创作于
* add more synchronizations and barriers for resolving gpu-halt issue * removing unuseful broadcasts
-
- 8月 26, 2021
-
-
由 Jeff Rasley 创作于
-
由 Reza Yazdani 创作于
Co-authored-by:
Jeff Rasley <jerasley@microsoft.com>
-
- 8月 25, 2021
-
-
由 Olatunji Ruwase 创作于
* Callable option for optimizer and scheduler * Add unit test * Formatting * Disable debug prints * Use base optimizer to construct lr scheduler * Formatting * Remove dead import
-
由 Jeff Rasley 创作于
* restore fp16 params if no zero ckpts available * formatting
-
- 8月 19, 2021
-
-
由 Jeff Rasley 创作于
-
- 8月 18, 2021
-
-
由 Jeff Rasley 创作于
-
由 Jeff Rasley 创作于
-
由 Pruthvi Madugundu 创作于
-
- 8月 17, 2021
-
-
由 Jeff Rasley 创作于
-
由 Jeff Rasley 创作于
-
由 Jeff Rasley 创作于
-
由 Ammar Ahmad Awan 创作于
Co-authored-by:
Alex Muzio <Alex.Muzio@microsoft.com> Co-authored-by:
Ammar Ahmad Awan <ammar.awan@microsoft.com> Co-authored-by:
Conglong Li <conglong.li@gmail.com> Co-authored-by:
Felipe Cruz Salinas <Andres.Cruz@microsoft.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com> Co-authored-by:
Reza Yazdani <reyazda@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com> Co-authored-by:
Shaden Smith <shaden.smith@microsoft.com> Co-authored-by:
Young Jin Kim <youki@microsoft.com> Co-authored-by:
bapatra <bapatra@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com> Co-authored-by:
Shaden Smith <shaden.smith@microsoft.com> Co-authored-by:
Young Jin Kim <youki@microsoft.com>
-
- 8月 16, 2021
-
-
由 Conglong Li 创作于
Co-authored-by:
Conglong Li <conglong.li@gmail.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com>
-