This project is mirrored from https://github.com/hpcaitech/ColossalAI.git.
- Jun 20, 2024
  - Authored by Hongxin Liu
- Jun 19, 2024
  - Authored by Yuanheng Zhao
    * fix glide llama model
    * revise
  - Authored by Guangyao Zhang
  - Authored by Guangyao Zhang
- Jun 18, 2024
  - Authored by Kai Lv
  - Authored by Guangyao Zhang
    [shardformer] Support the Command-R model
  - Authored by Edenzzzz
    * add to sidebar
    * fix chinese
  - Authored by GuangyaoZhang
  - Authored by GuangyaoZhang
  - Authored by pre-commit-ci[bot]
    for more information, see https://pre-commit.ci
  - Authored by GuangyaoZhang
  - Authored by GuangyaoZhang
  - Authored by GuangyaoZhang
  - Authored by GuangyaoZhang
  - Authored by GuangyaoZhang
  - Authored by GuangyaoZhang
  - Authored by GuangyaoZhang
  - Authored by GuangyaoZhang
- Jun 17, 2024
  - Authored by Edenzzzz
    * support tp + sp + pp
    * remove comments
    ---------
    Co-authored-by: Edenzzzz <wtan45@wisc.edu>
  - Authored by GuangyaoZhang
- Jun 14, 2024
  - Authored by GuangyaoZhang
  - Authored by GuangyaoZhang
  - Authored by GuangyaoZhang
  - Authored by pre-commit-ci[bot]
    for more information, see https://pre-commit.ci
  - Authored by GuangyaoZhang
  - Authored by GuangyaoZhang
  - Authored by GuangyaoZhang
  - Authored by flybird11111
    * [shardformer]upgrade transformers for gpt2/gptj/whisper (#5807)
    * [shardformer] fix modeling of gpt2 and gptj
    * [shardformer] fix whisper modeling
    * [misc] update requirements
    ---------
    Co-authored-by: ver217 <lhx0217@gmail.com>
    * [shardformer]upgrade transformers for mistral (#5808)
    * upgrade transformers for mistral
    * fix
    * fix
    * [shardformer]upgrade transformers for llama (#5809)
    * update transformers fix
    * fix
    * fix
    * [inference] upgrade transformers (#5810)
    * update transformers fix
    * fix
    * fix
    * fix
    * fix
    * [gemini] update transformers for gemini (#5814)
    ---------
    Co-authored-by: ver217 <lhx0217@gmail.com>
- Jun 13, 2024
  - Authored by botbw
    * [gemini] quick fix on possible async operation
    * [gemini] quick fix on possible async operation
- Jun 12, 2024
  - Authored by Haze188
    * use async stream to prefetch and h2d data moving
    * Remove redundant code
  - Authored by Li Xingjian
    * Fix torch int32 dtype
      Signed-off-by: char-1ee <xingjianli59@gmail.com>
    * Fix flash-attn import
      Signed-off-by: char-1ee <xingjianli59@gmail.com>
    * Add generalized model test
      Signed-off-by: char-1ee <xingjianli59@gmail.com>
    * Remove exposed path to model
      Signed-off-by: char-1ee <xingjianli59@gmail.com>
    * Add default value for use_flash_attn
      Signed-off-by: char-1ee <xingjianli59@gmail.com>
    * Rename model test
      Signed-off-by: char-1ee <xingjianli59@gmail.com>
    ---------
    Signed-off-by: char-1ee <xingjianli59@gmail.com>
  - Authored by Guangyao Zhang
- Jun 11, 2024
  - Authored by Hongxin Liu
  - Authored by Hongxin Liu
  - Authored by YeAnbang
    [ColossalChat] Colossalchat upgrade
  - Authored by Runyu Lu
    * refactor baichuan
    * remove unused code and add TODO for lazyinit
  - Authored by YeAnbang
- Jun 10, 2024
  - Authored by Li Xingjian
    [Inference] Refactor modeling attention layer by abstracting attention backends
  - Authored by char-1ee
    Signed-off-by: char-1ee <xingjianli59@gmail.com>
  - Authored by YeAnbang