Commit a6f533b2
Authored 7 years ago by Martin Kroeker
Committed by GitHub 7 years ago
Revert "Fix calculated range limit exceeding actual data size for last thread"
Parent: e70a6b92
Changes: 3 files changed, 0 additions and 6 deletions
driver/level2/gbmv_thread.c  +0 -1
driver/level2/sbmv_thread.c  +0 -2
driver/level2/tbmv_thread.c  +0 -3
driver/level2/gbmv_thread.c (+0 -1)
@@ -233,7 +233,6 @@ int CNAME(BLASLONG m, BLASLONG n, BLASLONG ku, BLASLONG kl, FLOAT *alpha, FLOAT
 #else
     range_m[num_cpu] = num_cpu * ((n + 15) & ~15);
 #endif
-    if (range_m[num_cpu] > n) range_m[num_cpu] = n;
     queue[num_cpu].mode = mode;
     queue[num_cpu].routine = gbmv_kernel;
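The expression `(n + 15) & ~15` in the context lines rounds `n` up to the next multiple of 16, so the value stored in `range_m[num_cpu]` grows in steps of that padded length. The following is a minimal sketch of the arithmetic only, assuming `long` as a stand-in for `BLASLONG`; the helper names `round_up_16`, `gbmv_range_value` and `gbmv_range_value_clamped` are hypothetical and do not exist in OpenBLAS.

```c
#include <assert.h>

/* Round n up to the next multiple of 16 (hypothetical helper). */
long round_up_16(long n) {
    assert(n >= 0);            /* sketch only covers non-negative sizes */
    return (n + 15) & ~15L;
}

/* Value assigned to range_m[num_cpu] in the #else branch of the hunk. */
long gbmv_range_value(long num_cpu, long n) {
    return num_cpu * round_up_16(n);
}

/* Same value with the clamp that this commit removes applied on top. */
long gbmv_range_value_clamped(long num_cpu, long n) {
    long v = gbmv_range_value(num_cpu, n);
    return v > n ? n : v;
}
```

For example, with `n = 100` the padded length is 112, so `gbmv_range_value(2, 100)` is 224, while the clamped variant returns 100.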
driver/level2/sbmv_thread.c (+0 -2)
@@ -246,7 +246,6 @@ int CNAME(BLASLONG n, BLASLONG k, FLOAT *alpha, FLOAT *a, BLASLONG lda, FLOAT *x
     range_m[MAX_CPU_NUMBER - num_cpu - 1] = range_m[MAX_CPU_NUMBER - num_cpu] - width;
     range_n[num_cpu] = num_cpu * (((n + 15) & ~15) + 16);
-    if (range_n[num_cpu] > n) range_n[num_cpu] = n;
     queue[num_cpu].mode = mode;
     queue[num_cpu].routine = sbmv_kernel;
@@ -286,7 +285,6 @@ int CNAME(BLASLONG n, BLASLONG k, FLOAT *alpha, FLOAT *a, BLASLONG lda, FLOAT *x
     range_m[num_cpu + 1] = range_m[num_cpu] + width;
     range_n[num_cpu] = num_cpu * (((n + 15) & ~15) + 16);
-    if (range_n[num_cpu] > n) range_n[num_cpu] = n;
     queue[num_cpu].mode = mode;
     queue[num_cpu].routine = sbmv_kernel;
driver/level2/tbmv_thread.c (+0 -3)
@@ -288,7 +288,6 @@ int CNAME(BLASLONG n, BLASLONG k, FLOAT *a, BLASLONG lda, FLOAT *x, BLASLONG inc
     range_m[MAX_CPU_NUMBER - num_cpu - 1] = range_m[MAX_CPU_NUMBER - num_cpu] - width;
     range_n[num_cpu] = num_cpu * (((n + 15) & ~15) + 16);
-    if (range_n[num_cpu] > n) range_n[num_cpu] = n;
     queue[num_cpu].mode = mode;
     queue[num_cpu].routine = trmv_kernel;
@@ -328,7 +327,6 @@ int CNAME(BLASLONG n, BLASLONG k, FLOAT *a, BLASLONG lda, FLOAT *x, BLASLONG inc
     range_m[num_cpu + 1] = range_m[num_cpu] + width;
     range_n[num_cpu] = num_cpu * (((n + 15) & ~15) + 16);
-    if (range_n[num_cpu] > n) range_n[num_cpu] = n;
     queue[num_cpu].mode = mode;
     queue[num_cpu].routine = trmv_kernel;
@@ -358,7 +356,6 @@ int CNAME(BLASLONG n, BLASLONG k, FLOAT *a, BLASLONG lda, FLOAT *x, BLASLONG inc
     range_m[num_cpu + 1] = range_m[num_cpu] + width;
     range_n[num_cpu] = num_cpu * (((n + 15) & ~15) + 16);
-    if (range_n[num_cpu] > n) range_n[num_cpu] = n;
     queue[num_cpu].mode = mode;
     queue[num_cpu].routine = trmv_kernel;
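The hunks in sbmv_thread.c and tbmv_thread.c all assign `num_cpu * (((n + 15) & ~15) + 16)` to `range_n[num_cpu]`. For any `num_cpu >= 1` that product is strictly greater than `n`, so the removed `if` line would have set every such entry to `n`. The sketch below only illustrates that arithmetic and makes no claim about the stated reason for the revert. It assumes `long` in place of `BLASLONG`, and the function names `padded_offset`, `clamped_offset` and `count_distinct` are hypothetical, not OpenBLAS APIs.

```c
#include <assert.h>

/* Value assigned to range_n[num_cpu] in the sbmv/tbmv hunks. */
long padded_offset(long num_cpu, long n) {
    assert(num_cpu >= 0 && n >= 0);
    return num_cpu * (((n + 15) & ~15L) + 16);
}

/* The same value after the removed line: if (v > n) v = n; */
long clamped_offset(long num_cpu, long n) {
    long v = padded_offset(num_cpu, n);
    return v > n ? n : v;
}

/* Count how many distinct values the first nthreads entries take.
   Both sequences are non-decreasing, so comparing neighbours suffices. */
long count_distinct(long nthreads, long n, int clamp) {
    long distinct = 0;
    long prev = -1;            /* offsets are never negative */
    for (long i = 0; i < nthreads; i++) {
        long v = clamp ? clamped_offset(i, n) : padded_offset(i, n);
        if (v != prev) distinct++;
        prev = v;
    }
    return distinct;
}
```

With `n = 100` and four threads, the unclamped values are 0, 128, 256 and 384, whereas the clamped values are 0, 100, 100 and 100.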