AKG (Auto Kernel Generator) is an optimizer for operators in deep learning networks. It can automatically fuse ops that match specific patterns. AKG works with MindSpore-GraphKernel to improve the performance of networks running on different hardware backends.
AKG consists of three basic optimization modules: normalization, auto schedule, and backend optimization.
Normalization. Polyhedral techniques can only process static linear programs, which limits what they can express, so the computation IR needs to be normalized first. The main optimizations in the normalization module include auto-inline, loop fusion, common subexpression elimination, and so on.
Auto schedule. Based on polyhedral technology, the auto schedule module mainly provides auto-vectorization, auto-tiling, thread/block mapping, dependency analysis, and memory promotion.
Backend optimization. The backend optimization module mainly consists of TensorCore acceleration, double buffer optimization, storage flatten optimization, and inject sync optimization.
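To make two of the transformations named above more concrete, here is a minimal hand-written sketch of loop tiling and storage flattening applied to an element-wise add. It is illustrative only: the function names (`flat_index`, `add_untiled`, `add_tiled`) and the tile sizes are assumptions made for this example and are not part of the AKG API. AKG derives such schedules automatically rather than by hand.

```python
# Illustrative only: hand-written versions of two transformations.
# None of these functions belong to AKG; all names are hypothetical.


def flat_index(i, j, num_cols):
    """Storage flatten: map a 2-D index (i, j) to a row-major 1-D offset."""
    return i * num_cols + j


def add_untiled(a, b, rows, cols):
    """Reference version: element-wise add over flat row-major buffers."""
    out = [0] * (rows * cols)
    for i in range(rows):
        for j in range(cols):
            k = flat_index(i, j, cols)
            out[k] = a[k] + b[k]
    return out


def add_tiled(a, b, rows, cols, tile_r, tile_c):
    """Tiled version: the same computation, visited tile by tile."""
    out = [0] * (rows * cols)
    for i0 in range(0, rows, tile_r):          # outer loops walk over tiles
        for j0 in range(0, cols, tile_c):
            # min() clips edge tiles when the sizes do not divide evenly
            for i in range(i0, min(i0 + tile_r, rows)):
                for j in range(j0, min(j0 + tile_c, cols)):
                    k = flat_index(i, j, cols)
                    out[k] = a[k] + b[k]
    return out


rows, cols = 5, 7
a = list(range(rows * cols))
b = [2 * x for x in a]

# Tiling changes the iteration order, not the result.
assert add_tiled(a, b, rows, cols, tile_r=2, tile_c=3) == add_untiled(a, b, rows, cols)
```

The tiled loop nest touches one small block of the buffers at a time, which is the property a scheduler relies on when it maps tiles to on-chip memory or to GPU thread blocks.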
At present, Ascend910, NVIDIA V100/A100, and CPU are supported. More backends are planned.
To build AKG together with MindSpore, see MindSpore README.md for details.
We suggest building and running AKG together with MindSpore. For convenience, we also provide a way to run cases in standalone mode. Refer to MindSpore Installation for more information about compilation dependencies.
Build on Ascend910
git-lfs needs to be installed before cloning the source code.
git clone https://gitee.com/mindspore/akg.git
cd akg
bash build.sh -e ascend -j8
Build on GPU
git clone https://gitee.com/mindspore/akg.git
cd akg
bash build.sh -e gpu -j8
Build on CPU
git clone https://gitee.com/mindspore/akg.git
cd akg
bash build.sh -e cpu -j8
Ascend910
cd tests
source ./test_env.sh
NVIDIA V100/A100
cd tests
source ./test_env.sh gpu
CPU
cd tests
source ./test_env.sh cpu
cd tests/st
python run.py -e gpu -o add -l level0 # run add operator on GPU
For detailed instructions, see: python run.py -h
Run a specific case:
cd tests/st/ops/
pytest -s test_abs.py -m "level0 and platform_x86_ascend_training" # run level0 testcases on Ascend
cd tests/st/ops/
pytest -s test_abs.py -m "level0 and platform_x86_gpu_training" # run level0 testcases on GPU
cd tests/st/ops/
pytest -s test_abs.py -m "level0 and platform_x86_cpu" # run level0 testcases on CPU
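The three pytest invocations above differ only in the platform marker. As a convenience, a small helper along the following lines could assemble the command for a given backend. This is a sketch, not part of the AKG test suite: `PLATFORM_MARKERS` and `pytest_command` are hypothetical names, and the marker strings are copied from the commands above.

```python
import shlex

# Marker names taken from the commands above; the mapping itself is
# a hypothetical helper, not something shipped with AKG.
PLATFORM_MARKERS = {
    "ascend": "platform_x86_ascend_training",
    "gpu": "platform_x86_gpu_training",
    "cpu": "platform_x86_cpu",
}


def pytest_command(test_file, backend, level="level0"):
    """Return the pytest argument list for one test file and backend."""
    marker = f"{level} and {PLATFORM_MARKERS[backend]}"
    return ["pytest", "-s", test_file, "-m", marker]


# Print a shell-quoted command line (shlex.join requires Python 3.8+).
print(shlex.join(pytest_command("test_abs.py", "gpu")))
```

The returned list can be passed directly to `subprocess.run` from inside `tests/st/ops/` once the environment has been set up with `test_env.sh`.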
See Wiki.
Contributions are welcome. See the MindSpore Contributor Wiki for more details.
For release notes, see our RELEASE.