diff --git a/docs/en/gcc/gcc_basic_performance_optimization_user_guide.md b/docs/en/gcc/gcc_basic_performance_optimization_user_guide.md index e65f3458ee487eb261d7f254db1aeac3b8cb4946..2d840399a920cabcc1ae32c1a105a5098b2add95 100644 --- a/docs/en/gcc/gcc_basic_performance_optimization_user_guide.md +++ b/docs/en/gcc/gcc_basic_performance_optimization_user_guide.md @@ -183,3 +183,15 @@ Enhances data flow analysis for loops with consecutive memory reads by inserting Add the options `-O3 -ftree-slp-transpose-vectorize` to enable the optimization. Note: The `-ftree-slp-transpose-vectorize` option requires `-O3` to be enabled. + +### -floop-sve-mode-opt + +#### Description + +Through static code feature analysis, identify special scenarios. When the conditions are confirmed to be met, add new SVE instruction set optimization opportunities to achieve performance improvement. + +#### Usage + +Add the options `-O3 -floop-sve-mode-opt` to enable the optimization. + +Note: The `-floop-sve-mode-opt` option requires `-O3` and sve in `-march` to be enabled. diff --git a/docs/zh/gcc/gcc_basic_performance_optimization_user_guide.md b/docs/zh/gcc/gcc_basic_performance_optimization_user_guide.md index 63e9ccb36baf4d9e5ccd2a85760f4422603ca885..e4ad3350ff876138b65f5f3ae437686b44748953 100644 --- a/docs/zh/gcc/gcc_basic_performance_optimization_user_guide.md +++ b/docs/zh/gcc/gcc_basic_performance_optimization_user_guide.md @@ -277,3 +277,15 @@ Arm 相关指令合并优化,实现32位复杂组合的64位整形乘法逻辑 #### 使用方法 在选项中加入 `-ffind-with-sve` 即可。 + +### -floop-sve-mode-opt + +#### 说明 + +通过静态代码特征分析,识别特殊场景。并在确认满足条件时新增SVE指令集优化机会,从而获得性能提升。 + +#### 使用方法 + +在选项中加入`-O3 -floop-sve-mode-opt`即可。 + +注:`-floop-sve-mode-opt`选项,需要在`-O3`开启以及`-march`中加入sve的基础上才使能。