# rocm_start_sample **Repository Path**: anjiang2020_admin/rocm_start_sample ## Basic Information - **Project Name**: rocm_start_sample - **Description**: hip rocm start sample for amd gpu - **Primary Language**: C++ - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 1 - **Created**: 2023-06-18 - **Last Updated**: 2023-06-18 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README Rocm Start Sample ==== rocm-hip on amdgpu入门示例 --- ## 1. 使用: run **"python build.py [*hip/asm*] [*v2/v3*] [*llvm/hcc*]"** under each project folder, the exacutable will generated under ./out path. ---- ## 2. 示例: ### 2.1 runtime 示例: >>[1. 设备查询](https://github.com/feifei14119/rocm_start_sample/tree/master/DeviceInfo) >>[2. 带宽测试](https://github.com/feifei14119/rocm_start_sample/tree/master/MemBandwidth) >>[3. 向量加模板](https://github.com/feifei14119/rocm_start_sample/tree/master/VectorAdd) >>[4. 矩阵转置](https://github.com/feifei14119/rocm_start_sample/tree/master/Transpose) >>[5. FFT](https://github.com/feifei14119/rocm_start_sample/tree/master/fft) ### 2.2 hip kernel function 示例: >>[1. 计时函数](https://github.com/feifei14119/rocm_start_sample/tree/master/hipClock) >>[2. 内联汇编](https://github.com/feifei14119/rocm_start_sample/tree/master/hipInlineAsm) >>[3. 原子操作](https://github.com/feifei14119/rocm_start_sample/tree/master/hipAtomic) >>[4. shuffle操作](https://github.com/feifei14119/rocm_start_sample/tree/master/hipShuffle) >>[5. vote操作](https://github.com/feifei14119/rocm_start_sample/tree/master/hipVote) ### 2.3 ISA 示例: >>[1. smem读写](https://github.com/feifei14119/rocm_start_sample/tree/master/isaSmemWr) >>[2. flat读写](https://github.com/feifei14119/rocm_start_sample/tree/master/isaFlatWr) >>[3. mubuf读写](https://github.com/feifei14119/rocm_start_sample/tree/master/isaMubufWr) >>[4. lds读写](https://github.com/feifei14119/rocm_start_sample/tree/master/isaLdsWr) >>[5. group间条件跳转](https://github.com/feifei14119/rocm_start_sample/tree/master/isaSbranch) >>[6. thread间条件执行](https://github.com/feifei14119/rocm_start_sample/tree/master/isaVbranch) >>[7. packed float16指令](https://github.com/feifei14119/rocm_start_sample/tree/master/isaPackedFp16) >>[8. dpp指令](https://github.com/feifei14119/rocm_start_sample/tree/master/isaDpp) >>[9. permute指令](https://github.com/feifei14119/rocm_start_sample/tree/master/isaPermute) >>[10. mfma指令](https://github.com/feifei14119/rocm_start_sample/tree/master/isaMfma) ### 2.4 性能优化示例: >>[1. 指令发射](https://github.com/feifei14119/rocm_start_sample/tree/master/instrIssue) >>[2. vmem指令阻塞](https://github.com/feifei14119/rocm_start_sample/tree/master/vmemSQIssue) >>[3. vmem指令L1冲突](https://github.com/feifei14119/rocm_start_sample/tree/master/vmemL1Bank)