# aichip-lucholesky

**Repository Path**: ssslab/aichip-lucholesky

## Basic Information

- **Project Name**: aichip-lucholesky
- **Description**: Implementing LU and Cholesky Factorizations on Artificial Intelligence Accelerators
- **Primary Language**: Unknown
- **License**: Apache-2.0
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 3
- **Forks**: 0
- **Created**: 2021-10-18
- **Last Updated**: 2022-12-05

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

## Implementing LU and Cholesky Factorizations on Artifcial Intelligence Accelerators

We in this  work implement LU and Cholesky factorizations on novel massively parallel artifcial intelligence (AI) accelerators originally  developed for deep neural network applications. We explore data parallelism of the matrix factorizations, and exploit neural  compute units and on-chip scratchpad memories of modern AI chips for accelerating them. The experimental results show  that our various optimization methods bring performance improvements and can provide up to 41.54 and 19.77 GFlop/s  performance using single precision data type and 78.37 and 33.85 GFlop/s performance using half precision data type for  LU and Cholesky factorizations on a Cambricon AI accelerator, respectively.

- See the [LU Guide](https://gitee.com/ssslab/aichip-lucholesky/blob/master/LU) for a detailed introduction of the starting of LU factrization.

- See the [Cholesky Guide](https://gitee.com/ssslab/aichip-lucholesky/blob/master/Cholesky) for a detailed introduction of the starting of Cholesky factrization.