# audioFlux **Repository Path**: mirrors/audioFlux ## Basic Information - **Project Name**: audioFlux - **Description**: audioFlux 是一个深度学习相关的工具库，用于音频和音乐分析、特征提取的库，支持数十种时频分析变换方法，以及相应时域、频域数百种特征组合，可以提供给深度学习网络进行训练，用于 - **Primary Language**: C/C++ - **License**: MIT - **Default Branch**: master - **Homepage**: https://www.oschina.net/p/audioFlux - **GVP Project**: No ## Statistics - **Stars**: 13 - **Forks**: 1 - **Created**: 2023-02-21 - **Last Updated**: 2025-09-27 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README

# audioFlux ![GitHub Workflow Status (with branch)](https://img.shields.io/github/actions/workflow/status/libAudioFlux/audioFlux/build.yml?branch=master) ![example branch parameter](https://github.com/libAudioFlux/audioFlux/actions/workflows/build.yml/badge.svg?branch=master) ![language](https://img.shields.io/badge/language-C%20%7C%20Python%20-blue.svg) [![PyPI - Version](https://img.shields.io/pypi/v/audioflux)](https://pypi.org/project/audioflux/) [![PyPI - Python Version](https://img.shields.io/badge/python-%3E%3D3.6-brightgreen)](https://pypi.org/project/audioflux/) [![Docs](https://img.shields.io/badge/Docs-passing-brightgreen)](https://audioflux.top/index.html) ![GitHub](https://img.shields.io/github/license/libAudioFlux/audioFlux) [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.7548288.svg)](https://doi.org/10.5281/zenodo.7548288) **`audioflux`** is a deep learning tool library for audio and music analysis, feature extraction. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. It can be provided to deep learning networks for training, and is used to study various tasks in the audio field such as Classification, Separation, Music Information Retrieval(MIR) and ASR etc. ##### New Features * v0.1.8 * Add a variety of Pitch algorithms: `YIN`, `CEP`, `PEF`, `NCF`, `HPS`, `LHS`, `STFT` and `FFP`. * Add `PitchShift` and `TimeStretch` algorithms. ### Table of Contents - [Overview](#overview) - [Installation](#installation) - [Python Package Install](#python-package-install) - [Other Build](#other-build) - [Quickstart](#quickstart) - [Benchmark](#benchmark) - [Documentation](#documentation) - [Contributing](#contributing) - [Citing](#citing) - [License](#license) ## Overview **`audioFlux`** is based on data stream design. It decouples each algorithm module in structure, and can quickly and efficiently extract features of multiple dimensions. The following is the main feature architecture diagram.

You can use multiple dimensional feature combinations, select different deep learning networks training, study various tasks in the audio field such as Classification, Separation, MIR etc.

The main functions of **`audioFlux`** include **transform**, **feature** and **mir** modules. #### 1. Transform In the time–frequency representation, main transform algorithm: - **`BFT`** - Based Fourier Transform, similar short-time Fourier transform. - **`NSGT`** - Non-Stationary Gabor Transform. - **`CWT`** - Continuous Wavelet Transform. - **`PWT`** - Pseudo Wavelet Transform. The above transform supports all the following frequency scale types: - Linear - Short-time Fourier transform spectrogram. - Linspace - Linspace-scale spectrogram. - Mel - Mel-scale spectrogram. - Bark - Bark-scale spectrogram. - Erb - Erb-scale spectrogram. - Octave - Octave-scale spectrogram. - Log - Logarithmic-scale spectrogram. The following transform are not supports multiple frequency scale types, only used as independent transform: - **`CQT`** - Constant-Q Transform. - **`VQT`** - Variable-Q Transform. - **`ST`** - S-Transform/Stockwell Transform. - **`FST`** - Fast S-Transform. - **`DWT`** - Discrete Wavelet Transform. - **`WPT`** - Wave Packet Transform. - **`SWT`** - Stationary Wavelet Transform. Detailed transform function, description, and use view the documentation. The *_synchrosqueezing_* or *_reassignment_* is a technique for sharpening a time-frequency representation, contains the following algorithms: - **`reassign`** - reassign transform for `STFT`. - **`synsq`** - reassign data use `CWT` data. - **`wsst`** - reassign transform for `CWT`. #### 2. Feature The feature module contains the following algorithms: - **`spectral`** - Spectrum feature, supports all spectrum types. - **`xxcc`** - Cepstrum coefficients, supports all spectrum types. - **`deconv`** - Deconvolution for spectrum, supports all spectrum types. - **`chroma`** - Chroma feature, only supports `CQT` spectrum, Linear/Octave spectrum based on `BFT`. #### 3. MIR The mir module contains the following algorithms: - **`pitch`** - YIN, STFT, etc algorithm. - **`onset`** - Spectrum flux, novelty, etc algorithm. - **`hpss`** - Median filtering, NMF algorithm. ## Installation ![language](https://img.shields.io/badge/platform-%20Linux%20%7C%20macOS%20%7C%20Windows%20%7C%20iOS%20%7C%20Android%20-lyellow.svg) The library is cross-platform and currently supports Linux, macOS, Windows, iOS and Android systems. ### Python Package Install To install the **audioFlux** package, Python >=3.6, using the released python package. Using PyPI: ``` $ pip install audioflux ``` Using Anaconda: ``` $ conda install -c tanky25 -c conda-forge audioflux ``` ### Other Build - [iOS build](docs/installing.md#ios-build) - [Android build](docs/installing.md#android-build) - [Building from source](docs/installing.md#building-from-source) ## Quickstart - [Mel & MFCC](docs/examples.md#mel--mfcc) - [CWT & Synchrosqueezing](docs/examples.md#cwt--synchrosqueezing) - [CQT & Chroma](docs/examples.md#cqt--chroma) - [Different Wavelet Type](docs/examples.md#different-wavelet-type) - [Spectral Features](docs/examples.md#spectral-features) - [Pitch Estimate](docs/examples.md#pitch-estimate) - [Onset Detection](docs/examples.md#onset-detection) - [Harmonic Percussive Source Separation](docs/examples.md#harmonic-percussive-source-separation) More example scripts are provided in the [Documentation](https://audioflux.top/) section. ## Benchmark server hardware: - CPU: AMD Ryzen Threadripper 3970X 32-Core Processor

More detailed performance benchmark are provided in the [Benchmark](https://github.com/libAudioFlux/audioFlux/tree/master/benchmark) module. ## Documentation Documentation of the package can be found online: [https://audioflux.top](https://audioflux.top/) ## Contributing We are more than happy to collaborate and receive your contributions to **`audioFlux`**. If you want to contribute, please fork the latest git repository and create a feature branch. Submitted requests should pass all continuous integration tests. You are also more than welcome to suggest any improvements, including proposals for need help, find a bug, have a feature request, ask a general question, new algorithms. Open an issue ## Citing If you want to cite **`audioFlux`** in a scholarly work, please use the following ways: - If you are using the library for your work, for the sake of reproducibility, please cite the version you used as indexed at Zenodo: [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.7548288.svg)](https://doi.org/10.5281/zenodo.7548288) ## License audioFlux project is available MIT License.