# SpeechAlgorithms

**Repository Path**: sevenhub_admin/speech_algorithms2

## Basic Information

- **Project Name**: SpeechAlgorithms
- **Description**: No description available
- **Primary Language**: Unknown
- **License**: Apache-2.0
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2021-12-21
- **Last Updated**: 2024-06-21

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# Speech Algorithms

[English README](https://github.com/Ryuk17/SpeechAlgorithms/blob/master/README_EN.md)

[Download a single folder](https://minhaskamal.github.io/DownGit/#/home)

# Contents

## Signal-processing speech algorithms

| Title | Article | Code |
| -------- | -----: | :----: |
| A first look at speech denoising: spectral subtraction | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484134&idx=1&sn=0b5adda3ade249f7d37f0146a92293a9&chksm=9f226971a855e0676d985a8f3b72e0fb3e243b1e92102ba6116dd82e43b1bc63196e802ab851&scene=21#wechat_redirect) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/SpectralSubtraction) |
| Mask-based speech separation | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484164&idx=1&sn=f0f59a10fa04f02228bbba381348e66c&chksm=9f226893a855e185aab5b0abcf6c8c11802fe0b8d97b22c89222d533cf32c6498fa8587a4e77&scene=21#wechat_redirect) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/SpeechSperation) |
| Generating mixed speech samples with noise, echo, reverberation, or howling | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484214&idx=1&sn=21db3addb4e1163b4c5d26156ee97aa4&chksm=9f2268a1a855e1b7d2dc21c6ab5697231ba21a1014f94a7919349cae10db70dd887015cb31ce&token=324471119&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/SpeechAugmentation) |
| Adaptive-filter echo cancellation explained | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484312&idx=1&sn=afa00c2fb91f72bdfd73fc99f0efede0&chksm=9f22680fa855e119cdc2fc2c6a4ddb646bf2adff27010a6c0c16383b687c0104fc3d9561c0ef&token=1373088786&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/AcousticEchoCancellation) |
| Generating VAD labels with the AMR codec | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&tempkey=MTA3OV9DczEweFQ0NlB3U1lCNm0zZlpvbS15dTNqNGhpUHkwT2FydHlwNjBuRjdrd2JMS0VUaUFETms0c3FLUmE5WUc4Z1pySWlTdWh2Q3VIblQ0TDh2T051LTlSN3hVdV8tTjFNSmNEQkxDd0lBbE1SZGl5Yzhkbm9Najc3d2doajV3WE10eXUzYVZMdUNaTFlBaUFJLVlldF9xNjVwRWhlOUpQaVJmcWhnfn4%3D&chksm=1f226f9f2855e6894e033f39502a5ceb9d6e781a2b396dcbdc49e9cacb4898da9d5408a8d710#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/VoiceActivityDetection/VADCoder) |
| Sound source localization with TDOA | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484417&idx=1&sn=a416da2d9238cd863697d91dd26233e4&chksm=9f226f96a855e6808ac3d90e83f8c673d8daddc57b95a537c0a2ba547ce53307452b0940c19a&token=139302241&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/SoundSourceLocalization) |
| Resampling speech signals to an arbitrary rate | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484499&idx=1&sn=e48de61a2497511626e5ae9312f20e57&chksm=9f226fc4a855e6d2db0590f08459af79e88420af09ec7ce97f75946c0437978e9262a7f36809&token=1215134525&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/Resample) |
| Embedding and extracting digital audio watermarks | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484522&idx=1&sn=df457333ad5e1af32708c171ae7f5e1b&chksm=9f226ffda855e6eb57579732b002e36e1479e29be73980f3ee8b05d26157a3a6031791e8e0e4&token=909330882&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/Watermarking) |
| Changing speech speed and pitch | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484554&idx=1&sn=c7a56dbf1c06654b02f0d51bffbd1d10&chksm=9f226f1da855e60bfdb2584d46a34f10a53028416817415d28d19bead72626d10e6757f80854&token=147554636&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/VoiceChange) |
| Generating the sound of rain | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484656&idx=1&sn=d25135333a1cae0356360443544b4ccd&chksm=9f226f67a855e671994c2e946773eea2bac54c1f03146e3d7617d90c62cae2861ad1447ffdc7&token=1233423028&lang=zh_CN#rdd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/DesignSound) |
| Framing, windowing, and the DFT | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484741&idx=1&sn=1e3ebd6d9a0da6879433bf795677006e&chksm=9f226ed2a855e7c430c53d22b8bd781fde59d6e4760376fcb94708bd8295199a1100971d754a&token=570335002&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/EnframeWindowFFT) |
| WebRTC VAD pipeline walkthrough | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484833&idx=1&sn=8584096ebda5474d7a8907f657100bc0&chksm=9f226e36a855e720da11974d1a1da4ca2cb4b588670279d8c6cca486f9deffc82e930dbd5cdf&token=1458190114&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/WebRTC_VAD) |
| Kalman-filter-based echo cancellation | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484999&idx=1&sn=4bad80ad016cae43b0adcead513e28f6&chksm=9f226dd0a855e4c6fd0af54380225f1269e9760043d9c4ff15880d623c25f223ccc3e864db35&token=216336716&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/AcousticEchoCancellation) |
| WebRTC ANR pipeline walkthrough | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247485024&idx=1&sn=4d8f700913b0e1bf282844cc5c4bc37c&chksm=9f226df7a855e4e11b1ece471b9d188e542993efd9504a5b796457d87c84013dca68401af0e0&token=1865469620&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/WebRTC_ANR) |
| WebRTC AGC pipeline walkthrough | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247485024&idx=1&sn=4d8f700913b0e1bf282844cc5c4bc37c&chksm=9f226df7a855e4e11b1ece471b9d188e542993efd9504a5b796457d87c84013dca68401af0e0&token=1865469620&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/WebRTC_AGC) |

## Machine/deep learning speech algorithms

| Title | Article | Code |
| -------- | -----: | :----: |
| Single-channel speech enhancement with a DNN | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484173&idx=1&sn=96ac7133e20dc95c3f2e7f16f74dcfb1&chksm=9f22689aa855e18c8417889ed0da02f143743d4ff805d877bc8dbfc4fde8a391fa6a127cc177&token=324471119&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/SpeechEnhancement) |
| Endpoint detection (VAD) with an LSTM | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484255&idx=1&sn=676d243fb7aea63b912e1a9833169578&chksm=9f2268c8a855e1deb3c4bb4db0990c625487c3e776c6f9baf293235be3f700ed267af61dfac1&token=221372596&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/VoiceActivityDetection) |
| Simple command recognition with a CNN | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484242&idx=1&sn=527db511d57cf4ff4c1f909423034603&chksm=9f2268c5a855e1d3497b657ac04533eca0070d680b1b0bb8971fd998cb4480e6e17692283850&token=1676318016&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/CommandRecognition) |
| Speaker gender recognition | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484304&idx=1&sn=d7820bc93bd9dabe73079a5f56df9807&chksm=9f226807a855e11162ed0856b12723ba0b967ccb901b7f8a27219e3b475bbecd1327dd72874d&token=2016526173&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/GenderClassify) |
| Environmental sound classification with XGBoost | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484371&idx=1&sn=2b4cb91b1044d46a0da41e9421bbcfce&chksm=9f226844a855e152eef92d9278e8c81e6137decaf11b8c39f00d58b0c7cecebdd996e9433420&token=45703017&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/EnvironmentSoundClassification) |

## Speech evaluation metrics

| Title | Article | Code |
| -------- | -----: | :----: |
| Objective speech evaluation: speech quality measures | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484460&idx=1&sn=ee26a6c9fc19857b416eef7264fba244&chksm=9f226fbba855e6ad0997e376079772e05a468084710d7955506e527300ea8e5aaef97f6c3e30&token=1215134525&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/SpeechQualityMeasures) |
| Speech intelligibility evaluation (1): articulation-index-based methods | [Link](https://mp.weixin.qq.com/s?__biz=MzA3MjEyMjEwNA==&mid=2247484699&idx=1&sn=13b21c5556fb3815816e21c95e7c6f59&chksm=9f226e8ca855e79aa230335fc4000f4fab3c17ffb31d671d98fd5f1602441e272ce29bdb4c52&token=1700055186&lang=zh_CN#rd) | [Code](https://github.com/Ryuk17/SpeechAlgorithms/tree/master/SpeechIntelligibilityMetrics) |