# python_voice

**Repository Path**: mayjean/python_voice

## Basic Information

- **Project Name**: python_voice
- **Description**: python 语音处理
- **Primary Language**: Unknown
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2024-11-20
- **Last Updated**: 2024-11-21

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README


Python+whisper/vosk实现语音识别

Whisper 是一个由 OpenAI 开发的人工智能语音识别模型，它能够将语音转换为文本。Whisper 模型特别之处在于它的设计目标是能够理解和转录多种语言的语音，包括但不限于英语。Whisper 模型在训练时使用了大量不同语言的语音数据，这使得它具有很好的跨语言能力。
```
pip install openai-whisper

```
Vosk 是一个开源的语音识别库，它可以在离线环境下工作，不依赖于任何外部服务。 Vosk 基于Kaldi语音识别框架，支持多种语言，包括中文。 Vosk提供了多种预训练模型，可以根据需求选择适合的模型进行语音识别。Vosk的优势在于可以在没有网络连接的情况下进行语音识别，并且支持多种操作系统和编程语言，用户可以根据自己的需求训练和优化模型。

https://alphacephei.com/vosk/models

以下下载的是大型通用中文模型vosk-model-cn-0.22.zip

```
pip install vosk
 
# 下载模型并解压
wget https://alphacephei.com/vosk/models/vosk-model-cn-0.22.zip
unzip vosk-model-cn-0.22.zip
 
```