"RAG-Anything: All-in-One RAG Framework"
On-device TTS model by Neuphonic
ContextGem: Effortless LLM extraction from documents
This project is a real-time, multilingual voice translator that leverages the power of local AI models for speech-to-text, translation, and text-to-speech. It is designed to be a powerful and flexible tool for anyone who needs to communicate across language barriers.
Load Aspect Models in Python
This project is a real-time, multilingual voice translator that leverages the power of local AI models for speech-to-text, translation, and text-to...
最近更新: 2天前A voice-enabled AI assistant that converts MCP (Model Context Protocol) servers into OpenAI API tool format and provides real-time voice interactio...
最近更新: 2天前Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
最近更新: 2天前Kotai is a fully local, zero-cost voice assistant that combines the power of Kyutai TTS/STT, LiveKit, and local LLMs to create natural conversation...
最近更新: 2天前A FastAPI-based Speech-to-Text service that provides OpenAI Whisper API compatibility using Kyutai's powerful STT models. This allows you to use an...
最近更新: 2天前A Docker-based OpenAI-compatible Text-to-Speech API server powered by Kyutai's TTS models with GPU acceleration support.
最近更新: 2天前Python library for working with the QUDT (Quantity, Unit, Dimension and Type) ontology.
最近更新: 2天前