# VideoAgent
**Repository Path**: zx4321/VideoAgent
## Basic Information
- **Project Name**: VideoAgent
- **Description**: No description available
- **Primary Language**: Unknown
- **License**: MIT
- **Default Branch**: main
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2025-12-09
- **Last Updated**: 2025-12-09
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
| Feature Type |
Video Demo |
Required Models |
| Cross Talk |
English Stand-up Comedy to Chinese Crosstalk |
CosyVoice, Whisper, ImageBind |
| Talk Show |
Chinese Crosstalk to English Stand-up Comedy |
CosyVoice, Whisper, ImageBind |
| MAD TTS |
Xiao-Ming-Jian-Mo(ๅฐๆๅ้ญ) Meme |
fish-speech |
| MAD SVC |
AI Music Videos |
DiffSinger, seed-vc, Whisper, ImageBind |
| Rhythm |
Spider-Man: Across the Spider-Verse |
Whisper, ImageBind |
| Comm |
Commentary Video |
CosyVoice, Whisper, ImageBind |
| News |
Tech News: OpenAI's GPT-4o Image Generation Release |
CosyVoice, Whisper, ImageBind |
| Video QA/Summarization |
Dune 2 Movie Cast Update Podcast |
Whisper |
### ๐ค **LLM Configuration**
```bash
# VideoAgent\environment\config\config.yml
# Applicable scenarios and LLM configuration
# Claude is required as it powers the Agentic Graph Router
llm:
# Video Remixing/TTS/SVC/Stand-up/CrossTalk
deepseek_api_key: ""
deepseek_base_url: ""
# Agentic Graph Router/TTS/SVC/Stand-up/CrossTalk
claude_api_key: ""
claude_base_url: ""
# Video Editing/Overview/Summarization/QA/Commentary Video
gpt_api_key: ""
gpt_base_url: ""
# MLLM for caption and fine-grained video understanding
gemini_api_key: ""
gemini_base_url: ""
```
### ๐ฏ **Usage**
```bash
# With the configuration now complete, proceed to run the following instructions:
python main.py
# The console will output:
User Requirement: ...
# Requirement Example:
# 1. I need to create a reworded version of an existing video where the speech content is modified while maintaining the original speaker's voice. The video should have the same visuals as the original, but with updated dialogue that follows my specific requirements.
# 2. I have a standup comedy script that I'd like to turn into a professional-looking video. I need the script to be performed with good comedic timing and audience reactions, then matched with relevant video footage to create a complete standup comedy special. I already have a reference script and some footage I want to use for the video.
```
The current LLM selections are optimized for each function.
You can also adjust the model names in `VideoAgent\environment\config\llm.py` if needed.
---
## ๐ฎDemos