From 6cb126b395f5d310e2b8655cf925b1dfb4dba1cd Mon Sep 17 00:00:00 2001
From: liuchuting
Date: Thu, 8 May 2025 17:02:54 +0800
Subject: [PATCH] Format the document

---
 mshub_res/assets/mindspore/2.5/animatediff.md | 12 +--
 .../assets/mindspore/2.5/autoencoders.md      |  6 +-
 mshub_res/assets/mindspore/2.5/cogview.md     |  6 +-
 mshub_res/assets/mindspore/2.5/dit.md         |  2 +-
 mshub_res/assets/mindspore/2.5/emu3.md        |  4 +-
 mshub_res/assets/mindspore/2.5/fit.md         |  2 +-
 mshub_res/assets/mindspore/2.5/hunyuan_dit.md | 10 +--
 .../assets/mindspore/2.5/hunyuanvideo-i2v.md  | 18 +++--
 .../assets/mindspore/2.5/hunyuanvideo.md      | 28 +++----
 mshub_res/assets/mindspore/2.5/hunyun3d_1.md  |  4 +-
 mshub_res/assets/mindspore/2.5/instantmesh.md |  4 +-
 mshub_res/assets/mindspore/2.5/janus.md       |  6 +-
 .../assets/mindspore/2.5/kohya_sd_scripts.md  |  8 +-
 mshub_res/assets/mindspore/2.5/mvdream.md     |  4 +-
 mshub_res/assets/mindspore/2.5/openlrm.md     |  2 +-
 .../assets/mindspore/2.5/opensora_hpcai.md    | 79 ++++++++++---------
 .../assets/mindspore/2.5/opensora_pku.md      | 38 ++++-----
 mshub_res/assets/mindspore/2.5/qwen2_vl.md    |  4 +-
 mshub_res/assets/mindspore/2.5/sharegpt_4v.md |  6 +-
 .../assets/mindspore/2.5/step_video_t2v.md    |  4 +-
 .../assets/mindspore/2.5/story_diffusion.md   | 10 +--
 mshub_res/assets/mindspore/2.5/var.md         |  4 +-
 mshub_res/assets/mindspore/2.5/venhancer.md   |  6 +-
 .../assets/mindspore/2.5/videocomposer.md     | 33 ++++----
 mshub_res/assets/mindspore/2.5/wan2_1.md      | 22 +++---
 25 files changed, 165 insertions(+), 157 deletions(-)

diff --git a/mshub_res/assets/mindspore/2.5/animatediff.md b/mshub_res/assets/mindspore/2.5/animatediff.md
index 1076ec4..44ef4a7 100644
--- a/mshub_res/assets/mindspore/2.5/animatediff.md
+++ b/mshub_res/assets/mindspore/2.5/animatediff.md
@@ -18,7 +18,7 @@ author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore

@@ -38,11 +38,11 @@ This repository is the MindSpore implementation of [AnimateDiff](https://arxiv.o

 ## Features

-- [x] Text-to-video generation with AnimdateDiff v2, supporting 16 frames @512x512 resolution on Ascend Atlas 800T A2 machines
-- [x] MotionLoRA inference
-- [x] Motion Module Training
-- [x] Motion LoRA Training
-- [x] AnimateDiff v3 Inference
+- ✔ Text-to-video generation with AnimateDiff v2, supporting 16 frames @512x512 resolution on Ascend Atlas 800T A2 machines
+- ✔ MotionLoRA inference
+- ✔ Motion Module Training
+- ✔ Motion LoRA Training
+- ✔ AnimateDiff v3 Inference

 ## Requirements

diff --git a/mshub_res/assets/mindspore/2.5/autoencoders.md b/mshub_res/assets/mindspore/2.5/autoencoders.md
index 0f2554e..84eb2f4 100644
--- a/mshub_res/assets/mindspore/2.5/autoencoders.md
+++ b/mshub_res/assets/mindspore/2.5/autoencoders.md
@@ -20,7 +20,7 @@ author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore

@@ -41,8 +41,8 @@ This repository contains SoTA image and video autoencoders and their training an

 ## Features

 - VAE (Image Variational AutoEncoder)
-  - [x] KL-reg with GAN loss (SD VAE)
-  - [x] VQ-reg with GAN loss (VQ-GAN)
+  - ✔ KL-reg with GAN loss (SD VAE)
+  - ✔ VQ-reg with GAN loss (VQ-GAN)

 ## Requirements

diff --git a/mshub_res/assets/mindspore/2.5/cogview.md b/mshub_res/assets/mindspore/2.5/cogview.md
index cd2371e..d25f4cc 100644
--- a/mshub_res/assets/mindspore/2.5/cogview.md
+++ b/mshub_res/assets/mindspore/2.5/cogview.md
@@ -12,13 +12,13 @@ fine-tunable: True

 model-version: 2.5

-train-dataset: N·A
+train-dataset: N/A

 author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore team

@@ -32,7 +32,7 @@ summary: CogView4 is used for text-to-image generation

 ---

-## News
+# CogView4 based on MindSpore

 - 🔥🔥 `2025/03/05`: We have reproduced the inference of the excellent work CogView4, which was open-sourced by THUDM, on MindSpore.

diff --git a/mshub_res/assets/mindspore/2.5/dit.md b/mshub_res/assets/mindspore/2.5/dit.md
index 4938b94..9303e15 100644
--- a/mshub_res/assets/mindspore/2.5/dit.md
+++ b/mshub_res/assets/mindspore/2.5/dit.md
@@ -18,7 +18,7 @@ author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore

diff --git a/mshub_res/assets/mindspore/2.5/emu3.md b/mshub_res/assets/mindspore/2.5/emu3.md
index f6c528d..a596e3b 100644
--- a/mshub_res/assets/mindspore/2.5/emu3.md
+++ b/mshub_res/assets/mindspore/2.5/emu3.md
@@ -12,13 +12,13 @@ fine-tunable: True

 model-version: 2.5

-train-dataset: N·A
+train-dataset: N/A

 author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore

diff --git a/mshub_res/assets/mindspore/2.5/fit.md b/mshub_res/assets/mindspore/2.5/fit.md
index 099fbd1..93933a9 100644
--- a/mshub_res/assets/mindspore/2.5/fit.md
+++ b/mshub_res/assets/mindspore/2.5/fit.md
@@ -18,7 +18,7 @@ author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore

diff --git a/mshub_res/assets/mindspore/2.5/hunyuan_dit.md b/mshub_res/assets/mindspore/2.5/hunyuan_dit.md
index 5382cc9..5e1f913 100644
--- a/mshub_res/assets/mindspore/2.5/hunyuan_dit.md
+++ b/mshub_res/assets/mindspore/2.5/hunyuan_dit.md
@@ -12,13 +12,13 @@ fine-tunable: True

 model-version: 2.5

-train-dataset: N·A
+train-dataset: N/A

 author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore

@@ -54,9 +54,9 @@ summary: HunyuanDiT is a multi-resolution diffusion transformer with fine-graine

 ### TODO

-- [ ] EMA
-- [ ] ControlNet training
-- [ ] Enhance prompt
+- ✖ EMA
+- ✖ ControlNet training
+- ✖ Enhance prompt

 ## Dependencies and Installation

diff --git a/mshub_res/assets/mindspore/2.5/hunyuanvideo-i2v.md b/mshub_res/assets/mindspore/2.5/hunyuanvideo-i2v.md
index 44d4ee1..4487e31 100644
--- a/mshub_res/assets/mindspore/2.5/hunyuanvideo-i2v.md
+++ b/mshub_res/assets/mindspore/2.5/hunyuanvideo-i2v.md
@@ -12,13 +12,13 @@ fine-tunable: True

 model-version: 2.5

-train-dataset: N·A
+train-dataset: N/A

 author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore

@@ -32,6 +32,8 @@ summary: HunyuanVideo-I2V is used for image-to-video generation

 ---

+# HunyuanVideo-I2V based on MindSpore
+
 This is a **MindSpore** implementation of [HunyuanVideo-I2V](https://github.com/Tencent/HunyuanVideo-I2V). It contains the code for **training** and **inference** of HunyuanVideo and 3D CausalVAE.

 ## 📑 Development Plan

@@ -39,13 +41,13 @@ This is a **MindSpore** implementation of [HunyuanVideo-I2V](https://github.com/

 Here is the development plan of the project:

 - CausalVAE:
-  - [x] Inference
-  - [ ] Evaluation
-  - [ ] Training
+  - ✔ Inference
+  - ✖ Evaluation
+  - ✖ Training
 - HunyuanVideo (13B):
-  - [x] Inference (w. and w.o. LoRA weight)
-  - [ ] Training
-  - [ ] LoRA finetune
+  - ✔ Inference (w. and w.o. LoRA weight)
+  - ✖ Training
+  - ✖ LoRA finetune

 ## 📦 Requirements

diff --git a/mshub_res/assets/mindspore/2.5/hunyuanvideo.md b/mshub_res/assets/mindspore/2.5/hunyuanvideo.md
index 2fb0152..d29274d 100644
--- a/mshub_res/assets/mindspore/2.5/hunyuanvideo.md
+++ b/mshub_res/assets/mindspore/2.5/hunyuanvideo.md
@@ -12,13 +12,13 @@ fine-tunable: True

 model-version: 2.5

-train-dataset: N·A
+train-dataset: N/A

 author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore

@@ -41,18 +41,18 @@ This is a **MindSpore** implementation of [HunyuanVideo](https://arxiv.org/abs/2

 Here is the development plan of the project:

 - CausalVAE:
-  - [x] Inference
-  - [x] Evaluation
-  - [x] Training
+  - ✔ Inference
+  - ✔ Evaluation
+  - ✔ Training
 - HunyuanVideo (13B):
-  - [x] Inference
-  - [x] Sequence Parallel (Ulysses SP)
-  - [x] VAE latent cache
-  - [x] Training up to `544x960x129` and `720x1280x129` with SP and VAE latent cache
-  - [x] Training stage 1: T2I 256px
-  - [ ] Training stage 2: T2I 256px 512px (buckets)
-  - [ ] Training stage 3: T2I/V up to 720x1280x129 (buckets)
-  - [ ] LoRA finetune
+  - ✔ Inference
+  - ✔ Sequence Parallel (Ulysses SP)
+  - ✔ VAE latent cache
+  - ✔ Training up to `544x960x129` and `720x1280x129` with SP and VAE latent cache
+  - ✔ Training stage 1: T2I 256px
+  - ✖ Training stage 2: T2I 256px 512px (buckets)
+  - ✖ Training stage 3: T2I/V up to 720x1280x129 (buckets)
+  - ✖ LoRA finetune

 ## 📦 Requirements

@@ -137,7 +137,7 @@ If you want to run T2V inference using sequence parallel (Ulysses SP), please us

 ### Run Image-to-Video Inference

-Please find more information about HunyuanVideo Image-to-Video Inference at this [url](https://github.com/mindspore-lab/mindone/tree/master/examples/hunyuanvideo-i2v).
+Please find more information about HunyuanVideo Image-to-Video Inference at this [url](https://github.com/mindspore-lab/mindone/tree/v0.3.0/examples/hunyuanvideo-i2v).

 ## 🔑 Training

diff --git a/mshub_res/assets/mindspore/2.5/hunyun3d_1.md b/mshub_res/assets/mindspore/2.5/hunyun3d_1.md
index 76f64ef..0e451b1 100644
--- a/mshub_res/assets/mindspore/2.5/hunyun3d_1.md
+++ b/mshub_res/assets/mindspore/2.5/hunyun3d_1.md
@@ -12,13 +12,13 @@ fine-tunable: True

 model-version: 2.5

-train-dataset: N·A
+train-dataset: N/A

 author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore

diff --git a/mshub_res/assets/mindspore/2.5/instantmesh.md b/mshub_res/assets/mindspore/2.5/instantmesh.md
index 618abab..a22ba5e 100644
--- a/mshub_res/assets/mindspore/2.5/instantmesh.md
+++ b/mshub_res/assets/mindspore/2.5/instantmesh.md
@@ -18,7 +18,7 @@ author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore

@@ -129,8 +129,6 @@ huggingface-cli download zxhezexin/openlrm-base-obj-1.0 # do this if your proxy

 Hurray! Now `mindone.transformers` supported pretrained ckpt loading for `xx_model.bin`. You can now bypass the conversion above.

----
-
 The image features are extracted with dino-vit, which depends on HuggingFace's transformer package. We reuse [the MindSpore's implementation](https://github.com/mindspore-lab/mindone/blob/master/mindone/transformers/modeling_utils.py#L499) and the only challenge remains to be that `.bin` checkpoint of [dino-vit](https://huggingface.co/facebook/dino-vitb16/tree/main) is not supported by MindSpore off-the-shelf. The checkpoint script above serves easy conversion purposes and ensures that dino-vit is still based on `MSPreTrainedModel` safe and sound.

 ### InstantMesh Checkpoint

diff --git a/mshub_res/assets/mindspore/2.5/janus.md b/mshub_res/assets/mindspore/2.5/janus.md
index bd925ef..ef9d894 100644
--- a/mshub_res/assets/mindspore/2.5/janus.md
+++ b/mshub_res/assets/mindspore/2.5/janus.md
@@ -12,13 +12,13 @@ fine-tunable: True

 model-version: 2.5

-train-dataset: N·A
+train-dataset: N/A

 author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore

@@ -58,7 +58,7 @@ summary: Janus is a unified multimodal understanding and generation model

 We provide an efficient MindSpore implementation of [JanusPro](https://github.com/deepseek-ai/Janus). This repository is built on the models and code released by DeepSeek. We are grateful for their exceptional work and generous contribution to open source.

-## News
+# Janus-Pro based on MindSpore

 **2025.03.12**: We have reproduced the multi-modal training pipelines referring to the JanusPro [paper](https://github.com/deepseek-ai/Janus), see [docs/training.md](docs/training.md).

diff --git a/mshub_res/assets/mindspore/2.5/kohya_sd_scripts.md b/mshub_res/assets/mindspore/2.5/kohya_sd_scripts.md
index 82bd4b5..3ef51d2 100644
--- a/mshub_res/assets/mindspore/2.5/kohya_sd_scripts.md
+++ b/mshub_res/assets/mindspore/2.5/kohya_sd_scripts.md
@@ -18,7 +18,7 @@ author: MindSpore team

 update-time: 2025-03-10

-repo-link:
+repo-link:

 user-id: MindSpore

@@ -38,9 +38,9 @@ Here we provide a MindSpore implementation of [Kohya's Stable Diffusion trainers

 Currently, we support

-- [x] SDXL LoRA training
-- [x] SDXL LoRA (Dreambooth) training
-- [x] SDXL Inference
+- ✔ SDXL LoRA training
+- ✔ SDXL LoRA (Dreambooth) training
+- ✔ SDXL Inference

 > Notes: Basically, we've tried to provide a consistent implementation with the torch Kohya SD trainer, but we have limitations due to differences in the framework. Refer to the main difference between the two codebases listed [here](./Limitations.md) if needed.

diff --git a/mshub_res/assets/mindspore/2.5/mvdream.md b/mshub_res/assets/mindspore/2.5/mvdream.md
index 762a1eb..136809b 100644
--- a/mshub_res/assets/mindspore/2.5/mvdream.md
+++ b/mshub_res/assets/mindspore/2.5/mvdream.md
@@ -18,7 +18,7 @@ author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore

@@ -32,6 +32,8 @@ summary: MVDream is a diffusion model for multi-view consistent 3D generation

 ---

+# MVDream based on MindSpore
+
 We support the training/inference pipeline of a diffusion-prior based, neural implicit field rendered, 3D mesh generation work called MVDream here.

 ## Introduction

diff --git a/mshub_res/assets/mindspore/2.5/openlrm.md b/mshub_res/assets/mindspore/2.5/openlrm.md
index b1b3434..618f672 100644
--- a/mshub_res/assets/mindspore/2.5/openlrm.md
+++ b/mshub_res/assets/mindspore/2.5/openlrm.md
@@ -20,7 +20,7 @@ author: MindSpore team

 update-time: 2025-03-10

-repo-link:
+repo-link:

 user-id: MindSpore

diff --git a/mshub_res/assets/mindspore/2.5/opensora_hpcai.md b/mshub_res/assets/mindspore/2.5/opensora_hpcai.md
index 53e72ef..6057061 100644
--- a/mshub_res/assets/mindspore/2.5/opensora_hpcai.md
+++ b/mshub_res/assets/mindspore/2.5/opensora_hpcai.md
@@ -12,13 +12,13 @@ fine-tunable: True

 model-version: 2.5

-train-dataset: UCF-101 | WebVid | MixKit
+train-dataset: UCF-101

 author: MindSpore team

 update-time: 2025-04-22

-repo-link:
+repo-link:

 user-id: MindSpore

@@ -32,13 +32,13 @@ summary: OpenSora-HPCAI is a large video generation model for text-to-video gene

 ---

-## Open-Sora: Democratizing Efficient Video Production for All
+# Open-Sora: Democratizing Efficient Video Production for All

 Here we provide an efficient MindSpore implementation of [OpenSora](https://github.com/hpcaitech/Open-Sora), an open-source project that aims to foster innovation, creativity, and inclusivity within the field of content creation.

 This repository is built on the models and code released by HPC-AI Tech. We are grateful for their exceptional work and generous contribution to open source.

-Open-Sora is still at an early stage and under active development.
+Open-Sora is still at an early stage and under active development.

 ## 📰 News & States

 | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------ |
 | **[2025.03.12]** 🔥 We released **Open-Sora 2.0** (11B). 🎬 11B model achieves [on-par performance](#evaluation) with 11B HunyuanVideo & 30B Step-Video on 📐VBench & 📊Human Preference. 🛠️ Fully open-source: checkpoints and training codes for training with only **$200K**. [[report]](https://arxiv.org/abs/2503.09642) | Inference |
 | **[2024.06.17]** 🔥 HPC-AI released **Open-Sora 1.2**, which includes **3D-VAE**, **rectified flow**, and **score condition**. The video quality is greatly improved. [[checkpoints]](#model-weights) [[report]](https://github.com/hpcaitech/Open-Sora/blob/main/docs/report_03.md) | Text-to-Video |
-| **[2024.04.25]** 🤗 HPC-AI Tech released the [Gradio demo for Open-Sora](https://huggingface.co/spaces/hpcai-tech/open-sora) on Hugging Face Spaces. | N.A. |
+| **[2024.04.25]** 🤗 HPC-AI Tech released the [Gradio demo for Open-Sora](https://huggingface.co/spaces/hpcai-tech/open-sora) on Hugging Face Spaces. | N/A |
 | **[2024.04.25]** 🔥 HPC-AI Tech released **Open-Sora 1.1**, which supports **2s~15s, 144p to 720p, any aspect ratio** text-to-image, **text-to-video, image-to-video, video-to-video, infinite time** generation. In addition, a full video processing pipeline is released. [[checkpoints]]() [[report]](https://github.com/hpcaitech/Open-Sora/blob/main/docs/report_02.md) | Image/Video-to-Video; Infinite time generation; Variable resolutions, aspect ratios, durations |
-| **[2024.03.18]** HPC-AI Tech released **Open-Sora 1.0**, a fully open-source project for video generation. | ✅ VAE + STDiT training and inference |
-| **[2024.03.04]** HPC-AI Tech Open-Sora provides training with 46% cost reduction | ✅ Parallel training on Ascend devices |
+| **[2024.03.18]** HPC-AI Tech released **Open-Sora 1.0**, a fully open-source project for video generation. | ✔ VAE + STDiT training and inference |
+| **[2024.03.04]** HPC-AI Tech Open-Sora provides training with 46% cost reduction | ✔ Parallel training on Ascend devices |

 ## Requirements

@@ -64,10 +64,10 @@ The following videos are generated based on MindSpore and Ascend Atlas 800T A2 m

 ### OpenSora 2.0 Demo

 | 3s 576×1024 | 5s 576×1024 |
 | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | --------------------------------------------------------------------------------------------------------------------------------------- |
 |