# PaddleVideo **Repository Path**: tzthomas/PaddleVideo ## Basic Information - **Project Name**: PaddleVideo - **Description**: PaddleVideo是飞桨官方出品的视频模型开发套件,旨在帮助开发者更好的进行视频领域的学术研究和产业实践。基于模块化的设计,提供丰富的视频算法实现、产业级的视频算法优化与应用,包括安防、体育、互联网、媒体等行业的动作定位与识别、行为分析、智能封面、视频标注、视频打标签等,涵盖动作识别与视频分类、动作定位、动作检测、多模态文本视频检索等技术。 - **Primary Language**: Python - **License**: Apache-2.0 - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 19 - **Created**: 2023-02-18 - **Last Updated**: 2023-02-18 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README [简体中文](README.md) | English # PaddleVideo ## Update: - add skeleton-base action recognition model [CTR-GCN](./docs/en/model_zoo/recognition/ctrgcn.md). - add lite action recognition model [MoViNet](./docs/zh-CN/model_zoo/recognition/movinet.md). - add temporal segment model [MS-TCN](./docs/zh-CN/model_zoo/segmentation/mstcn.md), [ASRF](./docs/zh-CN/model_zoo/segmentation/asrf.md). ​ 💖 **Welcome to scan the code and join the group discussion** 💖
- Scan the QR code below with your Wechat and reply "video", you can access to official technical exchange group. Look forward to your participation. ## Introduction ![python version](https://img.shields.io/badge/python-3.7+-orange.svg) ![paddle version](https://img.shields.io/badge/PaddlePaddle-2.0-blue ) PaddleVideo is a toolset for video tasks prepared for the industry and academia. This repository provides examples and best practice guildelines for exploring deep learning algorithm in the scene of video area.

## Model and Applications ### Model zoo - Please refer to [Installation guide](docs/zh-CN/install.md) and [Usage doc](docs/zh-CN/usage.md) before using the model zoo.
Action recognition method
PP-TSM (PP series) PP-TSN (PP series) PP-TimeSformer (PP series) TSN (2D’) TSM (2D')
SlowFast (3D’) TimeSformer (Transformer') VideoSwin (Transformer’) AttentionLSTM (RNN') MoViNet (Lite‘)
Skeleton based action recognition
ST-GCN (Custom’) AGCN (Adaptive') CTR-GCN (GCN‘)
Sequence action detection method
BMN (One-stage')
temporal segment
MS-TCN ASRF
Spatio-temporal motion detection method
SlowFast+Fast R-CNN
Multimodal
ActBERT (Learning') T2VLAD (Retrieval')
Video target segmentation
CFBI (Semi') MA-Net (Supervised')
Monocular depth estimation
ADDS (Unsupervised‘)
### Dataset
Action Recognition
Kinetics-400 (Homepage) (CVPR'2017) UCF101 (Homepage) (CRCV-IR-12-01) ActivityNet (Homepage) (CVPR'2015) YouTube-8M (Homepage) (CVPR'2017)
Action Localization
ActivityNet (Homepage) (CVPR'2015)
Spatio-Temporal Action Detection
AVA (Homepage) (CVPR'2018)
Skeleton-based Action Recognition
NTURGB+D (Homepage) (IEEE CS'2016) FSD (Homepage)
Depth Estimation
Oxford-RobotCar (Homepage) (IJRR'2017)
Text-Video Retrieval
MSR-VTT (Homepage) (CVPR'2016)
Text-Video Pretrained Model
HowTo100M (Homepage) (ICCV'2019)
### Applications | Applications | Descriptions | | :--------------- | :------------ | | [FootballAction]() | Football action detection solution| | [BasketballAction](applications/BasketballAction) | Basketball action detection solution | | [TableTennis](applications/ableTennis) | Table tennis action recognition solution| | [FigureSkating](applications/FigureSkating) | Figure skating action recognition solution| | [VideoTag](applications/VideoTag) | 3000-category large-scale video classification solution | | [MultimodalVideoTag](applications/MultimodalVideoTag) | Multimodal video classification solution| | [VideoQualityAssessment](applications/VideoQualityAssessment) | Video quality assessment solution| | [PP-Care](applications/PP-Care) | 3DMRI medical image recognition solution | | [EIVideo](applications/EIVideo) | Interactive video segmentation tool| | [Anti-UAV](applications/Anti-UAV) |UAV detection solution| | [AbnormalActionDetection](applications/AbnormalActionDetection) |Abnormal action detection solution| | [PP-Human](applications/PPHuman) | Action recognition solution for pedestrian analysis scene | ## Documentation tutorial - AI-Studio Tutorial - [[Official] Paddle 2.1 realizes video understanding optimization model -- PP-TSM](https://aistudio.baidu.com/aistudio/projectdetail/3399656?contributionType=1) - [[Official] Paddle 2.1 realizes video understanding optimization model -- PP-TSN](https://aistudio.baidu.com/aistudio/projectdetail/2879980?contributionType=1) - [[Official] Paddle 2.1 realizes the classic model of video understanding -- TSN](https://aistudio.baidu.com/aistudio/projectdetail/2250682) - [[Official] Paddle 2.1 realizes the classic model of video understanding -- TSM](https://aistudio.baidu.com/aistudio/projectdetail/2310889) - [BMN video action positioning](https://aistudio.baidu.com/aistudio/projectdetail/2250674) - [ST-GCN Tutorial for Figure Skate Skeleton Point Action Recognition](https://aistudio.baidu.com/aistudio/projectdetail/2417717) - [[Practice]video understanding transformer model TimeSformer](https://aistudio.baidu.com/aistudio/projectdetail/3413254?contributionType=1) - Contribute code - [How to add a new algorithm](./docs/zh-CN/contribute/add_new_algorithm.md) - [Configuration system design analysis](./docs/en/tutorials/config.md) - [How to mention PR](./docs/zh-CN/contribute/how_to_contribute.md) ## Competition - [Figure skating action recoginition using skeleton based on PaddlePaddle](https://aistudio.baidu.com/aistudio/competition/detail/115/0/introduction), [AI Studio projects](https://aistudio.baidu.com/aistudio/projectdetail/2417717), [video course](https://www.bilibili.com/video/BV1w3411172G) - [Table tennis action proposal localization based on PaddlePaddle](https://aistudio.baidu.com/aistudio/competition/detail/127/0/introduction) - [CCKS 2021: Knowledge Augmented Video Semantic Understanding](https://www.biendata.xyz/competition/ccks_2021_videounde) ## License PaddleVideo is released under the [Apache 2.0 license](LICENSE). ## Thanks - Many thanks to [mohui37](https://github.com/mohui37)、[zephyr-fun](https://github.com/zephyr-fun)、[voipchina](https://github.com/voipchina) for contributing the code for prediction.