Contents

  • Openpose Description
  • Model Architecture
  • Dataset
  • Features
  • Environment Requirements
  • Quick Start
  • Script Description
  • Training Process
  • Evaluation Process
  • Model Description

Openpose Description

Openpose proposes a bottom-up human pose estimation algorithm using Part Affinity Fields (PAFs), in contrast to top-down algorithms, which first detect people and then regress each person's keypoints and skeleton. The advantage of openpose is that its computing time does not increase significantly as the number of people in the image grows, whereas a top-down algorithm depends on the detection results and its runtime grows linearly with the number of people.

Paper: Zhe Cao, Tomas Simon, Shih-En Wei, Yaser Sheikh, "Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields", The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017

Model Architecture

In the first step, the image is passed through a baseline CNN to extract feature maps of the input; in the paper, the authors use the first 10 layers of the VGG-19 network. The feature maps are then processed in a multi-stage CNN pipeline to generate the Part Confidence Maps and Part Affinity Fields. In the last step, the Confidence Maps and Part Affinity Fields generated above are processed by a greedy bipartite matching algorithm to obtain the pose of each person in the image.
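
The two-branch, multi-stage structure can be summarized with a minimal MindSpore sketch. This is illustrative only, not the repository's src/openposenet.py: the 38 PAF channels and 19 heatmap channels follow the paper's COCO configuration, and the 128-channel backbone output and stand-in backbone are assumptions.

```python
import mindspore.nn as nn
import mindspore.ops as ops

class Stage(nn.Cell):
    """One stage: two parallel branches predict PAFs and part confidence maps."""
    def __init__(self, in_channels, paf_channels=38, heatmap_channels=19):
        super().__init__()
        self.paf_branch = nn.SequentialCell([
            nn.Conv2d(in_channels, 128, 3, pad_mode='same'), nn.ReLU(),
            nn.Conv2d(128, paf_channels, 1, pad_mode='same')])
        self.heatmap_branch = nn.SequentialCell([
            nn.Conv2d(in_channels, 128, 3, pad_mode='same'), nn.ReLU(),
            nn.Conv2d(128, heatmap_channels, 1, pad_mode='same')])

    def construct(self, x):
        return self.paf_branch(x), self.heatmap_branch(x)

class OpenPoseSketch(nn.Cell):
    """Backbone features feed stage 1; later stages also see the previous predictions."""
    def __init__(self, backbone, num_stages=6):
        super().__init__()
        self.backbone = backbone  # e.g. the first 10 layers of VGG-19
        self.stages = nn.CellList(
            [Stage(128)] + [Stage(128 + 38 + 19) for _ in range(num_stages - 1)])
        self.cat = ops.Concat(axis=1)

    def construct(self, x):
        feat = self.backbone(x)  # (N, 128, H/8, W/8) in the paper
        pafs, heatmaps = self.stages[0](feat)
        for i in range(1, len(self.stages)):
            pafs, heatmaps = self.stages[i](self.cat((feat, pafs, heatmaps)))
        return pafs, heatmaps

# stand-in backbone producing 128-channel features at 1/8 resolution
backbone = nn.SequentialCell([nn.Conv2d(3, 128, 3, stride=8, pad_mode='same'), nn.ReLU()])
net = OpenPoseSketch(backbone)
```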

Dataset

Prepare the datasets, including the training set, validation set, and annotations. The training and validation samples are located in the "dataset" directory. The available datasets include the COCO2014 and COCO2017 datasets. The provided training script uses the COCO2017 dataset as an example for data preprocessing during training. If you use datasets in another format, please modify the dataset loading and preprocessing methods accordingly.

  • Download the COCO2017 dataset from the official website and unzip it.
    wget http://images.cocodataset.org/zips/train2017.zip
    wget http://images.cocodataset.org/zips/val2017.zip
    wget http://images.cocodataset.org/annotations/annotations_trainval2017.zip
  • Create the mask dataset (a simplified sketch of the idea follows this list).

    Run python gen_ignore_mask.py:

    python gen_ignore_mask.py --train_ann ../dataset/annotations/person_keypoints_train2017.json --val_ann ../dataset/annotations/person_keypoints_val2017.json --train_dir ../dataset/train2017 --val_dir ../dataset/val2017
  • The dataset folder is generated in the root directory and contains the following files:

    ├── dataset
        ├── annotations
            ├─person_keypoints_train2017.json
            └─person_keypoints_val2017.json
        ├─ignore_mask_train2017
        ├─ignore_mask_val2017
        ├─train2017
        └─val2017
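
As background for the mask step above: the ignore mask marks person regions that carry no keypoint annotations, so they can be excluded from the loss. Below is a simplified sketch of the idea using pycocotools; the actual logic lives in src/gen_ignore_mask.py, and build_ignore_mask is a hypothetical helper.

```python
import numpy as np
from pycocotools.coco import COCO

def build_ignore_mask(coco, img_id):
    """Return a boolean mask covering persons that have no keypoint annotations."""
    info = coco.loadImgs(img_id)[0]
    mask = np.zeros((info['height'], info['width']), dtype=bool)
    person_cat = coco.getCatIds(catNms=['person'])
    for ann in coco.loadAnns(coco.getAnnIds(imgIds=img_id, catIds=person_cat)):
        if ann['num_keypoints'] == 0:  # unlabeled person: exclude this region from the loss
            mask |= coco.annToMask(ann).astype(bool)
    return mask

coco = COCO('../dataset/annotations/person_keypoints_train2017.json')
first_img_id = coco.getImgIds(catIds=coco.getCatIds(catNms=['person']))[0]
mask = build_ignore_mask(coco, first_img_id)
```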
    

Features

Mixed Precision

The mixed precision training method accelerates the deep learning neural network training process by using both single-precision and half-precision data formats, while maintaining the network accuracy achieved with single-precision training. Mixed precision training accelerates computation, reduces memory usage, and enables a larger model or batch size to be trained on specific hardware. For FP16 operators, if the input data type is FP32, the MindSpore backend automatically handles it with reduced precision. Users can check the reduced-precision operators by enabling the INFO log and searching for 'reduce precision'.
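
For illustration, mixed precision with a fixed loss scale can be enabled through MindSpore's Model wrapper as sketched below. This is a minimal sketch with placeholder network, loss, and optimizer; the actual training script wires this up from default_config.yaml.

```python
import mindspore.nn as nn
from mindspore import Model
from mindspore.train.loss_scale_manager import FixedLossScaleManager

net = nn.Dense(4, 2)  # placeholder standing in for the openpose network
loss = nn.MSELoss()
opt = nn.Adam(net.trainable_params(), learning_rate=1e-4)

# 16384 matches 'loss_scale' in default_config.yaml
loss_scale = FixedLossScaleManager(16384, drop_overflow_update=False)
model = Model(net, loss_fn=loss, optimizer=opt,
              loss_scale_manager=loss_scale,
              amp_level="O2")  # run most operators in FP16, keep batch norm in FP32
# model.train(epoch, dataset) would then perform mixed-precision training
```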

Environment Requirements

Quick Start

After installing MindSpore via the official website, you can start training and evaluation as follows:

# run training example
python train.py --imgpath_train ./train2017 --jsonpath_train ./person_keypoints_train2017.json --maskpath_train ./ignore_mask_train2017 --vgg_path ./vgg19-0-97_5004.ckpt > train.log 2>&1 &

# run distributed training example
bash run_distribute_train.sh [RANK_TABLE_FILE] [IMGPATH_TRAIN] [JSONPATH_TRAIN] [MASKPATH_TRAIN] [VGG_PATH]

# run evaluation example
python eval.py --model_path path_to_eval_model.ckpt --imgpath_val ./dataset/val2017 --ann ./dataset/annotations/person_keypoints_val2017.json > eval.log 2>&1 &
OR
bash scripts/run_eval_ascend.sh [MODEL_PATH] [IMGPATH_VAL] [ANN]

[RANK_TABLE_FILE] is the path of the multi-card information configuration table for the environment. The configuration table can be generated automatically by the hccl_tools utility.

Script Description

Script and Sample Code

├── openpose
    ├── README.md                        // descriptions about openpose
    ├── scripts
    │   ├── run_standalone_train.sh      // shell script for standalone training on Ascend
    │   ├── run_distribute_train.sh      // shell script for distributed training on Ascend with 8 devices
    │   └── run_eval_ascend.sh           // shell script for evaluation on Ascend
    ├── src
    │   ├── model_utils
    │   │   ├── config.py                // parameter configuration
    │   │   ├── moxing_adapter.py        // ModelArts device configuration
    │   │   ├── device_adapter.py        // device configuration
    │   │   └── local_adapter.py         // local device configuration
    │   ├── openposenet.py               // openpose architecture
    │   ├── loss.py                      // loss function
    │   ├── dataset.py                   // data preprocessing
    │   ├── utils.py                     // utilities
    │   └── gen_ignore_mask.py           // script for generating mask data
    ├── export.py                        // model conversion script
    ├── train.py                         // training script
    ├── eval.py                          // evaluation script
    ├── mindspore_hub_conf.py            // hub config file
    └── default_config.yaml              // config file

Script Parameters

Parameters for both training and evaluation can be set in default_config.yaml.

  • config for openpose

    'imgpath_train': 'path to dataset'               # absolute full path to the train and evaluation datasets
    'vgg_path': 'path to vgg model'                  # absolute full path to the vgg19 model
    'save_model_path': 'path of saving models'       # absolute full path for output models
    'load_pretrain': 'False'                         # whether to train based on the pre-trained model
    'pretrained_model_path': ''                      # path of the pre-trained model to load
    'lr': 1e-4                                       # initial learning rate
    'batch_size': 10                                 # training batch size
    'lr_gamma': 0.1                                  # factor the lr is multiplied by when a step in lr_steps is reached
    'lr_steps': '100000,200000,250000'               # the steps at which lr is multiplied by lr_gamma
    'loss_scale': 16384                              # the loss scale for mixed precision
    'max_epoch_train': 60                            # total training epochs
    'insize': 368                                    # image size used as input to the model
    'keep_checkpoint_max': 1                         # only keep the last keep_checkpoint_max checkpoints
    'log_interval': 100                              # the interval (in steps) for printing logs
    'ckpt_interval': 5000                            # the interval (in steps) for saving checkpoints

For more configuration details, please refer to the script default_config.yaml.
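
To make the interaction of lr, lr_gamma, and lr_steps concrete, the resulting piecewise-constant schedule can be sketched in plain Python (illustrative only; total_steps is an arbitrary assumption here):

```python
def build_lr(base_lr=1e-4, lr_gamma=0.1, lr_steps=(100000, 200000, 250000), total_steps=300000):
    """lr starts at base_lr and is multiplied by lr_gamma each time a step in lr_steps is reached."""
    boundaries = set(lr_steps)
    lr, schedule = base_lr, []
    for step in range(total_steps):
        if step in boundaries:
            lr *= lr_gamma
        schedule.append(lr)
    return schedule

lrs = build_lr()
print(lrs[0], lrs[100000], lrs[200000], lrs[250000])  # 1e-04, 1e-05, 1e-06, 1e-07 (up to float rounding)
```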

Training Process

Training

  • running on Ascend

    python train.py --imgpath_train ./train2017 --jsonpath_train ./person_keypoints_train2017.json --maskpath_train ./ignore_mask_train2017 --vgg_path ./vgg19-0-97_5004.ckpt > train.log 2>&1 &
    

    The python command above will run in the background; you can view the results through the file train.log.

    After training, you'll get some checkpoint files under the script folder by default. The loss values will be displayed as follows:

    # grep "epoch " train.log
    epoch[0], iter[23], mean loss is 0.292112287
    epoch[0], iter[123], mean loss is 0.060355084
    epoch[0], iter[223], mean loss is 0.026628130
    ...
    

    The model checkpoint will be saved in the directory specified by 'save_model_path' in default_config.yaml.

  • running on ModelArts

    If you want to train the model on ModelArts, you can refer to the [official guidance document of ModelArts](https://support.huaweicloud.com/modelarts/).

#  Example of using distributed training of openpose on ModelArts:
#  Dataset storage layout
#   ├── openpose_dataset
#       ├── annotations
#           ├─person_keypoints_train2017.json
#           └─person_keypoints_val2017.json
#       ├─ignore_mask_train2017
#       ├─ignore_mask_val2017
#       ├─train2017
#       └─val2017
#       └─checkpoint
#       └─pre_trained
#
# (1) Choose either a (modify the yaml file parameters) or b (modify the parameters when creating a ModelArts training job).
#       a. set "enable_modelarts=True"
#          set "vgg_path=/cache/data/pre_trained/vgg19-0-97_5004.ckpt"
#          set "maskpath_train=/cache/data/ignore_mask_train2017"
#          set "jsonpath_train=/cache/data/annotations/person_keypoints_train2017"
#          set "save_model_path=/cache/train/checkpoint"
#          set "imgpath_train=/cache/data/train2017"
#
#       b. Add "enable_modelarts=True" on the ModelArts interface.
#          Set the parameters required by method a on the ModelArts interface.
#          Note: the path parameters do not need to be quoted.
#
# (2) Set the path of the network configuration file "_config_path=/The path of config in default_config.yaml/".
# (3) Set the code path on the ModelArts interface "/path/openpose".
# (4) Set the model's startup file on the ModelArts interface "train.py".
# (5) Set the data path of the model on the ModelArts interface ".../openpose_dataset" (the openpose_dataset folder path),
#     the output path of the model "Output file path", and the log path of the model "Job log path".
# (6) Start training the model.

# Example of using the model for inference on ModelArts
# (1) Place the trained model at the corresponding position in the bucket.
# (2) Choose either a or b.
#       a. set "enable_modelarts=True"
#          set "ann=/cache/data/annotations/person_keypoints_val2017"
#          set "output_img_path=/cache/data/output_imgs/"
#          set "imgpath_val=/cache/data/val2017"
#          set "model_path=/cache/data/checkpoint/0-80_663.ckpt"
#
#       b. Add "enable_modelarts=True" on the ModelArts interface.
#          Set the parameters required by method a on the ModelArts interface.
#          Note: the path parameters do not need to be quoted.
#
# (3) Set the path of the network configuration file "_config_path=/The path of config in default_config.yaml/".
# (4) Set the code path on the ModelArts interface "/path/openpose".
# (5) Set the model's startup file on the ModelArts interface "eval.py".
# (6) Set the data path of the model on the ModelArts interface ".../openpose_dataset" (the openpose_dataset folder path),
#     the output path of the model "Output file path", and the log path of the model "Job log path".
# (7) Start model inference.

Evaluation Process

Evaluation

  • running on Ascend

    Before running the command below, please check the checkpoint path used for evaluation. Set the checkpoint path to an absolute full path, e.g., "username/openpose/outputs/*time*/0-6_30000.ckpt".

    python eval.py --model_path path_to_eval_model.ckpt --imgpath_val ./dataset/val2017 --ann ./dataset/annotations/person_keypoints_val2017.json > eval.log 2>&1 &
    OR
    bash scripts/run_eval_ascend.sh [MODEL_PATH] [IMGPATH_VAL] [ANN]
    

    The above python command will run in the background. You can view the results through the file "eval.log". The accuracy on the validation dataset will be as follows:

    # grep "AP" eval.log
    
    {'AP': 0.40250956300341397, 'Ap .5': 0.6658941566481336, 'AP .75': 0.396047897339743, 'AP (M)': 0.3075356543635785, 'AP (L)': 0.533772768618845, 'AR': 0.4519836272040302, 'AR .5': 0.693639798488665, 'AR .75': 0.4570214105793451, 'AR (M)': 0.32155148866429945, 'AR (L)': 0.6330360460795242}
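
    The AP/AR fields above are the standard COCO keypoint metrics. For reference, they can be recomputed from a COCO-format results file with pycocotools, as sketched below (person_keypoints_results.json is a hypothetical name for the predictions file produced during evaluation):

```python
from pycocotools.coco import COCO
from pycocotools.cocoeval import COCOeval

coco_gt = COCO('./dataset/annotations/person_keypoints_val2017.json')
coco_dt = coco_gt.loadRes('person_keypoints_results.json')  # hypothetical predictions file
coco_eval = COCOeval(coco_gt, coco_dt, iouType='keypoints')
coco_eval.evaluate()
coco_eval.accumulate()
coco_eval.summarize()  # prints AP, AP .5, AP .75, AP (M), AP (L), AR, ...
```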
    
    
  • Export MindIR on ModelArts

    Export MindIR example on ModelArts. The data storage layout is the same as for training.

# (1) Choose either a (modify the yaml file parameters) or b (modify the parameters when creating a ModelArts training job).
#       a. set "enable_modelarts=True"
#          set "file_name=openpose"
#          set "file_format=MINDIR"
#          set "ckpt_file=/cache/data/checkpoint file name"
#
#       b. Add "enable_modelarts=True" on the ModelArts interface.
#          Set the parameters required by method a on the ModelArts interface.
#          Note: the path parameters do not need to be quoted.
# (2) Set the path of the network configuration file "_config_path=/The path of config in default_config.yaml/".
# (3) Set the code path on the ModelArts interface "/path/openpose".
# (4) Set the model's startup file on the ModelArts interface "export.py".
# (5) Set the data path of the model on the ModelArts interface ".../openpose_dataset/checkpoint" (the openpose_dataset/checkpoint folder path),
#     the output path of the model "Output file path", and the log path of the model "Job log path".
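
For reference, the same conversion can be done locally with the MindSpore export API, as sketched below. The import path and class name for the network, and its constructor arguments, are assumptions; insize=368 comes from default_config.yaml, and the checkpoint name reuses the example above.

```python
import numpy as np
from mindspore import Tensor, export, load_checkpoint, load_param_into_net
# assumed import path and class name for the network defined in src/openposenet.py
from src.openposenet import OpenPoseNet

net = OpenPoseNet()  # constructor arguments, if any, are omitted here as an assumption
load_param_into_net(net, load_checkpoint('0-80_663.ckpt'))  # checkpoint name from the example above

# insize=368 in default_config.yaml; NCHW input layout
dummy_input = Tensor(np.zeros([1, 3, 368, 368], np.float32))
export(net, dummy_input, file_name='openpose', file_format='MINDIR')
```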

Model Description

Performance

Evaluation Performance

| Parameters                 | Ascend                                                                           |
| -------------------------- | -------------------------------------------------------------------------------- |
| Model Version              | openpose                                                                          |
| Resource                   | Ascend 910; CPU 2.60GHz, 192 cores; Memory 755G; OS Euler2.8                      |
| Uploaded Date              | 12/14/2020 (month/day/year)                                                       |
| MindSpore Version          | 1.0.1                                                                             |
| Training Parameters        | epoch=60(1pcs)/80(8pcs), steps=30k(1pcs)/5k(8pcs), batch_size=10, init_lr=0.0001  |
| Optimizer                  | Adam(1pcs)/Momentum(8pcs)                                                         |
| Loss Function              | MSE                                                                               |
| Outputs                    | pose                                                                              |
| Speed                      | 1pcs: 35fps, 8pcs: 230fps                                                         |
| Total Time                 | 1pcs: 22.5h, 8pcs: 5.1h                                                           |
| Checkpoint for Fine tuning | 602.33M (.ckpt file)                                                              |