This paper studies the problem of designing compact binary architectures for vision multi-layer perceptrons (MLPs). We provide extensive analysis on the difficulty of binarizing vision MLPs and find that previous binarization methods perform poorly due to the limited capacity of binary MLPs. In contrast with traditional CNNs that utilize convolutional operations with large kernel sizes, the fully-connected (FC) layers in MLPs can be treated as convolutional layers with a kernel size of 1×1. Thus, the representation ability of the FC layers is limited when they are binarized, which restricts the capability of spatial mixing and channel mixing on the intermediate features. To this end, we propose to improve the performance of the binary MLP (BiMLP) model by enriching the representation ability of binary FC layers. We design a novel binary block that contains multiple branches to merge a series of outputs from the same stage, and also a universal shortcut connection that encourages the information flow from the previous stage. The downsampling layers are also carefully designed to reduce the computational complexity while maintaining the classification performance. Experimental results on the benchmark dataset ImageNet-1k demonstrate the effectiveness of the proposed BiMLP models, which achieve state-of-the-art accuracy compared to prior binary CNNs.
Paper: Yixing Xu, Xinghao Chen, Yunhe Wang. BiMLP: Compact Binary Architectures for Vision Multi-Layer Perceptrons. NeurIPS 2022.
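The abstract's two key observations — an FC layer applied per pixel is equivalent to a 1×1 convolution, and binarization collapses its weights to {-1, +1} — can be sketched numerically. This is a minimal NumPy illustration with an assumed XNOR-style scaling factor, not the authors' actual implementation:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))    # 4 tokens (pixels), 8 channels each
w = rng.standard_normal((8, 16))   # FC weight mapping 8 -> 16 channels

# Per-token FC layer: this matrix product is exactly a 1x1 convolution
# applied independently at every spatial location.
fc_out = x @ w

# Binarization: weights collapse to {-1, +1}, drastically limiting the
# representation ability of the layer.
w_bin = np.sign(w)
alpha = np.abs(w).mean()           # assumed XNOR-Net-style scaling factor
bin_out = (x @ w_bin) * alpha      # binarized FC output
```

Because every binary weight carries only one bit, the binarized 1×1 mixing layers lose much of their expressiveness, which motivates the multi-branch blocks and shortcut connections of BiMLP.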
An illustration of the BiMLP architecture:
Dataset used: [ImageNet2012]
├── BiMLP
│   ├── Readme.md             # descriptions about BiMLP
│   ├── src
│   │   ├── quan_conv.py      # quantized (binary) convolution layers
│   │   ├── dataset.py        # creating dataset
│   │   ├── wavemlp_20_3.py   # BiMLP model architecture
│   ├── eval.py               # evaluation script
After installing MindSpore via the official website, you can start evaluation as follows:
# infer example (GPU)
python eval.py --dataset_path dataset --platform GPU --checkpoint_path [CHECKPOINT_PATH] --checkpoint_nm BiMLP_M
The checkpoint can be produced during the training process.
result: {'acc': 0.7155689820742638} ckpt= ./BiMLP_M.ckpt
| Parameters | GPU |
| --- | --- |
| Model Version | BiMLP_M |
| Resource | GPU |
| Uploaded Date | 11/26/2022 (month/day/year) |
| MindSpore Version | 1.8.1 |
| Dataset | ImageNet2012 |
| batch_size | 64 |
| outputs | probability |
| Accuracy | 1pc: 71.56% |
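Since the model outputs class probabilities, the reported accuracy (`'acc'` in the evaluation result dict) is top-1 accuracy over the validation set. A hypothetical sketch of that computation (function and variable names are assumptions, not the repository's code):

```python
import numpy as np

def top1_accuracy(probs: np.ndarray, labels: np.ndarray) -> float:
    """Fraction of samples whose highest-probability class matches the label."""
    preds = probs.argmax(axis=1)          # predicted class per sample
    return float((preds == labels).mean())

# Toy example: first sample predicted correctly, second incorrectly.
probs = np.array([[0.1, 0.7, 0.2],
                  [0.8, 0.1, 0.1]])
labels = np.array([1, 2])
acc = top1_accuracy(probs, labels)        # 0.5
```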
In dataset.py, we set the seed inside the "create_dataset" function. We also use a random seed in train.py.
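For reproducibility, seeding typically covers every random-number source touched by data loading and augmentation. A minimal sketch of such a helper (hypothetical, not the repository's exact code; MindSpore additionally provides `mindspore.set_seed` for framework-level determinism):

```python
import random
import numpy as np

def set_seed(seed: int = 1) -> None:
    """Fix the RNGs used by shuffling and augmentation (hypothetical helper)."""
    random.seed(seed)      # Python's built-in RNG
    np.random.seed(seed)   # NumPy RNG used by many augmentation ops
    # mindspore.set_seed(seed) would also be called in the real pipeline.

set_seed(1)
first = random.random()
set_seed(1)
second = random.random()   # identical to `first` after reseeding
```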
For more details, please check the official homepage.