# ResidualMaskingNetwork **Repository Path**: cumthxy/ResidualMaskingNetwork ## Basic Information - **Project Name**: ResidualMaskingNetwork - **Description**: No description available - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 1 - **Created**: 2020-04-11 - **Last Updated**: 2021-12-24 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README [![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/challenges-in-representation-learning-a/facial-expression-recognition-on-fer2013)](https://paperswithcode.com/sota/facial-expression-recognition-on-fer2013?p=challenges-in-representation-learning-a) # Facial Expression Recognition using Residual Masking Network, in PyTorch A [PyTorch](http://pytorch.org/) implementation of my thesis with the same name.

# Live Demo: - Model file: [download](https://drive.google.com/open?id=1_6CzlKRS9ksxlo0TjqIGXMzQE4I83tE0) (this checkpoint is trained on VEMO dataset, locate it at ```./saved/checkpoints/``` directory) - Download 2 files: [prototxt](https://drive.google.com/open?id=1ANVPx3JM4EcJVZOstV_kEO1Jcv74mBu5), and [res10_300x300_ssd](https://drive.google.com/open?id=1Iy_3I_mWGhBA63W0IK8tRrUuvr-WrGQ2) for face detection OpenCV. Locate at current directory or checking file path with ```ssd_infer.py``` file. ```Shell python ssd_infer.py ```

### Table of Contents - Recent Update - Benchmarking on FER2013 - Benchmarking on ImageNet - Installation - Download datasets - Training on FER2013 - Training on ImageNet - Evaluation results - Download dissertation and slide

## Recent Update - [27/02/2020] Update Tensorboard visualizations and Overleaf source - [22/02/2020] Test-time augmentation implementation. - [21/02/2020] Imagenet training code and trained weights released. - [21/02/2020] Imagenet evaluation results released. - [10/01/2020] Checking demo stuff and training procedure works on another machine - [09/01/2020] First time upload

## Benchmarking on FER2013 We benchmark our code thoroughly on two datasets: FER2013 and VEMO. Below are the results and trained weights: Model | Accuracy | ---------|--------| [ResAttNet56](https://drive.google.com/open?id=1sG3ERWLPdBkjYSaZb_ynypHD15KUSsUR) | 68.54 [VGG19](https://drive.google.com/open?id=1FPkwhmel0AiGK3UtYiWCHPi5CYkF7BRc) | 70.80 [EfficientNet\_b2b](https://drive.google.com/open?id=1pEyupTGQPoX1gj0NoJQUHnK5-mxB8NcS) | 70.80 [Googlenet](https://drive.google.com/open?id=1LvxAxDmnTuXgYoqBj41qTdCRCSzaWIJr) | 71.97 [Resnext50\_32x4d](https://drive.google.com/open?id=12AR1LUlcQlg62WU_nNxBnlpqXuEV4c-c) | 72.22 [Resnet34](https://drive.google.com/open?id=1iuTkqApioWe_IBPQ7gQHticrVlPA-xz_) | 72.42 [Inception\_v3](https://drive.google.com/open?id=17mapZKWYMdxGTrbrAbRpfgniT5onmQXO) | 72.72 [Resnet50](https://drive.google.com/open?id=1PoJPhoQP12VZ-1n8JWgUqy-w2zHSHcSp) | 72.86 [Bam\_Resnet50](https://drive.google.com/open?id=1K_gyarekwIxQMA_fEPJMApgqo3mYaM0H) | 73.14 [Densenet121](https://drive.google.com/open?id=1f8wUtQj-UatrZtCnkJFcB--X2eJS1m_N) | 73.16 [Resnet152](https://drive.google.com/open?id=1LBaHaVtu8uKiNsoTN7wl5Pg5ywh-QxRW) | 73.22 [Cbam\_Resnet50](https://drive.google.com/open?id=1i9zk8sGXiixkQGTA1txBxSuew6z_c86T) | 73.39 [Resnet101](https://drive.google.com/open?id=1GadrX04NJIqtGHA85vz-ts93JuKj54ih) | 73.47 [ResMaskingNet](https://drive.google.com/open?id=1_ASpv0QNxknMFI75gwuVWi8FeeuMoGYy) | 74.14 [ResMaskingNet + 6](https://drive.google.com/open?id=1y28VHzJcgBpW0Qn_K0XVVd-hxG4feIHG) | 76.82 Results in VEMO dataset could be found in my thesis or slide (attached below)

## Benchmarking on ImageNet We also benchmark our model on ImageNet dataset. Model | Top-1 Accuracy | Top-5 Accuracy | ---------|--------|--------| [Resnet34](https://drive.google.com/open?id=16lErBAk7K3WswKP0wyE9S0dNrr7AF6wd) | 72.59 | 90.92 [CBAM Resnet34](https://drive.google.com/open?id=16lErBAk7K3WswKP0wyE9S0dNrr7AF6wd) | 73.77 | 91.72 [ResidualMaskingNetwork](https://drive.google.com/open?id=1myjp4_XL8mNJlAbz0TFjYKUc7B0N64eb) | 74.16 | 91.91

## Installation - Install [PyTorch](http://pytorch.org/) by selecting your environment on the website and running the appropriate command. - Clone this repository and install package [prerequisites](#prerequisites) below. - Then download the dataset by following the [instructions](#datasets) below. ### prerequisites * Python 3.6+ * PyTorch 1.3+ * Torchvision 0.4.0+ * [requirements.txt](requirements.txt)

## Datasets - [FER2013 Dataset](https://drive.google.com/open?id=18ovcnZBsPvwXXFVAqczACe9zciO_1q6J) (locate it in ```saved/data/fer2013``` like ```saved/data/fer2013/train.csv```) - [ImageNet 1K Dataset](http://image-net.org/download-images) (ensure it can be loaded by torchvision.datasets.Imagenet)

## Training on FER2013 - To train network, you need to specify model name and other hyperparameters in config file (located at configs/\*) then ensure it is loaded in main file, then run training procedure by simply run main file, for example: ```Shell python main_fer.py # Example for fer2013_config.json file ``` - The best checkpoints will chosen at term of best validation accuracy, located at ```saved/checkpoints``` - The TensorBoard training logs are located at ```saved/logs```, to open it, use ```tensorboard --logdir saved/logs/```

- By default, it will train `alexnet` model, you can switch to another model by edit `configs/fer2013\_config.json` file (to `resnet18` or `cbam\_resnet50` or my network `resmasking\_dropout1`.

## Training on Imagenet dataset To perform training resnet34 on 4 V100 GPUs on a single machine: ```Shell python ./main_imagenet.py -a resnet34 --dist-url 'tcp://127.0.0.1:12345' --dist-backend 'nccl' --multiprocessing-distributed --world-size 1 --rank 0 ```

## Evaluation For student, who takes care of font family of confusion matrix and would like to write things in LaTeX, below is an example for generating a striking confusion matrix. (Read [this article](https://matplotlib.org/3.1.1/tutorials/text/usetex.html) for more information, there will be some bugs if you blindly run the code without reading). ```Shell python cm_cbam.py ```

## Ensemble method I used no-weighted sum avarage ensemble method to fusing 7 different models together, to reproduce results, you need to do some steps: 1. Download all needed trained weights and located on ```./saved/checkpoints/``` directory. Link to download can be found on Benchmarking section. 2. Edit file ```gen_results``` and run it to generate result offline for **each** model. 3. Run ```gen_ensemble.py``` file to generate accuracy for example methods.

## Dissertation and Slide - [Dissertation PDF (in Vietnamese)](https://drive.google.com/open?id=1HxqvQSZRf-3ashGtZ5o9OABdhmdjS64a) - [Dissertation Overleaf Source](https://www.overleaf.com/read/qdyhnzjmbscd) - [Presentation slide PDF (in English) with full appendix](https://drive.google.com/open?id=19zweCDX8Vz4jgwJ6cBWr5x_iQPvahsQg) - [Presentation slide Overleaf Source](https://www.overleaf.com/read/vxdhjvhvgwdn) ## TODO We have accumulated the following to-do list, which we hope to complete in the near future - Still to come: * [x] Upload all models and training code. * [x] Test time augmentation. * [x] GPU-Parallel. * [x] Pretrained model. * [x] Demo and inference code. * [x] Imagenet trained and pretrained weights. * [ ] GradCAM visualization and Pooling method for visualize activations. * [ ] Centerloss Visualizations.

## Authors * [**Luan Pham**](https://github.com/phamquiluan) * [**Tuan Anh Tran**](https://github.com/phamquiluan) ***Note:*** Unfortunately, I am currently join a full-time job and research on another topic, so I'll do my best to keep things up to date, but no guarantees. That being said, thanks to everyone for your continued help and feedback as it is really appreciated. I will try to address everything as soon as possible.

## References - Same as in dissertation. ## Citation ``` @misc{luanresmaskingnet2020, Author = {Luan Pham & Tuan Anh Tran}, Title = {Facial Expression Recognition using Residual Masking Network}, url = {https://github.com/phamquiluan/ResidualMaskingNetwork}, Year = {2020} } ```