Ascend/ModelZoo-TensorFlow

基本信息
概述
训练环境准备
快速上手
迁移学习指导
高级参考

基本信息

应用领域（Application Domain）：Natural Language Processing

版本（Version）：1.1

修改时间（Modified）：2021.7.17

大小（Size）：552K

框架（Framework）：TensorFlow 1.15.0

模型格式（Model Format）：ckpt

精度（Precision）：Mixed

应用级别（Categories）：Official

描述（Description）：基于TensorFlow框架的BERT网络在CLUE数据集上的finetune代码

概述

BERT是一种与训练语言表示的方法，这意味着我们在大型文本语料库（如维基百科）上训练一个通用的”语言理解“模型，然后将该模型用于我们关心的下游NLP任务（如问答）。该工程提供了在CLUE数据集上finetune的方法。

参考论文：

https://arxiv.org/abs/1810.04805.
参考实现：

https://github.com/CLUEbenchmark/CLUE

https://github.com/chineseGLUE/chineseGLUE
适配昇腾 AI 处理器的实现：

https://gitee.com/ascend/ModelZoo-TensorFlow/tree/master/TensorFlow/built-in/nlp/Bert-CLUE_for_TensorFlow

通过Git获取对应commit_id的代码方法如下：

git clone {repository_url}    # 克隆仓库的代码
cd {repository_name}    # 切换到模型的代码仓目录
git checkout  {branch}    # 切换到对应分支
git reset --hard ｛commit_id｝     # 代码设置到对应的commit_id
cd ｛code_path｝    # 切换到模型代码所在路径，若仓库下只有该模型，则无需切换

默认配置

训练超参
- train_batch_size: 32
- learning_rate: 2e-5
- num_train_epochs: 3.0
- max_seq_length: 128

支持特性

特性列表	是否支持
分布式训练	是
混合精度	是
并行数据	否

混合精度训练

昇腾910 AI处理器提供自动混合精度功能，可以针对全网中float32数据类型的算子，按照内置的优化策略，自动将部分float32的算子降低精度到float16，从而在精度损失很小的情况下提升系统性能并减少内存使用。

开启混合精度

脚本已默认开启混合精度，设置precision_mode参数的脚本参考如下。

run_config = NPURunConfig(
      model_dir=FLAGS.output_dir,
      save_checkpoints_steps=FLAGS.save_checkpoints_steps,
      iterations_per_loop=FLAGS.iterations_per_loop,
      session_config=config,
      precision_mode="allow_mix_precision",
      keep_checkpoint_max=5)

训练环境准备

硬件环境准备请参见各硬件产品文档"驱动和固件安装升级指南"。需要在硬件设备上安装与CANN版本配套的固件与驱动。
宿主机上需要安装Docker并登录Ascend Hub中心获取镜像。

当前模型支持的镜像列表如表1所示。

表 1 镜像列表
镜像名称

镜像版本

配套CANN版本
ARM架构：ascend-tensorflow-arm
x86架构：ascend-tensorflow-x86
20.2.0

20.2

镜像名称	镜像版本	配套CANN版本
ARM架构：ascend-tensorflow-arm x86架构：ascend-tensorflow-x86	20.2.0	20.2

快速上手

数据集准备

模型训练使用TNEWS和MSRANER数据集，参考源代码提供路径下载。
预处理模型使用BERT-Base，Uncased，参考源代码提供路径下载。下载的文件夹中应该含有预处理模型，vocab.txt和bert_config.json。
数据集和预处理模型下载完成后，放入模型目录下，在训练脚本中指定数据集和模型路径，可正常使用。

模型训练

单击“立即下载”，并选择合适的下载方式下载源码包。
启动训练之前，首先要配置程序运行相关环境变量。

环境变量配置信息参见：

Ascend 910训练平台环境变量设置

将环境变量配置到scrpts/run_*.sh中
TNEWS训练

启动单卡训练

修改test/train_full_tnews_1p.sh中的data_path，里面包括提前下载的数据集和预训练模型。
```
cd test
bash train_full_tnews_1p.sh
```
MSRANER训练

启动单卡训练

修改test/train_full_msraner_8p.sh中的data_path，里面包括提前下载的数据集和预训练模型。
```
cd test
bash run_8p.sh
```

高级参考

脚本和示例代码

└─BertMRPC_for_TensorFlow
    ├─test
    |     ├─train_full_tnews_1p.sh
    |     ├─train_full_msraner_8p.sh
    ├─CONTRIBUTING.md
    ├─create_pretraining_data.py
    ├─evaluate-v1.1.py
    ├─extract_features.py
    ├─gpu_environment.py
    ├─LICENSE
    ├─modeling.py
    ├─modeling_test.py
    ├─multilingual.md
    ├─optimization.py
    ├─optimization_test.py
    ├─README.md
    ├─run.sh
    ├─run_classifier.py
    ├─run_classifier_with_tfhub.py
    ├─run_pretraining.py
    ├─run_squad.py
    ├─tokenization.py
    └─tokenization_test.py

脚本参数

TNEWS:
python3 run_classifier.py \
  --task_name=tnews \
  --do_train=true \
  --do_eval=true \
  --data_dir=$data_path/tnews \
  --vocab_file=$data_path/chinese_L-12_H-768_A-12/vocab.txt \
  --bert_config_file=$data_path/chinese_L-12_H-768_A-12/bert_config.json \
  --init_checkpoint=$data_path/chinese_L-12_H-768_A-12/bert_model.ckpt \
  --max_seq_length=128 \
  --train_batch_size=${batch_size} \
  --learning_rate=2e-5 \
  --num_train_epochs=3.0 \
  --output_dir=tnews_output \
  
MSRANER:
python3 run_ner.py \
  --task_name=msraner \
  --do_train=true \
  --do_predict=true \
  --data_path=$data_path/msraner \
  --vocab_file=$data_path/chinese_L-12_H-768_A-12/vocab.txt \
  --bert_config_file=$data_path/chinese_L-12_H-768_A-12/bert_config.json \
  --init_checkpoint=$data_path/chinese_L-12_H-768_A-12/bert_model.ckpt \
  --max_seq_length=128 \
  --train_batch_size=${batch_size} \
  --learning_rate=2e-5 \
  --num_train_epochs=3.0 \
  --output_dir=msraner_output \

训练过程

通过“模型训练”中的训练指令启动性能或者精度训练。单卡和多卡通过运行不同脚本，支持单卡网络训练。
参考脚本的模型存储路径为test/output/*。