Based on the script `run_glue.py`.

Fine-tuning the library models for sequence classification on the GLUE benchmark (General Language Understanding Evaluation). This script can fine-tune the following models: BERT, XLM, XLNet and RoBERTa.
GLUE is made up of a total of 9 different tasks. We get the following results on the dev set of the benchmark with an uncased BERT base model (the checkpoint `bert-base-uncased`). All experiments ran on a single V100 GPU with a total train batch size between 16 and 64. Some of these tasks have a small dataset, so training can lead to high variance in the results between different runs. We report the median of 5 runs (with different seeds) for each of the metrics.
Task | Metric | Result |
---|---|---|
CoLA | Matthews corr | 57.9 |
SST-2 | Accuracy | 92.8 |
MRPC | F1 | 89.4 |
STS-B | Spearman corr. | 89.1 |
QQP | F1 | 91.4 |
MNLI | Accuracy | 84.3 |
QNLI | Accuracy | 91.6 |
RTE | Accuracy | 71.5 |
WNLI | Accuracy | 56.3 |
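The per-metric median of 5 runs can be taken by sorting the scores and picking the middle value. As a small illustration (the five scores below are made up for demonstration, not taken from actual runs):

```bash
# Five hypothetical per-seed scores; the median of 5 sorted values is the 3rd
printf '57.1\n58.3\n57.9\n56.8\n58.0\n' | sort -n | sed -n '3p'
# prints 57.9
```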
Some of these results are significantly different from the ones reported on the test set of the GLUE benchmark on the website. For QQP and WNLI, please refer to FAQ #12 on the website.
Before running any one of these GLUE tasks you should download the GLUE data by running the following line at the root of the repo:

```bash
python utils/download_glue_data.py --data_dir /path/to/glue --tasks all
```

after replacing `/path/to/glue` with a value that you like. Then you can run:
```bash
export GLUE_DIR=/path/to/glue
export TASK_NAME=MRPC

python run_glue.py \
  --model_name_or_path bert-base-cased \
  --task_name $TASK_NAME \
  --do_train \
  --do_eval \
  --data_dir $GLUE_DIR/$TASK_NAME \
  --max_seq_length 128 \
  --per_device_train_batch_size 32 \
  --learning_rate 2e-5 \
  --num_train_epochs 3.0 \
  --output_dir /tmp/$TASK_NAME/
```
where `$TASK_NAME` can be one of CoLA, SST-2, MRPC, STS-B, QQP, MNLI, QNLI, RTE, WNLI.
The dev set results will be present within the text file `eval_results.txt` in the specified `output_dir`. In the case of MNLI, since there are two separate dev sets (matched and mismatched), there will be a separate output folder called `/tmp/MNLI-MM/` in addition to `/tmp/MNLI/`.
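The metric keys written to `eval_results.txt` depend on the task. Assuming the common `key = value` line format (an assumption; the file below is a fabricated stand-in, not real output), a single score can be pulled out with `awk`:

```bash
# Fabricated example file; the real keys depend on the task (e.g. acc, f1, mcc)
printf 'acc = 0.853\nf1 = 0.894\n' > /tmp/eval_results_demo.txt

# Extract the value for the "f1" key
awk -F' = ' '$1 == "f1" {print $2}' /tmp/eval_results_demo.txt
# prints 0.894
```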
The code has not been tested with half-precision training with apex on any GLUE task apart from MRPC, MNLI, CoLA and SST-2. The following provides details on how to run half-precision training with MRPC. That being said, there shouldn't be any issues in running half-precision training with the remaining GLUE tasks as well, since the data processor for each task inherits from the base class `DataProcessor`.
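Half-precision training on MRPC amounts to adding the `--fp16` flag to the usual command. This is a sketch rather than a tested recipe: it assumes apex is installed and that this copy of `run_glue.py` exposes `--fp16`:

```bash
export GLUE_DIR=/path/to/glue

python run_glue.py \
  --model_name_or_path bert-base-cased \
  --task_name MRPC \
  --do_train \
  --do_eval \
  --data_dir $GLUE_DIR/MRPC \
  --max_seq_length 128 \
  --per_device_train_batch_size 32 \
  --learning_rate 2e-5 \
  --num_train_epochs 3.0 \
  --output_dir /tmp/MRPC/ \
  --fp16
```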
The following is an example of how we run the pruning experiment on MRPC.

Step 1: Fine-tuning

```bash
cd examples/text-classification
bash bert-test_glue_finetune_MRPC.sh
```

Step 2: Re-weighted training

```bash
bash bert-test_prune_MRPC_tensor_tile.sh
```

Step 3: Prune and re-weighted retraining

```bash
bash bert-test_retrain_MRPC_tensor_tile.sh
```