This is the code for the paper "DEEP-CWS: Distilling Efficient Pre-trained models with Early Exit and Pruning for Scalable Chinese Word Segmentation".
Chinese Word Segmentation is a fundamental task in Chinese NLP. This project aims to accelerate the inference speed of Chinese Word Segmentation models while maintaining high accuracy. We achieve this by combining knowledge distillation, pruning, and early exit techniques. The final model is deployed using ONNX for further optimization.
We publicly release all training scripts. 📘 For detailed training settings, see hyperparameters.md
- Python 3.9+
- torch 2.1.0
- transformers 4.38.1
.
├── run.sh # All training and evaluation commands
├── train_teacher.py # RoBERTa teacher model training
├── train_cnn_wo_distillation.py # CNN model baseline
├── train_distillation_student.py # Phase I / II distillation & refine
├── prune_cnn_model.py # Pruning interface
├── convert_2_onnx.py # ONNX export
├── hyperparameters.md # All configurable parameters
├── paper.pdf # Main paper
├── supplement.pdf # Extended results and discussion
└── README.md
DEEP-CWS consists of the following modular stages:
Pretrained RoBERTa (Teacher)
│
▼
Distillation Phase I ──► CNN Backbone
│
▼
Distillation Phase II (Gradual Unfreezing)
│
├──► Pruning (L1-based)
▼
Refined CNN
│
▼
Export to ONNX → Efficient Inference
Each stage is optional and configurable: users can stop after Phase I, or skip pruning if maximum accuracy is required.
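For concreteness, below is a minimal sketch of a Phase I soft-label distillation objective. The `temperature` and `alpha` values here are illustrative placeholders, not the settings used in the paper; see hyperparameters.md for the actual configuration.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.7):
    """Combine a soft-label KL term against the teacher with hard-label CE.

    student_logits / teacher_logits: (batch, seq_len, num_tags)
    labels: (batch, seq_len) gold segmentation tags (e.g. B/M/E/S).
    """
    # Soft targets: KL divergence between temperature-scaled distributions.
    soft_loss = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2

    # Hard targets: standard cross-entropy on the gold tags.
    hard_loss = F.cross_entropy(
        student_logits.view(-1, student_logits.size(-1)),
        labels.view(-1),
    )
    return alpha * soft_loss + (1.0 - alpha) * hard_loss
```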
Training and evaluation scripts can be run directly; run.sh collects all training and evaluation commands.
- train_teacher.py: Train the RoBERTa teacher model.
- train_cnn_wo_distillation.py: Train the CNN student baseline without knowledge distillation.
- train_distillation_student.py: Train the CNN student with knowledge distillation (Phase I / II) and refinement.
- prune_cnn_model.py: Prune the student model (L1-based; see the sketch after this list).
- prune_analysis.py: Analyze pruning results and experiment with different pruning rates.
- convert_2_onnx.py: Convert the model to ONNX format.
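As referenced above, here is a minimal sketch of L1-based magnitude pruning on the CNN backbone's convolution layers. The 30% sparsity value is illustrative; the actual pruning interface lives in prune_cnn_model.py.

```python
import torch.nn as nn
import torch.nn.utils.prune as prune

def l1_prune_conv_layers(model: nn.Module, amount: float = 0.3) -> nn.Module:
    """Zero out the `amount` fraction of smallest-magnitude weights in every
    Conv1d layer, then make the pruning permanent."""
    for module in model.modules():
        if isinstance(module, nn.Conv1d):
            # L1 unstructured pruning: mask out weights with the smallest |w|.
            prune.l1_unstructured(module, name="weight", amount=amount)
            # Bake the mask into the weight tensor and drop the reparametrization.
            prune.remove(module, "weight")
    return model
```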
The DEEP-CWS framework achieves high segmentation accuracy while significantly improving inference efficiency. The following table reports the performance of the final optimized DEEP-CWS model (after distillation, pruning, and ONNX acceleration) on four widely used Chinese Word Segmentation datasets; a minimal latency-measurement sketch follows the table.
| Dataset | F1 Score (%) | Inference Time (ms/sentence) | Model Size |
|---|---|---|---|
| PKU | 97.64 | 1.11 | ~1.1M |
| MSR | 97.72 | 0.75 | ~1.1M |
| AS | 97.59 | 0.11 | ~1.1M |
| CITYU | 96.94 | 0.79 | ~1.1M |
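Below is a minimal sketch of benchmarking the exported model with ONNX Runtime. The model path, input name, and dummy input are assumptions for illustration; convert_2_onnx.py produces the actual graph.

```python
import time
import numpy as np
import onnxruntime as ort

# Hypothetical path to the exported student model.
session = ort.InferenceSession("deep_cws_student.onnx",
                               providers=["CPUExecutionProvider"])

# Dummy input: one sentence of 32 token ids (the input name is an assumption).
input_ids = np.random.randint(0, 21128, size=(1, 32), dtype=np.int64)

start = time.perf_counter()
for _ in range(100):
    session.run(None, {"input_ids": input_ids})
elapsed_ms = (time.perf_counter() - start) / 100 * 1000
print(f"avg latency: {elapsed_ms:.2f} ms/sentence")
```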
If you use this project in your work, please cite:
@article{deepcws2025,
  title={DEEP-CWS: Distilling Efficient Pre-trained models with Early Exit and Pruning for Scalable Chinese Word Segmentation},
  author={Xu, Shiting},
  journal={TBD},
  year={2025}
}