This DGL example implements the GNN models proposed in the papers Directional Message Passing for Molecular Graphs and Fast and Uncertainty-Aware Directional Message Passing for Non-Equilibrium Molecules. For the original implementation, see here.
Contributor: xnuohz
The codebase is implemented in Python 3.6. For the version requirements of the packages, see below.
click 7.1.2
dgl 0.5.3
logzero 1.6.3
numpy 1.19.5
pandas 1.1.5
ruamel.yaml 0.16.12
scikit-learn 0.24.1
scipy 1.5.4
sympy 1.7.1
torch 1.7.0
tqdm 4.56.0
DGL's built-in QM9 dataset is used. Dataset summary:
- Molecular Graphs: 130,831
- Number of Tasks: 12
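The snippet below is a minimal sketch of loading this dataset through `dgl.data.QM9Dataset`; the constructor arguments (`label_keys`, `cutoff`) and node-feature names (`R`, `Z`) follow the `dgl.data` API at the time of writing and may differ across DGL versions.

```python
from dgl.data import QM9Dataset

# Load dipole moment ('mu') labels; cutoff is the radius (in angstrom)
# used to build edges between atoms.
dataset = QM9Dataset(label_keys=['mu'], cutoff=5.0)
print(len(dataset))  # 130,831 molecular graphs

g, label = dataset[0]
print(g.ndata['R'].shape)  # atom coordinates
print(g.ndata['Z'].shape)  # atomic numbers
print(label)               # regression target(s) for this molecule
```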
Note: DimeNet++ is recommended over DimeNet.
Parameter | Type | Description | Default
---|---|---|---
emb_size | int | Embedding size used throughout the model | 128
out_emb_size | int | Output embedding size used in DimeNet++ | 256
int_emb_size | int | Embedding size used for interaction triplets in DimeNet++ | 64
basis_emb_size | int | Basis embedding size used in DimeNet++ | 8
num_blocks | int | Number of building blocks to be stacked | 6
num_bilinear | int | Third dimension of the bilinear layer tensor in DimeNet | 8
num_spherical | int | Number of spherical harmonics | 7
num_radial | int | Number of radial basis functions | 6
envelope_exponent | int | Shape of the smooth cutoff | 5
cutoff | float | Cutoff distance for interatomic interactions | 5.0
extensive | bool | Whether the readout that produces the graph-level prediction is extensive (sum) rather than intensive (mean) | True
num_before_skip | int | Number of residual layers in the interaction block before the skip connection | 1
num_after_skip | int | Number of residual layers in the interaction block after the skip connection | 2
num_dense_output | int | Number of dense layers in the output blocks | 3
targets | list | List of targets to predict | ['mu']
output_init | string | Weight initialization function for the output layer | 'GlorotOrthogonal'
num_train | int | Number of training samples | 110000
num_valid | int | Number of validation samples | 10000
data_seed | int | Random seed | 42
lr | float | Learning rate | 0.001
weight_decay | float | Weight decay | 0.0001
ema_decay | float | EMA decay | 0.999
batch_size | int | Batch size | 100
epochs | int | Number of training epochs | 800
early_stopping | int | Number of patience epochs to wait before early stopping | 20
num_workers | int | Number of subprocesses used for data loading | 18
gpu | int | GPU index (0 selects cuda:0) | 0
interval | int | Interval between model evaluations | 50
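As a hedged illustration of how the options above could be grouped into a YAML config and read back with `ruamel.yaml` (the actual key layout of `src/config/dimenet_pp.yaml` may differ), consider:

```python
from ruamel.yaml import YAML

# Default values copied from the table above, grouped into two
# hypothetical sections: model hyperparameters and training settings.
config = {
    'model': {
        'emb_size': 128, 'out_emb_size': 256, 'int_emb_size': 64,
        'basis_emb_size': 8, 'num_blocks': 6, 'num_spherical': 7,
        'num_radial': 6, 'envelope_exponent': 5, 'cutoff': 5.0,
        'extensive': True, 'num_before_skip': 1, 'num_after_skip': 2,
        'num_dense_output': 3, 'targets': ['mu'],
        'output_init': 'GlorotOrthogonal',
    },
    'train': {
        'num_train': 110000, 'num_valid': 10000, 'data_seed': 42,
        'lr': 0.001, 'weight_decay': 0.0001, 'ema_decay': 0.999,
        'batch_size': 100, 'epochs': 800, 'early_stopping': 20,
        'num_workers': 18, 'gpu': 0, 'interval': 50,
    },
}

yaml = YAML()
with open('dimenet_pp_example.yaml', 'w') as f:  # hypothetical file name
    yaml.dump(config, f)

with open('dimenet_pp_example.yaml') as f:       # read it back
    loaded = yaml.load(f)
print(loaded['model']['emb_size'])
```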
The following commands train a neural network and predict on the test set. Train a DimeNet++ model on the QM9 dataset:
python src/main.py --model-cnf src/config/dimenet_pp.yaml
Convert a TensorFlow checkpoint to a PyTorch model:
python src/convert_tf_ckpt_to_pytorch.py --model-cnf src/config/dimenet_pp.yaml --convert-cnf src/config/convert.yaml
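The conversion script's exact logic is driven by the two YAML configs; as a rough, generic sketch of how TensorFlow checkpoint variables can be copied into a PyTorch `state_dict` (the checkpoint path and the name mapping below are hypothetical), one way is:

```python
import tensorflow as tf
import torch

# Read variables from a TensorFlow checkpoint (requires TensorFlow installed).
reader = tf.train.load_checkpoint('path/to/tf_checkpoint')  # hypothetical path
shapes = reader.get_variable_to_shape_map()

# Hypothetical mapping from TF variable names to PyTorch parameter names.
NAME_MAP = {
    'emb_block/embeddings': 'emb_block.embedding.weight',
}

state_dict = {}
for tf_name, torch_name in NAME_MAP.items():
    assert tf_name in shapes, f'{tf_name} not found in checkpoint'
    array = reader.get_tensor(tf_name)
    # Dense kernels are stored as (in, out) in TF but (out, in) in torch.nn.Linear.
    if tf_name.endswith('kernel'):
        array = array.T
    state_dict[torch_name] = torch.from_numpy(array)

torch.save(state_dict, 'dimenet_pp_converted.pt')  # hypothetical output file
```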
Differences from the original implementation:
- batch size is different
- linear learning rate warm-up is not used
- exponential learning rate decay is not used
- exponential moving average (EMA) is not used (see the sketch below)
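For reference, a minimal sketch of the weight EMA that `ema_decay` refers to (disabled in this reproduction, so the code below is illustrative only):

```python
import copy
import torch

def make_ema(model):
    # Keep a frozen shadow copy of the model whose weights track an
    # exponential moving average of the training weights.
    ema_model = copy.deepcopy(model)
    for p in ema_model.parameters():
        p.requires_grad_(False)
    return ema_model

@torch.no_grad()
def update_ema(ema_model, model, decay=0.999):
    # shadow <- decay * shadow + (1 - decay) * current
    for ema_p, p in zip(ema_model.parameters(), model.parameters()):
        ema_p.mul_(decay).add_(p, alpha=1 - decay)
```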
Target | mu | alpha | homo | lumo | gap | r2 | zpve | U0 | U | H | G | Cv |
---|---|---|---|---|---|---|---|---|---|---|---|---|
MAE(DimeNet in Table 1) | 0.0286 | 0.0469 | 27.8 | 19.7 | 34.8 | 0.331 | 1.29 | 8.02 | 7.89 | 8.11 | 8.98 | 0.0249 |
MAE(DimeNet++ in Table 2) | 0.0297 | 0.0435 | 24.6 | 19.5 | 32.6 | 0.331 | 1.21 | 6.32 | 6.28 | 6.53 | 7.56 | 0.0230 |
MAE(DimeNet++, TF) | 0.0297/0.033 | 0.0435 | 0.0246 | 0.0195/0.0227 | -/0.0486 | 0.3312 | 0.00121 | 0.0063 | 0.00628 | 0.00653 | 0.00756 | 0.0230 |
MAE(DimeNet++, DGL) | 0.0536 | 0.0726 | 0.0389 | 0.0434 | 0.0568 | 0.6437 | 0.00520 | 0.7269 | 0.0336 | 0.0205 | 0.0477 | 0.0357 |

Note: the paper rows report the energy-related targets (homo, lumo, gap, zpve, U0, U, H, G) in meV, while the TF and DGL rows report them in eV.
Model | Original Implementation | DGL Implementation | Improvement |
---|---|---|---|
DimeNet | 2839 | 1345 | 2.1x |
DimeNet++ | 624 | 238 | 2.6x |