Pocket2Drug

Pocket2Drug is an encoder-decoder deep neural network that predicts binding drugs given protein binding sites (pockets). The pocket graphs are generated using Graphsite. The encoder is a graph neural network, and the decoder is a recurrent neural network. The SELFIES molecule representation is used as the tokenization scheme instead of SMILES. The pipeline of Pocket2Drug is illustrated below:

Usage

Dependency

Pytorch
Pytorch-geometric
Rdkit
SELFIES
Pandas
BioPandas
Numpy
Scipy

Dataset

All the related data can be downloaded here. There are two dataset files:

dataset.tar.gz: contains all binding site data in this project.
pops.tar.gz: contains information of node feature contact surface area.

Train

The configurations for training can be updated in train.yaml. Modify the pocket_dir and pop_dir entries to the paths of the extracted dataset. Modify the out_dir entry to the folder where you want to save the output results. Then,

python train.py

Inference

After training, the trained model will be saved at out_dir, and we can use it to sample predicted molecules:

python sample.py -batch_size 1024 -num_batches 1 -pocket_dir path_to_dataset_folder -popsa_dir path_to_pops_folder

Name		Name	Last commit message	Last commit date
Latest commit History 191 Commits
data		data
doc		doc
rdkit_contrib		rdkit_contrib
vocab		vocab
.gitignore		.gitignore
README.md		README.md
dataloader.py		dataloader.py
model.py		model.py
sample.py		sample.py
train.py		train.py
train.yaml		train.yaml
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Pocket2Drug

Usage

Dependency

Dataset

Train

Inference

About

Uh oh!

Releases

Packages

Languages

sailfish009/Pocket2Drug

Folders and files

Latest commit

History

Repository files navigation

Pocket2Drug

Usage

Dependency

Dataset

Train

Inference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages