Exploring Complex and Heterogeneous Correlations on Hypergraph for the Prediction of Drug-Target Interactions
HHDTI is a deep learning method that uses hypergraphs to model heterogeneous biological networks, integrates various interaction information, and accurately captures the topological properties of individual in the biological network to generate suitable structured embeddings for interactions prediction.
We provide an example script to run experiments on deepDTnet_20 dataset:
- Run
python train.py
: predict drug-target interactions, and evaluate the results with 5-fold cross-validation. - You can change the dataset used, adjust the learning rate, hidden layer dimensions, etc
python train.py --dataset DTInet --lr 0.002 --hidden 64
train.py
:run HHDTI to predict drug-target interactionsmodels.py
:the modules of VHAE and HGNNlayers.py
:the layers used in VHAE and HGNNkl_loss.py
:KL divergence loss function for traininghypergraph_utils.py
:generate degree matrices of hypergraphutils_deepDTnet.py
:load deepDTnet dataset for training and testingutils_DTInet.py
:load DTInet dataset for training and testingutils_KEGG_MED.py
:load KEGG_MED dataset for training and testing
DTInet
:The DTIs in the DTINet_17 dataset used for training and testing have been divided into the form of 10-fold cross-validation, as well as disease-drug associations, disease-target associations. Dataset source: https://github.com/luoyunan/DTINetdeepDTnet
:The DTIs in the deepDTnet_20 dataset used for training and testing have been divided into the form of 10-fold cross-validation, as well as disease-drug associations, disease-target associations. Dataset source: https://github.com/ChengF-Lab/deepDTnetKEGG_MED
:The DTIs in the KEGG_MED dataset used for training and testing have been divided into the form of 10-fold cross-validation, as well as disease-drug associations, disease-target associations(the disease-drug associations matrix file is larger than 25M, and cannot be uploaded temporarily). Dataset source: http://drugtargets.insight-centre.org/
- You can also use your own dataset for experiments. Due to the different representations of datasets, it is not recommended that you use our code to load data.The input of HHDTI is 4 incidence matrices, which respectively represent drug-target interactions, target-drug interactions, disease-drug associations, disease-target associations, among which the drug-target interactions incidence matrix and the target-drug interaction incidence matrix are transposed to each other.
- python(v3.7.0)
- numpy(v1.18.1)
- pytorch(v1.1.0)
- scikit-learn(v0.22.1)
Our code is released under MIT License (see LICENSE file for details).