Abstract
Background: Microbe-disease associations are integral to understanding complex diseases
and their screening procedures.
Objective: While numerous computational methods have been developed to detect these associations,
their performance remains limited due to inadequate utilization of weighted inherent similarities
and microbial taxonomy hierarchy. To address this limitation, we have introduced
WTHMDA (weighted taxonomic heterogeneous network-based microbe-disease association), a
novel deep learning framework.
Methods: WTHMDA combines a weighted graph convolution network and the microbial taxonomy
common tree to predict microbe-disease associations effectively. The framework extracts multiple
microbe similarities from the taxonomy common tree, facilitating the construction of a microbe-
disease heterogeneous interaction network. Utilizing a weighted DeepWalk algorithm, node
embeddings in the network incorporate weight information from the similarities. Subsequently, a
deep neural network (DNN) model accurately predicts microbe-disease associations based on this
interaction network.
Results: Extensive experiments on multiple datasets and case studies demonstrate WTHMDA's
superiority over existing approaches, particularly in predicting unknown associations.
Conclusion: Our proposed method offers a new strategy for discovering microbe-disease linkages,
showcasing remarkable performance and enhancing the feasibility of identifying disease risk.
Keywords:
Microbe-disease associations, deep learning, weighted deep walk, weighted graph convolution networks, heterogeneous network, taxonomy common tree.
Graphical Abstract
[11]
Shen X, Chen Y, Jiang X, Hu X, He T, Yang J. Predicting disease-microbe association by random walking on the heterogeneous network. In: BioinformBiomed. 2016.
[14]
Chen X, Huang Y-A, You Z-H, Yan G, Wang X. A novel approach based on KATZ measure to predict associations of human microbiota with non-infectious diseases. Bioinformatics 2016.
[20]
Jiang C, Tang M, Jin S, Huang W, Liu X. KGNMDA: A knowledge graph neural network method for predicting microbe-disease associations. IEEE/ACM Trans Comput Biol Bioinform 2022.
[22]
Mikolov T, Chen K, Corrado GS, Dean J. Eds. Efficient estimation of word representations in vector space. Int Conf Learn Represent.
[23]
Kipf T, Welling M. Semi-supervised classification with graph convolutional networks. ArXivabs/160902907 2016.
[24]
Wu Z, Palmer M. Semantics and lexical selection. ArXivabs/cmplg/9406033 1994.
[25]
Leacock C, Chodorow M. Eds, Combining local context and wordnet similarity for word sense identification An electronic lexical database. MIT PressEditors 1998; pp. 265-83.
[26]
Lin D. Ed An information-theoretic definition of similarity. Int Conf Mach Learn.
[27]
Jiang JJ, Conrath DW. Eds, Semantic similarity based on corpus statistics and lexical taxonomy. ROCLING/IJCLCLP 1997.
[31]
Mikolov T, Chen K, Corrado GS, Dean J. Eds. Efficient estimation of word representations in vector space. arXiv:13013781 2013.
[33]
Hamilton WL, Ying R, Leskovec J. Inductive representation learning on large graphs. Proceedings of the 31st International Conference on Neural Information Processing Systems 1025-35.
[38]
Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G. Eds. PyTorch: An imperative style.high-performance deep learning library. arXiv:191201703 2019.
[39]
Wang M, Zheng D, Ye Z, Gan Q, Li M, Song X. Eds. Deep graph library: A graph-centric.highly-performant package for graph neural networks. arXiv:190901315 2019.