Current Bioinformatics

Author(s): Xun Wang, Yuan Gao, Haonan Song, Zhiyi Pan and Xianjin Xie*

DOI: 10.2174/0115748936330905241220203450

DownloadDownload PDF Flyer Cite As
InConTPSS: Multi-scale Module Based Temporal Convolutional Networks for Accurate Protein Secondary Prediction
  • * (Excluding Mailing and Handling)

Abstract

Background: Protein secondary structure prediction is an important task in bioinformatics and structural biology. Protein’s structure is the basis for its corresponding function. Experimental methods for determining the tertiary structure of proteins are both costly and time-consuming. Since the tertiary structure of proteins is further formed by secondary structure, leveraging computational approaches for efficient prediction of protein secondary structure is important. Both local and global interactions between amino acids affect the prediction results.

Objective: We propose a module aimed at processing sequence profile features for deep feature extraction and constructing a lightweight network to extract fused features.

Methods: To enhance the network’s ability to capture both local and global interactions, we propose an efficient method InConTPSS, which integrates convolution operation with different receptive fields and temporal convolutional networks in the inception architecture. Concurrently, InConTPSS takes into account the issue of distribution imbalance across various states of secondary structures and improves the predictive performance of scarce categories.

Results: Experimental results on six benchmark datasets (including CASP12, CASP13, CASP14, CB513, TEST2016, and TEST2018) demonstrate our method achieves state-of-the-art performance with a simpler model on both 3-state and 8-state secondary structure prediction.

Conclusion: Through the combination of the convolutional layer and temporal convolutional network, the inception network structure can effectively process the fused features and improve the prediction results. InConTPSS achieves the most advanced performance in protein secondary structure prediction, and the reasonable use of label-distribution-aware margin loss in our method can effectively improve the prediction accuracy of scarce secondary structures.

Keywords: Protein secondary structure prediction, deep learning, convolutional neural network, temporal convolutional network, protein language model, profile feature, embedding feature.