Background: G-protein coupled receptors (GPCRs) represent a large family of membrane proteins, distinguished by their seven-transmembrane helical structures. These receptors play a pivotal role in numerous physiological processes. Nowadays, many researchers have proposed computational methods to identify GPCRs. In the past, we introduced a powerful method, EMCBOW-GPCR, which was designed for this purpose. However, the feature extraction technique employed is susceptible to out-of-vocabulary errors, indicating the potential for enhanced accuracy in GPCR identification.
Methods: To solve the challenges, we propose a novel approach termed GPCR-AFPN. This method leverages the FastText algorithm to effectively extract features from protein sequences. Additionally, it employs a powerful deep neural network as the predictive model to improve prediction accuracy.
Results: To validate the efficacy of the proposed GPCR-AFPN method, we conducted five-fold cross-validation and independent tests, respectively. The experimental results indicate that GPCRAFPN outperforms existing methods.
Conclusion: Overall, our proposed method, GPCR-AFPN, can improve the accuracy of GPCR identification. For the convenience of researchers interested in applying our latest advancements, a userfriendly webserver for GPCR-AFPN is available at www.lzzzlab.top/gpcrafpn/, and the corresponding code can be downloaded at https://github.com/454170054/GPCR-AFPN.
Keywords: GPCRs, word embedding, deep learning, feature extraction, webserver