Abstract
Protein-protein interactions (PPIs) are the physical connections between two or more proteins
via electrostatic forces or hydrophobic effects. Identification of the PPIs is pivotal, which contributes
to many biological processes including protein function, disease incidence, and therapy design.
The experimental identification of PPIs via high-throughput technology is time-consuming and
expensive. Bioinformatics approaches are expected to solve such restrictions. In this review, our main
goal is to provide an inclusive view of the existing sequence-based computational prediction of PPIs.
Initially, we briefly introduce the currently available PPI databases and then review the state-of-the-art
bioinformatics approaches, working principles, and their performances. Finally, we discuss the caveats
and future perspective of the next generation algorithms for the prediction of PPIs.
Keywords:
Protein-protein interactions, PPIs database, sequence features, feature selection, machine learning, bioinformatics.
Graphical Abstract
[7]
Ammari, MG; Gresham, CR; McCarthy, FM; Nanduri, B HPIDB 2.0: a curated database for host-pathogen interactions. Database: J. Biol. Databases Curation, 2016.baw103.
[46]
Khatun, M.S.; Hasan, M.M.; Mollah, M.N.H.; Kurata, H. SIPMA: A Systematic Identification of Protein-Protein Interactions in Zea mays Using Autocorrelation Features in a Machine-Learning Framework 2018. IEEE 18th International Conference on Bioin-formatics and Bioengineering (BIBE), Taichung, Taiwan, , 2018; pp. 122-125.
[79]
Hasan, M.M.; Khatun, M.S. Recent progress and challenges for protein pupylation sites prediction, EC Proteomics and Bioinformatics, 2017, 2(1), 36-45.
[88]
Kim, W.K.; Park, J.; Suh, J.K. Large scale statistical prediction of protein-protein interaction by potentially interacting domain (PID) pair; ; Genome Informatics, 2002, pp. 22-50.
[92]
Hasan, M.M.; Kurata, H. iLMS, Computational Identification of Lysine-Malonylation Sites by Combining Multiple Sequence Fea-tures. 2018 IEEE 18th International Conference on Bioinformatics and Bioengineering (BIBE), Taichung, Taiwan, , 2018; pp. 356-359.
[93]
Liaw, A. Wiener: Classification and regression by random forest, R News, 2002, 2, 18-22.
[115]
Hasan, MM; Manavalan, B.; Khatun, MS; Kurata, H. Prediction of S-nitrosylation sites by integrating support vector machines and random forest, Molecular Omics, 2019, 15(6), 451-458.
[133]
Hasan, M.M.; Khatun, M.S. Prediction of protein Post-Translational Modifi cation sites: an overview. bann. proteom. bioinform.,, 2018, 2, 049-057.