Abstract
Aims and Objective: Cancer is one of the deadliest diseases, taking the lives of millions
every year. Traditional methods of treating cancer are expensive and toxic to normal cells.
Fortunately, anti-cancer peptides (ACPs) can eliminate this side effect. However, the identification
and development of new anti-cancer peptides through experiments take a lot of time and money,
therefore, it is necessary to develop a fast and accurate calculation model to identify the anti-cancer
peptide. Machine learning algorithms are a good choice.
Materials and Methods: In our study, a multi-classifier system was used, combined with multiple
machine learning models, to predict anti-cancer peptides. These individual learners are composed of
different feature information and algorithms, and form a multi-classifier system by voting.
Results and Conclusion: The experiments show that the overall prediction rate of each individual
learner is above 80% and the overall accuracy of multi-classifier system for anti-cancer peptides
prediction can reach 95.93%, which is better than the existing prediction model.
Keywords:
Anti-cancer peptides, machine learning, individual learner, feature extraction, multi-classifier system, prediction model.
[12]
Zhong, B. Visual tracking via weakly supervised learning from multiple imperfect oracles. Pattern Recognit., 2010, 47(3), 1323-1330.
[15]
Suykens, J.A.K.; Vandewalle, J. Least squares support vector machine classifiers. Neural Process. Lett., 1999, 9(3), 293-300.
[17]
Dong-Sheng, C.; Qing-Song, X.; Yi-Zeng, L. propy: a tool to generate various modes of Chou’s PseAAC. Bioinformatics, 2013, 29(7), 960-962.
[25]
Zhao, S. Predicting personalized image emotion perceptions in social networks. IEEE Transactions on Affective Computing, 2016, PP(99), 1-1.
[27]
Zhao, S.; Yao, H.; Gao, Y.; Ji, R.; Ding, G. Continuous probability distribution prediction of image emotions via multi-task shared sparse regression. IEEE Trans. Multimed., 2017, 19(3), 632-645.
[37]
Ke, C.; Kurgan, L.A.; Jishou, R. Prediction of protein structural class using novel evolutionary collocation-based sequence representation. J. Comput. Chem., 2008, 29(10), 1596-1604.
[39]
Quinlan, J.R. C4.5: programs for machine learning., 1992.
[40]
Thornton, C.; Hutter, F.; Hoos, H.H.; Leyton-Brown, K. Auto-
WEKA: Combined Selection and Hyperparameter Optimization of
Classification Algorithms, 2013. Technical Report TR-2012-05.
[42]
Kohavi, R. A study of cross-validation and bootstrap for accuracy estimation and model selection. In: International Joint Conference on Artificial Intelligence; , 1995.