Abstract
Background: Antifungal Peptides (AFP) have been found to be effective against many fungal
infections.
Objective: However, it is difficult to identify AFP. Therefore, it is great practical significance to identify
AFP via machine learning methods (with sequence information).
Methods: In this study, a Multi-Kernel Support Vector Machine (MKSVM) with Hilbert-Schmidt Independence
Criterion (HSIC) is proposed. Proteins are encoded with five types of features (188-bit,
AAC, ASDC, CKSAAP, DPC), and then construct kernels using Gaussian kernel function. HSIC are
used to combine kernels and multi-kernel SVM model is built.
Results: Our model performed well on three AFPs datasets and the performance is better than or comparable
to other state-of-art predictive models.
Conclusion: Our method will be a useful tool for identifying antifungal peptides.
Keywords:
Antifungal peptides, feature representation, amino acid composition, multiple kernel learning, hilbert-schmidt independence criterion, support vector machine.
Graphical Abstract
[1]
Brown GD, Denning DW, Gow NAR, Levitz SM, Netea MG, White TC. Hidden killers: Human fungal infections. Sci Transl Med 2012; 4: 165.
[23]
Bach FR, Lanckriet G. Multiple kernel learningConic duality, and the SMO 2004; 2211-68
[35]
Govindan G, Nair A. Composition, transition and distribution (ctd)
— a dynamic feature for predictions based on hierarchical structure
of cellular sorting Proceedings - 2011 Annual IEEE India Conference:
Engineering sustainable solutions, INDICON-2011.
[48]
Pedregosa F, Michel V, Varoquaux G, et al. Machine learning in python. J Mach Learn Res 2011; 12: 2825-30.
[50]
Gretton A, Bousquet O, Smola A, Schölkopf B. Measuring statistical dependence with hilbert-schmidt norms.Algorithmic learning
theory: 16th international conference, ALT 2005, . 63-78.
[51]
Gangeh MJ, Bedawi SMA, Ghodsi A, Karray F. Semi-supervised dictionary learning based on hilbert-schmidt independence criterionLecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics) 2016; 9730: 12-9.