Abstract
Aims: In this paper, Forkhead box O (FOXO) protein using the ensemble learning algorithm
is predicted. When FOXO is in excess in the human body, it leads to LNCap prostate cancer
cells, and if deficit leading neurodegenerative diseases.
Objective: Neurodegenerative diseases, like Alzheimer's and Parkinson's, are neurological illnesses
that are caused by damaged brain cells. For prediction of FOXO protein, Gradient Boosted Machine
(GBM) and Random forest (RF) techniques are used.
Method: The main idea of using GBM is its non-linear nature but it is difficult for any single decision
tree to fit all training. To overcome this, an RF algorithm is used. RF combines the results at the
end of the process by average or majority rules, while the GBM algorithm combines the results along
the way.
Results: A total of 29.16% improvement has been observed by RF over GBM. Average square error
is also evaluated to check the testing and training of data for 100 trees on 100 tree sizes.
Conclusion: In this paper, a computational model for the prediction of FOXO protein using ensemble
learning techniques (Random Forest and GBM) has been proposed. If the dataset has many variable
features and the prediction accuracy is not as important then RF can be considered. On the other
hand, GBMs are better suited for datasets that have very few or fewer input features and where high
accuracy predictions are required. However, there are instances when either GBM or RF can perform
equally well depending on how they are tuned.
Keywords:
System biology, degenerative diseases, diabetes, FKHR, random forest, boosted tree.
Graphical Abstract
[3]
Jain S. Communication of signals and responses leading to cell survival/cell death using Engineered Regulatory Networks 2012.
[6]
Jain S. System modeling of AKT using linear and robust regression analysis. Curr Trends Biotechnol Pharm 2018; 12(2): 177-86.
[14]
Lalmansingh AS, Karmakar S, Jin Y, Nagaich AK. Multiple modes of chromatin remodeling by Forkhead box pro-teins.BiochimicaetBiophysicaActa (BBA)-. Gene Regulatory Mechanisms 2012; 1819(7): 707-15.
[18]
Jain S, Naik PK, Bhooshan SV. Mathematical modeling deciphering balance between cell survival and cell death using Insulin. New Biol 2011; 1(1): 46-58.
[21]
Jain S. Regression analysis on different mitogenic pathways. New Biol 2016; 6(2): 40-6.
[22]
Jain S, Bhooshan SV, Naik PK. Model of mitogen-activated protein kinases for cell survival/death and its equivalent bio-circuit. Current Research Journal of Biological Sciences 2010; 2(1): 59-71.
[24]
Weiss R. Cellular computation and communications using engineered genetic regulatory networks 2001.
[25]
Roberto C, Marcello F, Diego L, Del FA, Eduardo M. Balance between cell survival and death: A minimal quantitative model of tumor necrosis factor alpha cytotoxicity BIOLOGICAL SCIENCES. Biophysics and Computational Biology 2009; 1-31.
[30]
Jain S. Design of survival-hazard and mathematical model for high osmolarity glycerol protein using parametric and nonparametric meth-ods. Int J Emerging Technologies 2019; 10(3): 01-9.
[36]
Bonab R, Can F. Less is more: A comprehensive framework for the number of components of ensemble classifiers. IEEE Trans Neu Net Learn Sys 14(8)2018;