Aim and Background: In recent periods, micro-array data analysis using soft computing and machine learning techniques gained more interest among researchers to detect prostate cancer. Due to the small sample size of micro-array data with a larger number of attributes, traditional machine learning techniques face difficulty detecting prostate cancer.
Methodology: The selection of relevant genes exploits useful information about micro-array data, which enhances the accuracy of detection. In this research, the samples are acquired from the gene expression omnibus database, particularly related to the prostate cancer GEO IDs such as GSE 21034, GSE 15484 and GSE 3325/GSE 3998. In addition, ensemble feature optimization technique and Bidirectional Long Short Term Memory (Bi-LSTM) network are employed for detecting prostate cancer from the microarray data of gene expression.
Results: The ensemble feature optimization technique includes 4 metaheuristic optimizers that select the top 2000 genes from each GEO IDs, which are relevant to prostate cancer. Next, the selected genes are given to the Bi-LSTM network for classifying the normal and prostate cancer subjects.
Conclusion: The simulation analysis revealed that the ensemble based Bi-LSTM network obtained 99.13%, 98.97%, and 94.12% of accuracy on the GEO IDs like GSE 3325/GSE 3998, GSE 21034, and GSE 15484.