Abstract
Background: The small sample problem widely exists in the fields of the chemical industry,
chemistry, biology, medicine, and food industry. It has been a problem in process modeling
and system optimization. The aim of this study is to focus on the problems of small sample size in
modeling, the process parameters in the ultrasonic extraction of botanical medicinal materials can
be obtained by optimizing the extraction rate model. However, difficulty in data acquisition results
in problem of small sample size in modeling, which eventually reduces the accuracy of modeling
prediction.
Methods: A virtual sample generation method based on full factorial design (FFD) is proposed to
solve the problem ofa small sample size. The experiments are first conducted according to the Box-
Behnken Design (BBD) to obtain small-size samples, and the response surface function is established
accordingly. Then, virtual sample inputs are obtained by the FFD, and the corresponding virtual
sample outputs are calculated by the response surface function. Furthermore, a screening method of
virtual samples is proposed based on an extreme learning machine (ELM). The connection weights
of ELM are used for further optimization and screening of the generated virtual samples.
Result: The results show that virtual sample data can effectively expand the sample size. The precision
of the model trained on semi-synthetic samples (small-size experimental simples and virtual
samples) is higher than the model trained merely on small-size experimental samples.
Conclusion: The virtual sample generation and screening methods proposed in this paper can effectively
solve the modeling problem of small samples. The reliable process parameters can be obtained
by optimizing the model trained by the semi-synthetic samples.
Graphical Abstract
[1]
Zhao, K.L.; Jin, X.L.; Wang, Y.Z. Survey on few-shot learning. J. Softw., 2021, 32(2), 349-369.
[2]
He, P.; Sun, F.; Hu, X.F.; Lin, Y.P.; Duan, S.K. Optimization of prediction system for sample laser cutting process parameters. Las. J., 2021, 42(12), 170-175.
[4]
Lv, Y.Q.; Min, W.Q.; Duan, H.; Jiang, S.Q. Few-shot food recognition combining triplet convolutional neural network with relation network. Comput. Sci., 2020, 47(1), 136-143.
[6]
Yan, B.; Zhou, P.; Yan, L. Disease identification of small sample crop based on transfer learning. Mod. Agri. Sci. Tech., 2019, 6, 87-89.
[9]
Snell, J.; Swersky, K.; Zemel, R.S. Prototypical networks for few-shot learning. Adv. Neur. Inf. Proc. Syst., 2017, 2017, 30.
[10]
Vinyals, O.; Blundell, C.; Lillicrap, T.; Kavukcuoglu, K.; Wierstra, D. Matching networks for one shot learning. Adv. Neur. Inf. Proc. Syst., 2016, 2016, 29.
[11]
Lv, F.; Wang, Y.; Ruan, H.L.; Qin, Y.; Wang, P. Labeled sample augmentation based on deep embedding relation space for semi-supervised fault diagnosis of gearbox. Chinese. J. Sci. Instrum., 2021, 42(2), 55-65.
[15]
Wang, W.D.; Yang, J.Y. Quadratic discriminant analysis method based on virtual training samples. Acta. Auto. Sin., 2008, 34(34), 400-407.
[16]
Tang, J.; Wang, D.D.; Guo, Z.H.; Qiao, J.F. Prediction of dioxin emission concentration in municipal solid waste incineration process based on optimal selection of virtual Samples. J. Beijing Univ. Tech., 2021, 47(5), 431-443.
[17]
Liu, B.Y. Optimization of process parameters of single drug and compound drugs by dual-frequency ultrasound. 2019.
[18]
Sui, Y.K.; Yu, H.P. Improvement of response surface method and its application to engineering optimization; Sci. Pre, 2011, pp. 8-10.
[20]
Xu, S.H.; Mu, X.D.; Chai, D.; Luo, C. Domain adaptation algorithm with ELM parameter transfer. Acta. Auto. Sin., 2018, 44(2), 311-317.