A Hybrid Data Reduction and Knowledge Extraction Algorithm for Quality Prediction

Page: [273 - 280] Pages: 8

  • * (Excluding Mailing and Handling)

Abstract

Background: With the explosive growth of the manufacturing data, the manufacturing enterprises paid more and more attention to dealing with the manufacturing big data. The manufacturing big data also can be summarized as "5Vs”, volume, variety, velocity, veracity and value. Recently, the researchers are focused on proposing better knowledge discovery algorithms to handling the manufacturing big data.

Objective: The high dimensional data can be reduced from two directions. The one was the dimension reduction. It makes the data set simple and overcome the problem of curse dimensionality. This method reduced the data set form the data width.

Methods: We proposed a hybrid data reduction and knowledge extraction algorithm (HDRKE) for quality prediction. There are 5 steps in the algorithm: Step 1: Data preprocessing; Step 2: Dimension reduction; Step 3: Extract SVs by SVM; Step 4: Extract rules from the subset; Step 5: Prediction by the rules extracted in step 3.

Results: The presented HDRKE method reduced the data scales from the data dimensions and the data attributions. Then, the prediction method was used on the subset of reduced data. At last, the HDRKE method was applied to a enterprise sample, the validation of the method can be validated on the enterprise sample.

Conclusion: Quality prediction and control was an important procedure in manufacturing. The HDRKE algorithm was a novel method based on the attribution reduction and dimensionality reduce. The data set simplified from double direction made the data set easily to calculate. The HDRKE method also proposed a new thought of decision rules extracting on the low-embeddings. The HDRKE method also applied to a manufacturing instance and proved its validity.

Keywords: Attribution reduction, support vectors, big data, quality prediction, velocity, veracity.

Graphical Abstract

[1]
J.H. Cho, and P.U. Kurup, "Decision tree approach for classification and dimensionality reduction of electronic noise data", Sens. Actuators B Chem., vol. 160, no. 1, pp. 542-548, 2011.
[http://dx.doi.org/10.1016/j.snb.2011.08.027]
[2]
G. Verdoolaege, G. Karagounis, and G.V. Oost, "Classification and dimensionality reduction of international tokamak confinement data on a probabilistic manifold", Nucl. Instrum. Methods Phys. Res., vol. 720, no. 720, pp. 11-13, 2013.
[http://dx.doi.org/10.1016/j.nima.2012.12.047]
[3]
I.T. Jolliffe, Principal Component Analysis., Springer-Verlag, 1986.
[http://dx.doi.org/10.1007/978-1-4757-1904-8]
[4]
X. Zhu, C. Tang, P. Wang, H. Xu, M. Wang, and J. Tian, "Saliency detection via affinity graph learning and weighted manifold ranking", Neurocomputing, vol. 312, pp. 239-250, 2018.
[http://dx.doi.org/10.1016/j.neucom.2018.05.106]
[5]
M. Farouk, and A. Sutherland, "Principal component pyramids for manifold learning in hand shape recognition", Ict Express, vol. 4, pp. 63-68, 2018.
[http://dx.doi.org/10.1016/j.icte.2018.04.009]
[6]
Y. Zhao, X. You, S. Yu, C. Xu, W. Yuan, X.Y. Jing, T. Zhang, and D. Tao, "Multi-view manifold learning with locality alignment", Pattern Recognit., vol. 78, pp. 154-166, 2018.
[http://dx.doi.org/10.1016/j.patcog.2018.01.012]
[7]
J.R. Quinlan, "Induction of decision trees", Mach. Learn., vol. 1, no. 1, pp. 81-106, 1986.
[http://dx.doi.org/10.1007/BF00116251]
[8]
S. Roy, S. Mondal, A. Ekbal, and M.S. Desarkar, "Dispersion ratio based decision tree model for classification", Expert Syst. Appl., vol. 116, pp. 1-9, 2019.
[http://dx.doi.org/10.1016/j.eswa.2018.08.039]
[9]
P. Müller, K. Salminen, and V. Nieminen, "Scent classification by K nearest neighbors using ion-mobility spectrometry measurements", Expert Syst. Appl., vol. 115, pp. 593-606, 2019.
[http://dx.doi.org/10.1016/j.eswa.2018.08.042]
[10]
W.E. Hadi, Q.A. Al-Radaideh, and S. Alhawari, "Integrating associative rule-based classification with Naïve Bayes for text classification", Appl. Soft Comput., vol. 69, pp. 344-356, 2018.
[http://dx.doi.org/10.1016/j.asoc.2018.04.056]
[11]
L. Scrucca, Dimension Reduction for Model-Based Clustering., Kluwer Academic Publishers, 2010.
[http://dx.doi.org/10.1007/s11222-009-9138-7]
[12]
K. Morris, and P.D. Mcnicholas, "Clustering, classification, discriminant analysis, and dimension reduction via generalized hyperbolic mixtures", Comput. Stat. Data Anal., vol. 97, pp. 133-150, 2016.
[http://dx.doi.org/10.1016/j.csda.2015.10.008]
[13]
H. Xie, J. Li, Q. Zhang, and Y. Wang, "Comparison among dimensionality reduction techniques based on Random Projection for cancer classification", Comput. Biol. Chem., vol. 65, pp. 165-172, 2016.
[http://dx.doi.org/10.1016/j.compbiolchem.2016.09.010] [PMID: 27687329]
[14]
Q. Li, S. Zhang, and Z. Zhang, "Online surface defects detection system for cold-rolled steel strip", Recent Pat. Eng., vol. 11, no. 1, pp. 62-67, 2017.
[http://dx.doi.org/10.2174/1872212110666161116164708]
[15]
T. Athanasia, "Z. Anastasios, and S. Petros. “The Incorporation of Ceramic Membranes in MBR Systems for Wastewater Treatment: Advantages and Patented New Developments", Recent Pat. Eng., vol. 8, no. 1, pp. 24-32, 2014.
[http://dx.doi.org/10.2174/1872212107666131126234626]
[16]
H. Wu, R.P. Loce, Y.R. Wang, "Video-based system and method for parking occupancy detection", United States 56890708, 2017,
[17]
T.L. Blevins, W.K. Wojsznis, M.J. Nixon, J.M. Caldwell; "Inferential process modeling, quality prediction and fault detection using multi-stage data segregation".U.S. Patent 9110452, Nov 12, 2015.,
[18]
P.D.M. Truong, “Method and apparatus for multi-radio coexistence”, United States 8787468, 2017,
[19]
Kommisetti, "Method for rejecting tuning disturbances to improve lamp failure prediction quality in thermal processes", U.S. Patent 62055481, Mar 31, 2016.,
[20]
M. Abramoff, S. Russell,; "Methods and systems for determining optimal features for classifying patterns or objects in images."United States, .60940603, 2012.,
[21]
S.T. Roweis, and L.K. Saul, "Nonlinear dimensionality reduction by locally linear embedding", Science, vol. 290, no. 5500, pp. 2323-2326, 2000.
[http://dx.doi.org/10.1126/science.290.5500.2323] [PMID: 11125150]
[22]
J.B. Tenenbaum, V. de Silva, and J.C. Langford, "A global geometric framework for nonlinear dimensionality reduction", Science, vol. 290, no. 5500, pp. 2319-2323, 2000.
[http://dx.doi.org/10.1126/science.290.5500.2319] [PMID: 11125149]
[23]
W. Meng, Z. Shiyuan, and D. Zhankui, "A support subset algorithm and its application to information security risk assessment", Recent Pat. Eng., vol. 11, pp. 188-193, 2017.
[http://dx.doi.org/10.2174/1872212111666170221164622]