Peter Gachoki, Moses Muraya and Gladys Njoroge. Features Selection in Statistical Classification of High Dimensional Image Derived Maize (
L.) Phenomic Data.
. 2022; 10(2):44-51. doi: 10.12691/AJAMS-10-2-2
high throughput phenotyping, high dimensional data, feature extraction, feature importance, Shapley values, LASSO regression
This work is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit
[1] | Guyon I. and Elisseeff A. 2003. An introduction to variable and feature selection. Journal of Machine Learning Research. 1157-1182. |
|
[2] | Pieruschka R, Schurr U (2019) Plant phenotyping: past, present, and future. Plant Phenomics: 1-6. |
|
[3] | Chen, D., Neumann, K., Friedel, S., Kilian, B., Chen, M., Altmann, T., & Klukas, C. (2014). Dissecting the phenotypic components of crop plant growth and drought responses based on high-throughput image analysis. The Plant Cell, 26(12), 4636-4655. |
|
[4] | Klukas, C., Chen, D., & Pape, J. M. (2014). Integrated analysis platform: an open-source information system for high-throughput plant phenotyping. Plant physiology, 165(2), 506-518. |
|
[5] | Boyd, D., & Crawford, K. (2011,). Six provocations for big data. In A decade in internet time: Symposium on the dynamics of the internet and society. |
|
[6] | Blum, A., Hopcroft, J., & Kannan, R. (2020). Foundations of data science. Cambridge University Press. |
|
[7] | Unnisabegum, A., Hussain, M., & Shaik, M. (2019). Data Mining Techniques for Big Data, Vol. 6, Special Issue. |
|
[8] | Venkatesh, B., & Anuradha, J. (2019). A review of feature selection and its methods. Cybernetics and Information Technologies, 19(1), 3-26. |
|
[9] | Duchesnay, E., & Löfstedt, T. (2018). Statistics and Machine Learning in Python. Release 0.1. |
|
[10] | Cohen, Shay & Ruppin, Eytan & Dror, Gideon. (2005). Feature Selection Based on the Shapley Value. 665-670. |
|
[11] | Fryer, D., Strümke, I., & Nguyen, H. (2021). Shapley values for feature selection: The good, the bad, and the axioms. IEEE Access, 9, 144352-144360. |
|
[12] | Helwig, N. E. (2017). Data, Covariance, and Correlation Matrix. University of Minnesota (Twin Cities). |
|
[13] | Kim, Yongdai & Kim, Jinseog. (2004). Gradient LASSO for feature selection. Proceedings of the 21st International Conference on Machine Learning. |
|
[14] | Muthukrishnan, R. & Rohini, R. (2016). LASSO: A feature selection technique in predictive modeling for machine learning. 18-20. |
|
[15] | Ghojogh, B., Samad, M. N., Mashhadi, S. A., Kapoor, T., Ali, W., Karray, F., & Crowley, M. (2019). Feature selection and feature extraction in pattern analysis: A literature review. arXiv preprint arXiv:1905.02845. |
|
[16] | Pan, R., Yang, T., Cao, J., Lu, K., & Zhang, Z. (2015). Missing data imputation by K nearest neighbours based on grey relational structure and mutual information. Applied Intelligence, 43(3), 614-632. |
|
[17] | Emmanuel, T., Maupong, T., Mpoeleng, D., Semong, T., Mphago, B., & Tabona, O. (2021). A survey on missing data in machine learning. Journal of Big Data, 8(1), 1-37. |
|
[18] | Kaylan, P. (2021). 7 Popular Feature Selection Routines in Machine Learning. |
|
[19] | https://www.analyticsvidhya.com/blog/2021/03/7-popular-feature-selection-routines-in-machine-learning/. |
|
[20] | Saarela, Mirka & Jauhiainen, Susanne. (2021). Comparison of feature importance measures as explanations for classification models. SN Applied Sciences. 3. |
|
[21] | Giersdorf, J., & Conzelmann, M. (2017). Analysis of feature-selection for LASSO regression models. |
|
[22] | Chu, Carlin & Chan, David. (2020). Feature Selection Using Approximated High-Order Interaction Components of the Shapley Value for Boosted Tree Classifier. IEEE Access. PP. 1-1. |
|
[23] | Xiaomao, X., Xudong, Z., & Yuanfang, W. (2019, August). A comparison of feature selection methodology for solving classification problems in finance. In Journal of Physics: Conference Series (Vol. 1284, No. 1, p. 012026). IOP Publishing. |
|