Sorochan Armstrong, Michael D. and de la Mata, A. Paulina and Harynuk, James J. (2022) Review of Variable Selection Methods for Discriminant-Type Problems in Chemometrics. Frontiers in Analytical Science, 2. ISSN 2673-9283
pubmed-zip/versions/2/package-entries/frans-02-867938-r1/frans-02-867938.pdf - Published Version
Download (740kB)
Abstract
Discriminant-type analyses arise from the need to classify samples based on their measured characteristics (variables), usually with respect to some observable property. In the case of samples that are difficult to obtain, or using advanced instrumentation, it is very common to encounter situations with many more measured characteristics than samples. The method of Partial Least Squares Regression (PLS-R), and its variant for discriminant-type analyses (PLS-DA) are among the most ubiquitous of these tools. PLS utilises a rank-deficient method to solve the inverse least-squares problem in a way that maximises the co-variance between the known properties of the samples (commonly referred to as the Y-Block), and their measured characteristics (the X-block). A relatively small subset of highly co-variate variables are weighted more strongly than those that are poorly co-variate, in such a way that an ill-posed matrix inverse problem is circumvented. Feature selection is another common way of reducing the dimensionality of the data to a relatively small, robust subset of variables for use in subsequent modelling. The utility of these features can be inferred and tested any number of ways, this are the subject of this review.
Item Type: | Article |
---|---|
Subjects: | Oalibrary Press > Chemical Science |
Depositing User: | Managing Editor |
Date Deposited: | 21 Nov 2022 04:36 |
Last Modified: | 23 Dec 2023 05:45 |
URI: | http://asian.go4publish.com/id/eprint/262 |