Shrinkage heteroscedastic discriminant algorithms for classifying multi-class high-dimensional data: insights from a national health survey

Show simple item record

dc.date.accessioned 2023-03-22T07:01:32Z
dc.date.available 2023-03-22T07:01:32Z
dc.date.issued 2023-03-17 en
dc.identifier.uri http://hdl.handle.net/20.500.11910/20206
dc.description.abstract In many practical data applications, there are often a large number of pre-processed heteroscedastic features. Discriminant analysis is a standard statistical learning method that is useful for classifying such multivariate features. It is well known in literature that the Linear Discriminant Analysis (LDA) is quite sub-optimal for the analysis of high-dimensional heteroscedastic data because of the inherent singularity and instability of the within-class variance. However, shrinkage discriminant analysis (SDA) and its variants often perform better due to its robustness against inherent multicollinearity and heteroscedasticity. In this article, we propose some newly modified discriminant classification algorithms based on the SDA and compare their sensitivities with those of other competing algorithms. The empirical application show that the proposed algorithms perform moderately well for datasets with high dimensions and unequal co-variance structures when applied to simulated and nutrition data with inherent heteroscedasticity and outliers. The sensitivity and precision of the algorithms for the target classes ranges from 70%???100%. The balanced accuracy of all the algorithms ranges from 50 to 75% for the three-class problem considered. Heteroscedastic discriminant algorithm performs moderately well with high sensitivity for classifying health data with high and low dimensions. en
dc.format.medium Print en
dc.subject SHRINKAGE DISCRIMINANT ANALYSIS (SDA) en
dc.subject DATA ANALYSIS en
dc.subject EXPERIENTIAL LEARNING en
dc.title Shrinkage heteroscedastic discriminant algorithms for classifying multi-class high-dimensional data: insights from a national health survey en
dc.type Journal Article en
dc.description.version Y en
dc.ProjectNumber PSALAA en
dc.Volume 12 en
dc.BudgetYear 2022/23 en
dc.ResearchGroup Human and Social Capabilities en
dc.SourceTitle Machine Learning with Applications en
dc.ArchiveNumber 9812725 en
dc.URL http://ktree.hsrc.ac.za/doc_read_all.php?docid=26617 en
dc.PageNumber Online en
dc.outputnumber 14229 en
dc.bibliographictitle Awe, O.O., Dukhi, N. & Dias, R. (2023) Shrinkage heteroscedastic discriminant algorithms for classifying multi-class high-dimensional data: insights from a national health survey. Machine Learning with Applications. 12:Online. http://hdl.handle.net/20.500.11910/20206 en
dc.publicationyear 2023 en
dc.contributor.author1 Awe, O.O. en
dc.contributor.author2 Dukhi, N. en
dc.contributor.author3 Dias, R. en


Files in this item

This item appears in the following Collection(s)

Show simple item record