Detailed Information

Cited 3 time in webofscience Cited 5 time in scopus
Metadata Downloads

Hi-LASSO: High-performance python and apache spark packages for feature selection with high-dimensional data

Full metadata record
DC Field Value Language
dc.contributor.authorJo, J.-
dc.contributor.authorJung, S.-
dc.contributor.authorPark, J.-
dc.contributor.authorKim, Y.-
dc.contributor.authorKang, M.-
dc.date.accessioned2023-01-04T05:05:01Z-
dc.date.available2023-01-04T05:05:01Z-
dc.date.issued2022-12-
dc.identifier.issn1932-6203-
dc.identifier.urihttps://scholarworks.gnu.ac.kr/handle/sw.gnu/29920-
dc.description.abstractHigh-dimensional LASSO (Hi-LASSO) is a powerful feature selection tool for high-dimensional data. Our previous study showed that Hi-LASSO outperformed the other state-of-the-art LASSO methods. However, the substantial cost of bootstrapping and the lack of experiments for a parametric statistical test for feature selection have impeded to apply Hi-LASSO for practical applications. In this paper, the Python package and its Spark library are efficiently designed in a parallel manner for practice with real-world problems, as well as providing the capability of the parametric statistical tests for feature selection on high-dimensional data. We demonstrate Hi-LASSO's outperformance with various intensive experiments in a practical manner. Hi-LASSO will be efficiently and easily performed by using the packages for feature selection. Hi-LASSO packages are publicly available at https://github.com/dataxlab/Hi-LASSO under the MIT license. The packages can be easily installed by Python PIP, and additional documentation is available at https://pypi.org/project/hi-lasso and https://pypi.org/project/Hi-LASSO-spark. Copyright: © 2022 Jo et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.-
dc.language영어-
dc.language.isoENG-
dc.publisherPublic Library of Science-
dc.titleHi-LASSO: High-performance python and apache spark packages for feature selection with high-dimensional data-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1371/journal.pone.0278570-
dc.identifier.scopusid2-s2.0-85143183109-
dc.identifier.wosid000925734000179-
dc.identifier.bibliographicCitationPLoS ONE, v.17, no.12 December-
dc.citation.titlePLoS ONE-
dc.citation.volume17-
dc.citation.number12 December-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaScience & Technology - Other Topics-
dc.relation.journalWebOfScienceCategoryMultidisciplinary Sciences-
Files in This Item
There are no files associated with this item.
Appears in
Collections
자연과학대학 > Dept. of Information and Statistics > Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Young Soon photo

Kim, Young Soon
자연과학대학 (정보통계학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE