Cited 5 time in
Hi-LASSO: High-performance python and apache spark packages for feature selection with high-dimensional data
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Jo, J. | - |
| dc.contributor.author | Jung, S. | - |
| dc.contributor.author | Park, J. | - |
| dc.contributor.author | Kim, Y. | - |
| dc.contributor.author | Kang, M. | - |
| dc.date.accessioned | 2023-01-04T05:05:01Z | - |
| dc.date.available | 2023-01-04T05:05:01Z | - |
| dc.date.issued | 2022-12 | - |
| dc.identifier.issn | 1932-6203 | - |
| dc.identifier.uri | https://scholarworks.gnu.ac.kr/handle/sw.gnu/29920 | - |
| dc.description.abstract | High-dimensional LASSO (Hi-LASSO) is a powerful feature selection tool for high-dimensional data. Our previous study showed that Hi-LASSO outperformed the other state-of-the-art LASSO methods. However, the substantial cost of bootstrapping and the lack of experiments for a parametric statistical test for feature selection have impeded to apply Hi-LASSO for practical applications. In this paper, the Python package and its Spark library are efficiently designed in a parallel manner for practice with real-world problems, as well as providing the capability of the parametric statistical tests for feature selection on high-dimensional data. We demonstrate Hi-LASSO's outperformance with various intensive experiments in a practical manner. Hi-LASSO will be efficiently and easily performed by using the packages for feature selection. Hi-LASSO packages are publicly available at https://github.com/dataxlab/Hi-LASSO under the MIT license. The packages can be easily installed by Python PIP, and additional documentation is available at https://pypi.org/project/hi-lasso and https://pypi.org/project/Hi-LASSO-spark. Copyright: © 2022 Jo et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Public Library of Science | - |
| dc.title | Hi-LASSO: High-performance python and apache spark packages for feature selection with high-dimensional data | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1371/journal.pone.0278570 | - |
| dc.identifier.scopusid | 2-s2.0-85143183109 | - |
| dc.identifier.wosid | 000925734000179 | - |
| dc.identifier.bibliographicCitation | PLoS ONE, v.17, no.12 December | - |
| dc.citation.title | PLoS ONE | - |
| dc.citation.volume | 17 | - |
| dc.citation.number | 12 December | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Science & Technology - Other Topics | - |
| dc.relation.journalWebOfScienceCategory | Multidisciplinary Sciences | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
Gyeongsang National University Central Library, 501, Jinju-daero, Jinju-si, Gyeongsangnam-do, 52828, Republic of Korea+82-55-772-0532
COPYRIGHT 2022 GYEONGSANG NATIONAL UNIVERSITY LIBRARY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
