Hi-LASSO: High-performance python and apache spark packages for feature selection with high-dimensional dataopen access
- Authors
- Jo, J.; Jung, S.; Park, J.; Kim, Y.; Kang, M.
- Issue Date
- Dec-2022
- Publisher
- Public Library of Science
- Citation
- PLoS ONE, v.17, no.12 December
- Indexed
- SCIE
SCOPUS
- Journal Title
- PLoS ONE
- Volume
- 17
- Number
- 12 December
- URI
- https://scholarworks.gnu.ac.kr/handle/sw.gnu/29920
- DOI
- 10.1371/journal.pone.0278570
- ISSN
- 1932-6203
- Abstract
- High-dimensional LASSO (Hi-LASSO) is a powerful feature selection tool for high-dimensional data. Our previous study showed that Hi-LASSO outperformed the other state-of-the-art LASSO methods. However, the substantial cost of bootstrapping and the lack of experiments for a parametric statistical test for feature selection have impeded to apply Hi-LASSO for practical applications. In this paper, the Python package and its Spark library are efficiently designed in a parallel manner for practice with real-world problems, as well as providing the capability of the parametric statistical tests for feature selection on high-dimensional data. We demonstrate Hi-LASSO's outperformance with various intensive experiments in a practical manner. Hi-LASSO will be efficiently and easily performed by using the packages for feature selection. Hi-LASSO packages are publicly available at https://github.com/dataxlab/Hi-LASSO under the MIT license. The packages can be easily installed by Python PIP, and additional documentation is available at https://pypi.org/project/hi-lasso and https://pypi.org/project/Hi-LASSO-spark. Copyright: © 2022 Jo et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - 자연과학대학 > Dept. of Information and Statistics > Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.