Strategies for Imputing Missing Values and Removing Outliers in the Dataset for Machine Learning-Based Construction Cost Prediction
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Lee, Haneul | - |
| dc.contributor.author | Yun, Seokheon | - |
| dc.date.accessioned | 2024-05-08T01:30:39Z | - |
| dc.date.available | 2024-05-08T01:30:39Z | - |
| dc.date.issued | 2024-04 | - |
| dc.identifier.issn | 2075-5309 | - |
| dc.identifier.uri | https://scholarworks.gnu.ac.kr/handle/sw.gnu/70476 | - |
| dc.description.abstract | Accurately predicting construction costs during the initial planning stages is crucial for the successful completion of construction projects. Recent advancements have introduced various machine learning-based methods to enhance cost estimation precision. However, accumulating authentic construction cost data is not straightforward, and existing datasets frequently contain many missing values, which makes precise cost prediction difficult. This study analyzes diverse substitution methods for addressing missing values in construction cost data. It also evaluates the performance of machine learning models in cost prediction after the removal of conditional outliers. The primary goal is to identify and propose optimal strategies for handling missing values in construction cost records, ultimately improving the reliability of cost predictions. According to the analysis results, among single imputation methods, median imputation emerges as the most suitable, while among multiple imputation methods, lasso regression imputation produces the best results. This research contributes to enhancing the trustworthiness of construction cost predictions by presenting a pragmatic approach to managing missing data in construction cost performance records, thereby facilitating more precise project planning and execution. © 2024 by the authors. | - |
| dc.language | English | - |
| dc.language.iso | ENG | - |
| dc.publisher | Multidisciplinary Digital Publishing Institute (MDPI) | - |
| dc.title | Strategies for Imputing Missing Values and Removing Outliers in the Dataset for Machine Learning-Based Construction Cost Prediction | - |
| dc.type | Article | - |
| dc.publisher.location | Switzerland | - |
| dc.identifier.doi | 10.3390/buildings14040933 | - |
| dc.identifier.scopusid | 2-s2.0-85191365463 | - |
| dc.identifier.wosid | 001220477800001 | - |
| dc.identifier.bibliographicCitation | Buildings, v.14, no.4 | - |
| dc.citation.title | Buildings | - |
| dc.citation.volume | 14 | - |
| dc.citation.number | 4 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Construction & Building Technology | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalWebOfScienceCategory | Construction & Building Technology | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Civil | - |
| dc.subject.keywordAuthor | construction duration | - |
| dc.subject.keywordAuthor | estimation | - |
| dc.subject.keywordAuthor | imputation | - |
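The abstract singles out median imputation as the most suitable single imputation method. As a minimal sketch of that idea only (not the authors' implementation, which this record does not include), the snippet below fills missing entries in one numeric column with the median of the observed entries. The function name `impute_median` and the sample `floor_area` values are hypothetical and chosen purely for illustration.

```python
from statistics import median


def impute_median(values):
    """Replace None entries with the median of the observed entries."""
    observed = [v for v in values if v is not None]
    if not observed:
        # With no observed values there is nothing to compute a median from.
        raise ValueError("cannot impute: no observed values")
    fill = median(observed)
    return [fill if v is None else v for v in values]


# Hypothetical column with two missing entries (illustrative values only).
floor_area = [1200.0, None, 950.0, 4000.0, None, 1100.0]
imputed = impute_median(floor_area)
```

The median is less sensitive than the mean to extreme entries such as the 4000.0 above, which is one common reason it is preferred for skewed cost data.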
