RHadoop 플랫폼에서 로지스틱 회귀 알고리즘 성능 비교Performance Comparison of Logistic Regression Algorithms on RHadoop
- Other Titles
- Performance Comparison of Logistic Regression Algorithms on RHadoop
- Authors
- 정병호; 임동훈
- Issue Date
- 2017
- Publisher
- 한국컴퓨터정보학회
- Keywords
- Big data; Hadoop; Logistic regression; R; RHadoop
- Citation
- 한국컴퓨터정보학회논문지, v.22, no.4, pp 9 - 16
- Pages
- 8
- Indexed
- KCI
- Journal Title
- 한국컴퓨터정보학회논문지
- Volume
- 22
- Number
- 4
- Start Page
- 9
- End Page
- 16
- URI
- https://scholarworks.gnu.ac.kr/handle/sw.gnu/14663
- DOI
- 10.9708/jksci.2017.22.04.009
- ISSN
- 1598-849X
2383-9945
- Abstract
- Machine learning has found widespread implementations and applications in many different domains in our life. Logistic regression is a type of classification in machine leaning, and is used widely in many fields, including medicine, economics, marketing and social sciences.
In this paper, we present the MapReduce implementation of three existing algorithms, this is, Gradient Descent algorithm, Cost Minimization algorithm and Newton-Raphson algorithm, for logistic regression on RHadoop that integrates R and Hadoop environment applicable to large scale data.
We compare the performance of these algorithms for estimation of logistic regression coefficients with real and simulated data sets. We also compare the performance of our RHadoop and RHIPE platforms.
The performance experiments showed that our Newton-Raphson algorithm when compared to Gradient Descent and Cost Minimization algorithms appeared to be better to all data tested, also showed that our RHadoop was better than RHIPE in real data, and was opposite in simulated data.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - 자연과학대학 > Dept. of Information and Statistics > Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.