Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

RHadoop 플랫폼에서 로지스틱 회귀 알고리즘 성능 비교Performance Comparison of Logistic Regression Algorithms on RHadoop

Other Titles
Performance Comparison of Logistic Regression Algorithms on RHadoop
Authors
정병호임동훈
Issue Date
2017
Publisher
한국컴퓨터정보학회
Keywords
Big data; Hadoop; Logistic regression; R; RHadoop
Citation
한국컴퓨터정보학회논문지, v.22, no.4, pp 9 - 16
Pages
8
Indexed
KCI
Journal Title
한국컴퓨터정보학회논문지
Volume
22
Number
4
Start Page
9
End Page
16
URI
https://scholarworks.gnu.ac.kr/handle/sw.gnu/14663
DOI
10.9708/jksci.2017.22.04.009
ISSN
1598-849X
2383-9945
Abstract
Machine learning has found widespread implementations and applications in many different domains in our life. Logistic regression is a type of classification in machine leaning, and is used widely in many fields, including medicine, economics, marketing and social sciences. In this paper, we present the MapReduce implementation of three existing algorithms, this is, Gradient Descent algorithm, Cost Minimization algorithm and Newton-Raphson algorithm, for logistic regression on RHadoop that integrates R and Hadoop environment applicable to large scale data. We compare the performance of these algorithms for estimation of logistic regression coefficients with real and simulated data sets. We also compare the performance of our RHadoop and RHIPE platforms. The performance experiments showed that our Newton-Raphson algorithm when compared to Gradient Descent and Cost Minimization algorithms appeared to be better to all data tested, also showed that our RHadoop was better than RHIPE in real data, and was opposite in simulated data.
Files in This Item
There are no files associated with this item.
Appears in
Collections
자연과학대학 > Dept. of Information and Statistics > Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE