Detailed Information

Cited 9 times in Web of Science; cited 11 times in Scopus

SocialTERM-Extractor: Identifying and Predicting Social-Problem-Specific Key Noun Terms from a Large Number of Online News Articles Using Text Mining and Machine Learning Techniques (open access)

Authors
Suh, Jong Hwan
Issue Date
1-Jan-2019
Publisher
MDPI
Keywords
social-problem-specific key noun terms; temporal weights; sentiment analysis; complex network structure analysis; deep learning; ensemble learning methods
Citation
SUSTAINABILITY, v.11, no.1
Indexed
SCIE
SSCI
SCOPUS
Journal Title
SUSTAINABILITY
Volume
11
Number
1
URI
https://scholarworks.gnu.ac.kr/handle/sw.gnu/9543
DOI
10.3390/su11010196
ISSN
2071-1050
Abstract
In the digital age, the abundant unstructured data on the Internet, particularly online news articles, provide opportunities for identifying social problems and understanding social systems for sustainability. However, previous works have not paid attention to the social-problem-specific perspectives of such big data, and it remains unclear how information technologies can use these data to identify and manage ongoing social problems. In this context, this paper introduces and focuses on social-problem-specific key noun terms, namely SocialTERMs, which can be used not only to search the Internet for social-problem-related data but also to monitor ongoing and future events of social problems. Moreover, to reduce the time-consuming human effort of identifying SocialTERMs, this paper designs and examines the SocialTERM-Extractor, an automatic approach that identifies the key noun terms of social-problem-related topics (SPRTs) in a large number of online news articles and predicts the SocialTERMs among the identified key noun terms. The paper is novel as the first attempt to identify and predict SocialTERMs from a large number of online news articles, and it contributes to the literature by proposing three types of text-mining-based features, namely temporal weight, sentiment, and complex network structural features, and by comparing the performance of these features across various machine learning techniques, including deep learning. In particular, when applied to a large number of online news articles published in South Korea over a 12-month period and mostly written in Korean, the experimental results showed that Boosting Decision Tree gave the best performance with the full feature sets and that SocialTERMs can be predicted with high performance by the proposed SocialTERM-Extractor.
Ultimately, this paper can benefit individuals or organizations who want to explore and use social-problem-related data in a systematic manner to understand and manage social problems, even if they are unfamiliar with ongoing social problems.
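As a rough illustration of two of the feature types named in the abstract, the sketch below computes a temporal weight and a simple network structural feature for each noun term. The abstract does not give the paper's formulas, so everything here is an assumption made for illustration: the exponential-decay weighting, the `half_life_days` parameter, the use of co-occurrence degree as the network feature, the function name `term_features`, and the toy articles. The sentiment feature and the classifier are omitted.

```python
import math
from collections import defaultdict
from itertools import combinations


def term_features(articles, now, half_life_days=30.0):
    """Compute illustrative per-term features.

    articles: list of (day_index, list_of_noun_terms) pairs.
    now: day index treated as the present.
    Returns {term: {"temporal_weight": float, "degree": int}}.
    """
    # Assumed weighting: an article's contribution halves every half_life_days.
    decay = math.log(2) / half_life_days
    weight = defaultdict(float)
    neighbors = defaultdict(set)

    for day, nouns in articles:
        w = math.exp(-decay * (now - day))
        unique = set(nouns)
        # Each article counts once per term, scaled by its recency.
        for term in unique:
            weight[term] += w
        # Terms appearing in the same article are linked in the network.
        for a, b in combinations(sorted(unique), 2):
            neighbors[a].add(b)
            neighbors[b].add(a)

    return {
        term: {"temporal_weight": weight[term], "degree": len(neighbors[term])}
        for term in weight
    }


# Hypothetical, already-tokenized articles (day index, noun terms).
articles = [
    (0, ["unemployment", "youth", "policy"]),
    (30, ["unemployment", "housing"]),
    (60, ["housing", "policy", "unemployment"]),
]
feats = term_features(articles, now=60)
for term, values in sorted(feats.items()):
    print(term, values)
```

In a pipeline like the one the abstract describes, feature vectors of this kind would be passed, together with sentiment features, to a classifier that predicts which key noun terms are SocialTERMs.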
Files in This Item
There are no files associated with this item.
Appears in Collections
College of Business Administration > Department of Management Information Systems > Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Suh, Jong Hwan
College of Business Administration (Department of Management Information Systems)
