A Dual-Stage Framework for Automated Review Labeling: Integrating Keyword Detection and Large Language Models for Subfeature Analysis

Jiang, Yilan; Park, Seyoung; Kim, Harrison

Full metadata record

DC Field	Value	Language
dc.contributor.author	Jiang, Yilan	-
dc.contributor.author	Park, Seyoung	-
dc.contributor.author	Kim, Harrison	-
dc.date.accessioned	2025-12-01T08:30:20Z	-
dc.date.available	2025-12-01T08:30:20Z	-
dc.date.issued	2026-05	-
dc.identifier.issn	1050-0472	-
dc.identifier.uri	https://scholarworks.gnu.ac.kr/handle/sw.gnu/81056	-
dc.description.abstract	Incorporating user needs into design strategies is a promising approach for successful product design. To achieve this, numerous studies extract design implications from user-generated data through supervised and unsupervised learning techniques. While supervised learning methods generally deliver superior performance, they require extensive data labeling, which is time-consuming and labor-intensive. This study presents a domain-specific framework for automating the labeling of product review data, aimed at supporting fine-grained analysis of customer feedback, particularly at the subfeature level. The proposed framework consists of two pseudo-labeling mechanisms: keyword detection and large language model (LLM) application. The first stage extracts keywords for the target topic and then labels the data by checking whether each item contains these keywords. The second stage employs an LLM to label the data left unlabeled by the first stage, based on context. This article presents two applications of LLMs tailored to the characteristics of the target data. (i) Prompting LLM: This approach appends a task-specific template to the input text (reviews) and predicts the masked token representing the label. (ii) Fine-tuned LLM: Leveraging domain knowledge, this method fine-tunes the LLM to classify the input data (reviews) with improved accuracy and contextual relevance. The framework is evaluated through real-world case studies in two product categories: smartphones and blood pressure monitors. Results show that the proposed method achieves F1 scores ranging from 83% to 97%, outperforming a baseline model, which yields F1 scores between 53% and 89%.	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	American Society of Mechanical Engineers	-
dc.title	A Dual-Stage Framework for Automated Review Labeling: Integrating Keyword Detection and Large Language Models for Subfeature Analysis	-
dc.type	Article	-
dc.publisher.location	United States	-
dc.identifier.doi	10.1115/1.4069974	-
dc.identifier.scopusid	2-s2.0-105021457519	-
dc.identifier.bibliographicCitation	Journal of Mechanical Design - Transactions of the ASME, v.148, no.5	-
dc.citation.title	Journal of Mechanical Design - Transactions of the ASME	-
dc.citation.volume	148	-
dc.citation.number	5	-
dc.type.docType	Review	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.subject.keywordAuthor	data labeling	-
dc.subject.keywordAuthor	data-driven design	-
dc.subject.keywordAuthor	keyword detection	-
dc.subject.keywordAuthor	large language model	-
dc.subject.keywordAuthor	online reviews	-
dc.subject.keywordAuthor	supervised learning	-
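
The two-stage labeling flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the keyword set, the prompt template wording, the binary label scheme, and the names `keyword_label`, `build_prompt`, and `dual_stage_label` are all assumptions made for this example. The second stage is passed in as a plain callable so that a prompted or fine-tuned LLM could be substituted; here a toy stand-in is used so the sketch runs without any model.

```python
import re
from typing import Callable, Iterable, List, Optional

# Hypothetical keyword set for one subfeature (smartphone "battery").
BATTERY_KEYWORDS = {"battery", "charge", "charging"}


def keyword_label(review: str, keywords: Iterable[str]) -> Optional[int]:
    """Stage 1: return 1 if any keyword appears as a whole word, else None."""
    tokens = set(re.findall(r"[a-z']+", review.lower()))
    return 1 if tokens & set(keywords) else None


def build_prompt(review: str, topic: str, mask_token: str = "[MASK]") -> str:
    """Append a task template; the masked token stands for the label.

    The template wording is an assumption, not taken from the article.
    """
    return f"{review} This review is {mask_token} about the {topic}."


def dual_stage_label(
    reviews: Iterable[str],
    keywords: Iterable[str],
    topic: str,
    context_classifier: Callable[[str], int],
) -> List[int]:
    """Label by keyword first; send only the remainder to the second stage."""
    keywords = set(keywords)
    labels = []
    for review in reviews:
        label = keyword_label(review, keywords)
        if label is None:
            # Stage 2: a context-based model labels what stage 1 missed.
            label = context_classifier(build_prompt(review, topic))
        labels.append(label)
    return labels


def toy_classifier(prompt: str) -> int:
    """Stand-in for an LLM: flags reviews that mention how long something lasts."""
    return 1 if "lasts" in prompt.lower() else 0
```

For example, calling `dual_stage_label(["The battery drains fast.", "It barely lasts half a day.", "Great camera and screen."], BATTERY_KEYWORDS, "battery", toy_classifier)` labels the first review in stage 1 and defers the other two to the stand-in classifier. In practice the callable would wrap a masked-language-model prediction or a fine-tuned classifier.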

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Engineering > Department of Industrial and Systems Engineering > Journal Articles

Related Researcher

Park, Seyoung: College of Engineering (Department of Industrial and Systems Engineering)
