Cited 0 time in
Temporal Dynamics of Harmful Speech in Chatbot-User Dialogues: A Comparative Study of LLM and Chit-Chat Systems
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Kwon, Ohseong | - |
| dc.contributor.author | Yoon, Hyobeen | - |
| dc.contributor.author | Chin, Hyojin | - |
| dc.contributor.author | Park, Jisung | - |
| dc.date.accessioned | 2026-01-08T02:30:13Z | - |
| dc.date.available | 2026-01-08T02:30:13Z | - |
| dc.date.issued | 2025-12 | - |
| dc.identifier.issn | 2076-3417 | - |
| dc.identifier.issn | 2076-3417 | - |
| dc.identifier.uri | https://scholarworks.gnu.ac.kr/handle/sw.gnu/81653 | - |
| dc.description.abstract | Harmful language in conversational AI poses distinct safety and governance challenges, as Large Language Model (LLM) chatbots interact in private, one-to-one settings. Understanding the types of harm and their temporal concentration is crucial for responsible deployment and time-aware moderation. This study investigates the types and diurnal dynamics of harmful speech, comparing patterns between play-oriented chit-chat and task-oriented LLM services.We analyze two large-scale, real-world English corpora: a chit-chat service (SimSimi; 8.7 M utterances) and an LLM service (WildChat; 610 K utterances). Using the Perspective API for multi-label classification (Toxicity, Profanity, Insult, Identity Attack, Threat), we estimate the incidence of harm categories and compare their distribution across five dayparts. Our analysis shows that harmful speech is significantly more prevalent in the chit-chat context than in the LLM service. Across both platforms, Toxicity and Profanity are the dominant categories. Temporally, harmful speech concentrates most frequently during the dawn daypart. We contribute an empirical baseline on how harm varies by chatbot modality and time of day, offering practical guidance for designing dynamic, platform-specific moderation policies. | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | MDPI | - |
| dc.title | Temporal Dynamics of Harmful Speech in Chatbot-User Dialogues: A Comparative Study of LLM and Chit-Chat Systems | - |
| dc.type | Article | - |
| dc.publisher.location | 스위스 | - |
| dc.identifier.doi | 10.3390/app152413185 | - |
| dc.identifier.scopusid | 2-s2.0-105025880660 | - |
| dc.identifier.wosid | 001646133500001 | - |
| dc.identifier.bibliographicCitation | Applied Sciences-basel, v.15, no.24 | - |
| dc.citation.title | Applied Sciences-basel | - |
| dc.citation.volume | 15 | - |
| dc.citation.number | 24 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Chemistry | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Materials Science | - |
| dc.relation.journalResearchArea | Physics | - |
| dc.relation.journalWebOfScienceCategory | Chemistry, Multidisciplinary | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Multidisciplinary | - |
| dc.relation.journalWebOfScienceCategory | Materials Science, Multidisciplinary | - |
| dc.relation.journalWebOfScienceCategory | Physics, Applied | - |
| dc.subject.keywordPlus | SLEEP | - |
| dc.subject.keywordAuthor | harmful speech | - |
| dc.subject.keywordAuthor | chatbot | - |
| dc.subject.keywordAuthor | WildChat | - |
| dc.subject.keywordAuthor | chatbot user dialogue | - |
| dc.subject.keywordAuthor | SimSimi | - |
| dc.subject.keywordAuthor | offensive languages | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
Gyeongsang National University Central Library, 501, Jinju-daero, Jinju-si, Gyeongsangnam-do, 52828, Republic of Korea+82-55-772-0532
COPYRIGHT 2022 GYEONGSANG NATIONAL UNIVERSITY LIBRARY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
