LLM 기반 의미론적 특허 데이터 노이즈 필터링 방법론 연구LLM-Based Semantic Noise Filtering Method for Patent Text Data
- Other Titles
- LLM-Based Semantic Noise Filtering Method for Patent Text Data
- Authors
- 임진성; 송지훈
- Issue Date
- Oct-2025
- Publisher
- 한국산업융합학회
- Keywords
- Generative AI; Patents; LLM; Noise filtering; Cross LLM Review
- Citation
- 한국산업융합학회논문집, v.28, no.5, pp 1379 - 1389
- Pages
- 11
- Indexed
- KCI
- Journal Title
- 한국산업융합학회논문집
- Volume
- 28
- Number
- 5
- Start Page
- 1379
- End Page
- 1389
- URI
- https://scholarworks.gnu.ac.kr/handle/sw.gnu/80777
- ISSN
- 1226-833x
2765-5415
- Abstract
- Patent data are essential for tracking technological progress, assessing competitiveness, and forecasting future developments. However, the rapid evolution of technology and the rise of convergent fields make filtering irrelevant data a persistent challenge.
Traditional statistical models and manual preprocessing by researchers require substantial time and effort, prompting continuous research on efficient information structuring. In particular, filtering methods based on statistics or keywords have limitations in fully capturing subtle technical nuances and complex contexts. To address these limitations, this study proposes a semantic noise filtering methodology for patent data leveraging the contextual understanding capabilities of large language models (LLMs). The approach integrates LLM-based classification, statistical stability analysis, and cross-LLM review procedures to enhance the consistency and reliability of the filtering results. Applied to 1,930 domestic patents in the bio-artificial organ domain from 2000 to 2024, the method identified 55.4% as noise. The results demonstrate the method’s potential as an effective tool for technology policy formulation and strategic decision-making support.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - 학과간협동과정 > 기술경영학과 > Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.