Cited 0 time in
발표문 텍스트의 어휘사용 특성: TED-LIUM과 BNC 코퍼스 비교
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | 김미란 | - |
| dc.date.accessioned | 2022-12-26T20:46:47Z | - |
| dc.date.available | 2022-12-26T20:46:47Z | - |
| dc.date.issued | 2016 | - |
| dc.identifier.issn | 1598-1886 | - |
| dc.identifier.issn | 2713-6817 | - |
| dc.identifier.uri | https://scholarworks.gnu.ac.kr/handle/sw.gnu/16114 | - |
| dc.description.abstract | This paper aims to present word frequency information of presentation transcripts for a better understanding of the linguistic nature of spoken data. The frequency information analyzed in this paper is based on the TEDLIUM corpus, which contains approximately 2.6 million words from 1,495 talks (transcripts) given by 1,242 different speakers. These talks are all task-oriented to some extent with various topics and communicative purposes, and they are of our potential interest as a preliminary to a measure of learning resources with the aim of developing listening and speaking skills. TTR (Type-Token Ratio) commonly used in evaluating lexical complexity or diversity of reading material is used to measure similar characteristics of presentation material as an exploratory measure. In addition, the frequency information extracted from TED-LIM is compared to the frequency list of BNC (British National Corpus) in order to understand similarities and/or differences between presentation transcripts and written/spoken corpus. Our better understanding of linguistic characteristics of spoken texts, though limited to presentation transcripts in this paper, can assist language learners with appropriate listening or speaking material to acquire effective communication skills. | - |
| dc.format.extent | 30 | - |
| dc.language | 한국어 | - |
| dc.language.iso | KOR | - |
| dc.publisher | 서강대학교 언어정보연구소 | - |
| dc.title | 발표문 텍스트의 어휘사용 특성: TED-LIUM과 BNC 코퍼스 비교 | - |
| dc.title.alternative | Word Frequency Information in Presentation Transcripts: Comparing TED-LIUM Corpus and BNC Corpus | - |
| dc.type | Article | - |
| dc.publisher.location | 대한민국 | - |
| dc.identifier.doi | 10.29211/soli.2016.29..004 | - |
| dc.identifier.bibliographicCitation | 언어와 정보 사회, v.29, pp 93 - 122 | - |
| dc.citation.title | 언어와 정보 사회 | - |
| dc.citation.volume | 29 | - |
| dc.citation.startPage | 93 | - |
| dc.citation.endPage | 122 | - |
| dc.identifier.kciid | ART002169726 | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | kci | - |
| dc.subject.keywordAuthor | 테드리움코퍼스 | - |
| dc.subject.keywordAuthor | BNC코퍼스 | - |
| dc.subject.keywordAuthor | 강연자료 | - |
| dc.subject.keywordAuthor | 어휘빈도 | - |
| dc.subject.keywordAuthor | 어휘길이(글자 | - |
| dc.subject.keywordAuthor | 자소) | - |
| dc.subject.keywordAuthor | 어휘다양성 | - |
| dc.subject.keywordAuthor | 어휘타입빈도비율 | - |
| dc.subject.keywordAuthor | TED-LIUM corpus | - |
| dc.subject.keywordAuthor | BNC(British National Corpus) | - |
| dc.subject.keywordAuthor | presentation transcripts | - |
| dc.subject.keywordAuthor | word frequency | - |
| dc.subject.keywordAuthor | word length(letter and segment) | - |
| dc.subject.keywordAuthor | lexical diversity | - |
| dc.subject.keywordAuthor | TTR(Type-Token Ratio) | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
Gyeongsang National University Central Library, 501, Jinju-daero, Jinju-si, Gyeongsangnam-do, 52828, Republic of Korea+82-55-772-0532
COPYRIGHT 2022 GYEONGSANG NATIONAL UNIVERSITY LIBRARY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
