Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

발표문 텍스트의 어휘사용 특성: TED-LIUM과 BNC 코퍼스 비교open accessWord Frequency Information in Presentation Transcripts: Comparing TED-LIUM Corpus and BNC Corpus

Other Titles
Word Frequency Information in Presentation Transcripts: Comparing TED-LIUM Corpus and BNC Corpus
Authors
김미란
Issue Date
2016
Publisher
서강대학교 언어정보연구소
Keywords
테드리움코퍼스; BNC코퍼스; 강연자료; 어휘빈도; 어휘길이(글자; 자소); 어휘다양성; 어휘타입빈도비율; TED-LIUM corpus; BNC(British National Corpus); presentation transcripts; word frequency; word length(letter and segment); lexical diversity; TTR(Type-Token Ratio)
Citation
언어와 정보 사회, v.29, pp 93 - 122
Pages
30
Indexed
KCI
Journal Title
언어와 정보 사회
Volume
29
Start Page
93
End Page
122
URI
https://scholarworks.gnu.ac.kr/handle/sw.gnu/16114
DOI
10.29211/soli.2016.29..004
ISSN
1598-1886
2713-6817
Abstract
This paper aims to present word frequency information of presentation transcripts for a better understanding of the linguistic nature of spoken data. The frequency information analyzed in this paper is based on the TEDLIUM corpus, which contains approximately 2.6 million words from 1,495 talks (transcripts) given by 1,242 different speakers. These talks are all task-oriented to some extent with various topics and communicative purposes, and they are of our potential interest as a preliminary to a measure of learning resources with the aim of developing listening and speaking skills. TTR (Type-Token Ratio) commonly used in evaluating lexical complexity or diversity of reading material is used to measure similar characteristics of presentation material as an exploratory measure. In addition, the frequency information extracted from TED-LIM is compared to the frequency list of BNC (British National Corpus) in order to understand similarities and/or differences between presentation transcripts and written/spoken corpus. Our better understanding of linguistic characteristics of spoken texts, though limited to presentation transcripts in this paper, can assist language learners with appropriate listening or speaking material to acquire effective communication skills.
Files in This Item
There are no files associated with this item.
Appears in
Collections
사범대학 > 영어교육과 > Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Mi Ran photo

Kim, Mi Ran
사범대학 (영어교육과)
Read more

Altmetrics

Total Views & Downloads

BROWSE