발표문 텍스트의 어휘사용 특성: TED-LIUM과 BNC 코퍼스 비교

김미란

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

발표문 텍스트의 어휘사용 특성: TED-LIUM과 BNC 코퍼스 비교

Full metadata record

DC Field	Value	Language
dc.contributor.author	김미란	-
dc.date.accessioned	2022-12-26T20:46:47Z	-
dc.date.available	2022-12-26T20:46:47Z	-
dc.date.issued	2016	-
dc.identifier.issn	1598-1886	-
dc.identifier.issn	2713-6817	-
dc.identifier.uri	https://scholarworks.gnu.ac.kr/handle/sw.gnu/16114	-
dc.description.abstract	This paper aims to present word frequency information of presentation transcripts for a better understanding of the linguistic nature of spoken data. The frequency information analyzed in this paper is based on the TEDLIUM corpus, which contains approximately 2.6 million words from 1,495 talks (transcripts) given by 1,242 different speakers. These talks are all task-oriented to some extent with various topics and communicative purposes, and they are of our potential interest as a preliminary to a measure of learning resources with the aim of developing listening and speaking skills. TTR (Type-Token Ratio) commonly used in evaluating lexical complexity or diversity of reading material is used to measure similar characteristics of presentation material as an exploratory measure. In addition, the frequency information extracted from TED-LIM is compared to the frequency list of BNC (British National Corpus) in order to understand similarities and/or differences between presentation transcripts and written/spoken corpus. Our better understanding of linguistic characteristics of spoken texts, though limited to presentation transcripts in this paper, can assist language learners with appropriate listening or speaking material to acquire effective communication skills.	-
dc.format.extent	30	-
dc.language	한국어	-
dc.language.iso	KOR	-
dc.publisher	서강대학교 언어정보연구소	-
dc.title	발표문 텍스트의 어휘사용 특성: TED-LIUM과 BNC 코퍼스 비교	-
dc.title.alternative	Word Frequency Information in Presentation Transcripts: Comparing TED-LIUM Corpus and BNC Corpus	-
dc.type	Article	-
dc.publisher.location	대한민국	-
dc.identifier.doi	10.29211/soli.2016.29..004	-
dc.identifier.bibliographicCitation	언어와 정보 사회, v.29, pp 93 - 122	-
dc.citation.title	언어와 정보 사회	-
dc.citation.volume	29	-
dc.citation.startPage	93	-
dc.citation.endPage	122	-
dc.identifier.kciid	ART002169726	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	kci	-
dc.subject.keywordAuthor	테드리움코퍼스	-
dc.subject.keywordAuthor	BNC코퍼스	-
dc.subject.keywordAuthor	강연자료	-
dc.subject.keywordAuthor	어휘빈도	-
dc.subject.keywordAuthor	어휘길이(글자	-
dc.subject.keywordAuthor	자소)	-
dc.subject.keywordAuthor	어휘다양성	-
dc.subject.keywordAuthor	어휘타입빈도비율	-
dc.subject.keywordAuthor	TED-LIUM corpus	-
dc.subject.keywordAuthor	BNC(British National Corpus)	-
dc.subject.keywordAuthor	presentation transcripts	-
dc.subject.keywordAuthor	word frequency	-
dc.subject.keywordAuthor	word length(letter and segment)	-
dc.subject.keywordAuthor	lexical diversity	-
dc.subject.keywordAuthor	TTR(Type-Token Ratio)	-

Files in This Item: There are no files associated with this item.

Appears in Collections: 사범대학 > 영어교육과 > Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Kim, Mi Ran photo

Kim, Mi Ran: 사범대학 (영어교육과)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

Gyeongsang National University Central Library, 501, Jinju-daero, Jinju-si, Gyeongsangnam-do, 52828, Republic of Korea+82-55-772-0532

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE