Finding Appropriate Lexical Diversity Measurements for Small-Size Corpus
In the present investigation four kinds of lexical diversity measurement have been applied to the sets of word chunks with monotone increasing size.The computational experiment with corpus processing and statistical test has been conducted to find out the most effective lexical diversity measurement in evaluating a small-sized corpus of 350~550 words,and the result shows that D-estimate is the most appropriate among the four lexical diversity measurements which are considered in this research.Also D-estimate shows more stable results than other measurements when the number of words varies between texts.
TTR (Type-Token Ratio) D-estimate Yules K Guirauds R lexical diversity
Woonho Choi
Dept.of Linguistics, Seoul National University,Gwanak 1, Gwanak-ro, Gwanak-gu, 151-742, Seoul, Korea
国际会议
台湾
英文
1244-1248
2011-12-11(万方平台首次上网日期,不代表论文的发表时间)