会议专题

Finding Appropriate Lexical Diversity Measurements for Small-Size Corpus

  In the present investigation four kinds of lexical diversity measurement have been applied to the sets of word chunks with monotone increasing size.The computational experiment with corpus processing and statistical test has been conducted to find out the most effective lexical diversity measurement in evaluating a small-sized corpus of 350~550 words,and the result shows that D-estimate is the most appropriate among the four lexical diversity measurements which are considered in this research.Also D-estimate shows more stable results than other measurements when the number of words varies between texts.

TTR (Type-Token Ratio) D-estimate Yules K Guirauds R lexical diversity

Woonho Choi

Dept.of Linguistics, Seoul National University,Gwanak 1, Gwanak-ro, Gwanak-gu, 151-742, Seoul, Korea

国际会议

the Second International Conference on Frontiers of Manufacturing and Design Science(第二届制造与设计科学国际会议(ICFMD 2011))

台湾

英文

1244-1248

2011-12-11(万方平台首次上网日期,不代表论文的发表时间)