This paper studies the distribution and growth pattern of zero-repetition words (words that occur only once) across texts. The zero-repetition words in this empirical study were randomly drawn from 2,000 samples of 1,000 words each, all taken from the 100-million-word British National Corpus (BNC). The results show that in texts 1,000 words long, the ratio of zero-repetition words to vocabulary size is about 0.66. When text length increases to one million words, the ratio falls to about 0.38. In texts of the same length, the number of zero-repetition words is normally distributed and is linearly related to vocabulary size. The growth pattern of zero-repetition words across texts can be described by a power function model, which fits the observed growth curve well. This distribution of zero-repetition words is one of the main factors affecting vocabulary acquisition by Chinese learners of English.
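The quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the function names, the whitespace-style token list, and the log-log least-squares fit are all assumptions made for illustration. The power function is written here as y = a * x**b, where x is the cumulative text length in tokens and y is the number of zero-repetition words observed so far.

```python
import math
from collections import Counter


def zero_repetition_stats(tokens):
    """Return (zero-repetition count, vocabulary size, ratio) for a token list."""
    counts = Counter(tokens)
    vocabulary = len(counts)
    # A zero-repetition word is one whose frequency in the text is exactly 1.
    zero_rep = sum(1 for c in counts.values() if c == 1)
    ratio = zero_rep / vocabulary if vocabulary else 0.0
    return zero_rep, vocabulary, ratio


def growth_curve(tokens, step):
    """Record the zero-repetition count after every `step` tokens of running text."""
    counts = Counter()
    points = []
    for i, tok in enumerate(tokens, start=1):
        counts[tok] += 1
        if i % step == 0:
            points.append((i, sum(1 for c in counts.values() if c == 1)))
    return points


def fit_power_function(points):
    """Fit y = a * x**b by ordinary least squares on log-log axes (an assumed
    fitting method; the paper does not state how its model was estimated)."""
    usable = [(x, y) for x, y in points if x > 0 and y > 0]
    xs = [math.log(x) for x, _ in usable]
    ys = [math.log(y) for _, y in usable]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = math.exp(mean_y - b * mean_x)
    return a, b
```

Given a tokenized text, `zero_repetition_stats(tokens)` yields the ratio discussed above, and `fit_power_function(growth_curve(tokens, step))` estimates the two parameters of the growth model. The fit needs at least two points with distinct text lengths.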