论文部分内容阅读
To RCH2009语料库是按布朗语料库(Brown corpus)取样方案创建的100万词次现代汉语语料库。该语料库所收15种文本类别情况,可见下表。To RCH2009语料库创建的突出特点是“共建共享”。它是由全国64所高校的115位老师和硕博士生参与语料收集,共同创建的现代汉语语料库。To RCH2009语料库项目由北京外国语大学中国外语教育研究中心许家金教授发起,并统筹语料收集、整理和校对工作。
The To RCH2009 corpus is a 100-million-word modern Chinese corpus created by the Brown corpus sampling program. The corpus received 15 kinds of text categories, see the table below. The salient feature of To RCH2009 corpus creation is “co-construction sharing.” It is a collection of modern Chinese corpus, which is collected by 115 teachers and master doctoral students from 64 universities in China. To RCH2009 corpus project by the Beijing Foreign Studies University Chinese Foreign Language Education Research Center Professor Xu Jiajin initiated and co-ordinate the corpus collection, collation and proofreading.