论文部分内容阅读
一 语料库建设和语料的加工 语言知识究竟主要出自语言学家的内省还是来自对客观真实语料的观察,历来是语言学研究中理性主义和经验主义两种不同方法的分水岭。尤其是如果把大规模真实文本的描写和处理作为语言学研究和语言信息处理产业的战略目标来看待,那么大规模计算机语料库的价值就显得更其重要了。这也就是近十年来语料库语言学迅速崛起的主要动力。
Whether language knowledge of corpus construction and processing of corpus is mainly derived from the introspection of linguists or from the observation of objective corpus, which has always been the watershed of two different approaches of rationalism and empiricism in linguistic research. In particular, the value of a large-scale computer corpus is even more important if the description and processing of large-scale real texts are viewed as a strategic goal in the linguistic research and linguistic information processing industries. This is also the main motivation for the rapid rise of linguistics in the past decade.