论文部分内容阅读
KWIC(Key Word in Context)索引既是语料库研究的基础方法和技术,也是通过语料库文本观察和描述语言数据,进而验证或构建语言理论的基本途径。本文追溯索引方法在西方的起源,并梳理了中国的类书传统及近代对西方索引方法的学习和借鉴;对比分析了中西传统索引实践的异同,并论述了西方索引方法与现代计算机技术的融合,以及计算机索引在语料库分析中的应用,标志了索引的语言学转向。最后,分析了语料库索引对于语料库驱动研究的意义,并展望了索引技术在语言大数据时代的进一步发展和创新。
KWIC (Key Word in Context) index is not only the basic method and technology of corpus research, but also the basic way to verify or construct linguistic theory by observing and describing linguistic data through corpus texts. This paper traced the origins of the indexing method in the West and combed the Chinese classics tradition and the modern learning and reference of the western indexing methods. The similarities and differences between the traditional indexing practices in China and the West were compared and analyzed. The integration of the western indexing method and modern computer technology was discussed. The application of computer index in corpus analysis marks the linguistic turn of index. Finally, the significance of corpus indexing for corpus-driven research is analyzed, and the further development and innovation of indexing technology in language big data era are prospected.