A corpus is, in essence, a kind of database whose purpose is to store language material effectively. With the continuing development of computer network technology, a speech corpus of the Hakka dialects of southern Jiangxi (Gannan), together with a retrieval platform, is under construction. A database and a corpus are not fully equivalent, however: the two overlap in part, but each also has its own distinct features. This paper gives an overview of dialect databases, describes in some detail the systems engineering involved in developing one, and proposes strategies for its construction.