论文部分内容阅读
本文介绍近期完成的国家自然科学基金项目<藏缅语语料库及比较研究的计量描写>的软件系统。该系统建立了我国境内藏缅语族五大语支82个语言点16万词条的开放性词汇语音数据库。研制了语言特征统计,语言比较研究软件。设计了应用于多种语言谱系分类比较研究的语音对应关系“全方位交叉”算法。对藏语方言的音节、音位、声母、韵母、声词、词素、构词能力和语音结构等10余项特征做了分布和对比统计。对藏语15个方言点做了语音对应关系和音系对比关系的量化描述,并在此基础上做出具有历时与共时比较研究意义的R相关和Φ相关分析,得出了语言分类的相关矩阵和聚类分析图表。
This article introduces the software system of National Natural Science Foundation of China recently completed, “Measurement and Description of Tibetan-Burman Corpus and Comparative Study”. The system has established an open vocabulary speech database of 82 languages and 160,000 entries in the five languages of Tibeto-Burman languages in our country. Developed language feature statistics, language comparison research software. The algorithm of “omni-directional crossover” for phonetic correspondence applied to the comparative study of multilingual linguistic classification was designed. More than 10 items such as syllables, phonemes, initials, vowels, phonetic words, morphemes, word-formation ability and phonetic structure of Tibetan dialects were distributed and compared statistically. We make a quantitative description of phonetic correspondence and Phonetic relationship between 15 dialects of Tibetan language and make a R-correlation and Phi-correlation analysis with diachronic and synchronic comparative studies, and draw the correlation matrix of linguistic classification And cluster analysis chart.