论文部分内容阅读
文章研究的是在维吾尔文字语料库建立过程中,从MS-DOS系统上排版的书刊,杂志中获得维吾尔语单词,并转换到WINDOWS环境上RTF格式的一种快速解决方法,然后提出维吾尔文字Unicode代码对应的RTF代码表和动态生成维吾尔文RTF文件的简单方法。实践证明这种方法有助于提高语料库构造中的大单词收集的效率和质量。
This article studies a fast solution to Uyghur language corpus in the process of obtaining Uyghur words from books and magazines formatted on MS-DOS system and converting to RTF format on WINDOWS environment, and then proposes Uighur text Unicode code Corresponding RTF code tables and easy way to dynamically generate Uighur RTF files. Practice has proved that this method helps to improve the efficiency and quality of large word collection in corpus construction.