Lexical characteristics of dialogue acts in the Switchboard Corpus of telephone conversations

Source: Foreign Language Teaching and Research (外语教学与研究)
This paper describes a corpus-based investigation of dialogue acts (DAs). In particular, it addresses two questions: how DAs are empirically distributed in a publicly accessible corpus of transcribed conversations, and how lexically similar DAs are to one another. The research not only lends itself to a deeper understanding of the lexical realisation of dialogue acts as communicative functions but also contributes to dialogue act recognition. The experiments use the Switchboard Dialogue Act Corpus and its SWBD-DAMSL tags. Two statistical measures are applied: lexical dispersion, and lexical similarity computed using Chi by degrees of freedom (CBDF). We show that the distribution of DAs is uneven, that a majority of the word types occur in only one DA, and that only a small fraction of word types are used by all the DAs. Our hierarchical clustering of DAs based on CBDF suggests that there is an intrinsic structure of DAs pertaining to fact finding and verification, information provision, and inter-speaker agreement and affinity building. The paper concludes with discussion and suggestions for future work.
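The two measures named in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's exact formulas, so what follows is an assumption: dispersion is taken to be the number of distinct DA tags a word type occurs in, and CBDF is taken to be a chi-square statistic over the most frequent words of two word-count profiles, divided by the degrees of freedom (number of words minus one). The function names, the `top_n` parameter, and the toy utterances are hypothetical and are not drawn from the Switchboard data.

```python
from collections import Counter


def dispersion(utterances):
    """Map each word type to the number of distinct DA tags it occurs in.

    `utterances` is an iterable of (da_tag, text) pairs (assumed format).
    """
    tags_by_word = {}
    for tag, text in utterances:
        for word in text.lower().split():
            tags_by_word.setdefault(word, set()).add(tag)
    return {word: len(tags) for word, tags in tags_by_word.items()}


def cbdf(counts_a, counts_b, top_n=500):
    """Chi-square over the top_n joint words, divided by degrees of freedom.

    Assumed definition; lower values mean the two profiles are more similar.
    """
    total_a = sum(counts_a.values())
    total_b = sum(counts_b.values())
    if total_a == 0 or total_b == 0:
        raise ValueError("both word-count profiles must be non-empty")
    joint = counts_a + counts_b
    words = [word for word, _ in joint.most_common(top_n)]
    if len(words) < 2:
        return 0.0
    chi2 = 0.0
    for word in words:
        for observed, total in ((counts_a[word], total_a),
                                (counts_b[word], total_b)):
            # Expected count if the word were spread proportionally to size.
            expected = joint[word] * total / (total_a + total_b)
            chi2 += (observed - expected) ** 2 / expected
    return chi2 / (len(words) - 1)


# Invented toy utterances labelled with SWBD-DAMSL-style tags.
toy = [
    ("sd", "i live in texas"),
    ("qy", "do you live in texas"),
    ("b", "uh-huh"),
    ("aa", "yeah i agree"),
]
spread = dispersion(toy)
statement = Counter("i live in texas".split())
question = Counter("do you live in texas".split())
score = cbdf(statement, question)
```

Pairwise `cbdf` scores between all DA tags would form a distance-like matrix, which is the kind of input a hierarchical clustering routine expects.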