您的位置:山东大学 -> 科技期刊社 -> 《山东大学学报(工学版)》

山东大学学报(工学版) ›› 2010, Vol. 40 ›› Issue (5): 105-111.

• 论文 • 上一篇    下一篇

基于词语量化相关关系的句际相关度计算

钟茂生1, 刘慧2, 邹箭3   

  1. 1. 华东交通大学信息工程学院, 江西 南昌 330013; 2. 上海对外贸易学院商务信息学院, 上海 201620;
    3. 江西司法警官职业学院图书馆, 江西 南昌 330013
  • 收稿日期:2010-01-28 出版日期:2010-10-16 发布日期:2010-01-28
  • 作者简介:钟茂生(1974-),男,江西兴国人,博士,讲师,研究方向为自然语言处理、信息检索.Email:zhongmaosheng@sina.com
  • 基金资助:

    教育部人文社科基金资助项目(09YJC740027),江西省高校人文社科基金资助项目(YY1005)

The inter-sentence semantic relevancy degree calculation using the quantified correlation of words

ZHONG Maosheng 1,  LIU Hui2,  ZOU Jian3   

  1. 1. School of Information Engineering, East China Jiaotong University, Nanchang 330013, China;
    2.  Business Information Management School, Shanghai Institute of Foreign Trade, Shanghai 201620, China;
    3. Library, Jiangxi Justice Police Vocational College, Nanchang 330013, China
  • Received:2010-01-28 Online:2010-10-16 Published:2010-01-28

摘要:

语篇上下文句子之间形式上是否衔接、意义上是否关联,是文本推理和文本结构分析的重要依据。上下文句子之间的关联性分析可分为定性和定量两种形式,作者以词语相关关系量化分析为基础,将上下文句子之间的相关度看成是由组成两个句子的所有词对之间的量化相关关系的组合结果,来定量地计算句子之间的相关度。实验结果表明,文中给出的句子相关度计算方法能够避免相似性度量中句子长度窗口受限问题;同时,与人工给出的句子间相关度进行相关系数计算表明,该方法能够很好的模拟人的认知来计算句子之间的语义相关性。

关键词: 词语量化关系, 句际相关度, 句际相似度

Abstract:

The coherence in form and relevancy in meaning between sentences in a context are the important grounds in text reasoning and text-structure analysis. There are two ways of analyzing the inter-sentence correlativity in a context—qualitatively and quantitatively. According to the analysis of quantified correlation between words, assuming that the inter-sentence correlativity in a context was the outcome of the quantified correlation between word-pairs, which consist of the adjacent sentences, this paper measured quantitatively the degree of semantic relevancy between sentences. The experiment results show that this method of relevancy measurement can avoid the window-length constraint which exists in similarity measurement; in addition, the calculation for the correlation coefficient of correlativity between sentences, done by hand and done by computer, indicates that the method can well-simulate the cognition of the human brain to measure the semantic relevancy between sentences in a context.

Key words: quantified correlation of words, inter-sentence relevancy degree, inter-sentence similarity

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!