文章詳目資料

International Journal of Computational Linguistics And Chinese Language Processing THCI

  • 加入收藏
  • 下載文章
篇名 A Thesaurus-Based Semantic Classification of English Collocations
卷期 14:3
作者 Huang, Chung-chiKao, Kate H.Tseng, Chiung-huiChang, Jason S.
頁次 257-279
關鍵字 CollocationsMeaning Access Index and WordNetRandom Walk AlgorithmSemantic RelationsSemantic ClassificationTHCI Core
出刊日期 200909

中文摘要

英文摘要

Researchers have developed many computational tools aimed at extracting
collocations for both second language learners and lexicographers. Unfortunately, the tremendously large number of collocates returned by these tools usually overwhelms language learners. In this paper, we introduce a thesaurus-based semantic classification model that automatically learns semantic relations for classifying adjective-noun (A-N) and verb-noun (V-N) collocations into different thesaurus categories. Our model is based on iterative random walking over a weighted graph derived from an integrated knowledge source of word senses in WordNet and semantic categories of a thesaurus for collocation classification. We
conduct an experiment on a set of collocations whose collocates involve varying levels of abstractness in the collocation usage box of Macmillan English Dictionary. Experimental evaluation with a collection of 150 multiple-choice questions commonly used as a similarity benchmark in the TOEFL synonym test shows that a thesaurus structure is successfully imposed to help enhance collocation production for L2 learners. As a result, our methodology may improve the effectiveness of state-of-the-art collocation reference tools concerning the aspects of language understanding and learning, as well as lexicography.

相關文獻