文章詳目資料

International Journal of Computational Linguistics And Chinese Language Processing THCI

  • 加入收藏
  • 下載文章
篇名 Segmentation Standard for Chinese Natural Language Processing
卷期 2:2
作者 Huang, Chu-renChen, Keh-jiannChen, Feng-yiChang, Li-li
頁次 047-062
關鍵字 THCI Core
出刊日期 199708

中文摘要

英文摘要

This paper proposes a segmentation standard for Chinese natural language
processing. The standard is proposed to achieve linguistic felicity, computational feasibility, and data uniformity. Linguistic felicity is maintained by a definition of segmentation unit that is equivalent to the theoretical definition of word, as well as a set of segmentation principles that are equivalent to a functional definition of a word.
Computational feasibility is ensured by the fact that the above functional definitions are procedural in nature and can be converted to segmentation algorithms as well as by the implementable heuristic guidelines which deal with specific linguistic categories. Data uniformity is achieved by stratification of the standard itself and by defining a standard lexicon as part of the standard.

關鍵知識WIKI

相關文獻