文章詳目資料

International Journal of Computational Linguistics And Chinese Language Processing THCI

  • 加入收藏
  • 下載文章
篇名 Discovering Correction Rules for Auto Editing
卷期 15:3/4
作者 Huang, An-taKuo, Tsung-tingLai, Ying-chunLin, Shou-de
頁次 219-235
關鍵字 Edit DistanceErroneous PatternCorrection RrulesAuto EditingTHCI Core
出刊日期 201012

中文摘要

英文摘要

This paper describes a framework that extracts effective correction rules from a sentence-aligned corpus and shows a practical application: auto-editing using the discovered rules. The framework exploits the methodology of finding the Levenshtein distance between sentences to identify the key parts of the rules and uses the editing corpus to filter, condense, and refine the rules. We have produced the rule candidates of such form, A B, where A stands for the erroneous pattern
and B for the correct pattern.The developed framework is language independent; therefore, it can be applied to other languages. The evaluation of the discovered rules reveals that 67.2% of the top 1500 ranked rules are annotated as correct or mostly correct by experts. Based
on the rules, we have developed an online auto-editing system for demonstration at http://ppt.cc/02yY.

相關文獻