文章詳目資料

電子商務學報 TSSCI

  • 加入收藏
  • 下載文章
篇名 基於決策樹與二元語言模型的網路用語轉譯系統
卷期 17:1
並列篇名 An Internet Slang Translator Based on Decision Tree and Bi-gram Language Model
作者 楊亨利黃泓彰林青峰
頁次 025-048
關鍵字 網路用語網路流行語文字正規化決策樹語言模型Internet slangInternet buzzwordstext normalizationdecision treebi-gram language modelTSSCI
出刊日期 201503
DOI 10.6188/JEB.2015.17(1).02

中文摘要

網路文章中含有的網路用語或網路流行語,對於以正規中文為對象的文字分析而 言是一個阻礙分析的問題;若將網路用語轉譯為正規中文將會有助於取得更多可用的 資訊。為了將網路用語轉譯為正規中文,本研究蒐集網路用語的定義與網路文章,將 網路用語分類後,運用決策樹和語言模型的轉譯方法,對各類用語作合適的轉譯。轉 譯系統能夠偵測並轉譯約81% 的網路用語,其轉譯的精確度約為90%;因此,本研 究所提出之以決策樹和語言模型為基礎之系統應可適合網路用語的轉譯。

英文摘要

While conducting text mining on Chinese content, Internet slang is a problem which results in a lower accuracy of text segmentation. Translating Internet slang into formal Chinese would help segmentation and, in addition, revealing the implicit information of the slang. In order to translate Internet slang, this study first collected meanings of slangs and web texts. Next, Internet slang was categorized, and translating methods, which are mainly based on decision tree and bi-gram language model, were developed for each category. The translator was then implemented. Eighty-one percentages of the Internet slang in web texts were correctly detected and translated, with a precision at ninety percentages. It is concluded that the proposed methods are quite applicable to Internet slang translation.

相關文獻