篇名 | AN INTEGRATED APPROACH TO FUNCTIONAL CORPUS CONSTRUCTION |
---|---|
卷期 | 13:1 |
並列篇名 | 功能語料庫的一體化構建方法 |
作者 | 嚴恒斌 、 Jonathan Webster |
頁次 | 053-077 |
關鍵字 | corpus annotation 、 linguistic function 、 collaborative annotation 、 functional semantics 、 語料庫標註 、 語言功能 、 協作性標註 、 功能語義 、 Scopus 、 THCI |
出刊日期 | 201503 |
DOI | 10.6519/TJL.2015.13(1).3 |
本文論述作者基於系統功能語法框架,構建一個全新語料庫的經驗。我們 從Penn Treebank語料庫中選取部份文本,通過一個基於網絡且有著多項高 級特性的協作性平台對文本進行標註。我們首先討論我們項目的背景和目 的,然後提出我們針對協作性標註過程中所遇到的一些問題和挑戰的解決 方法。我們初步構建的語料庫有著較為精確的高質量標註,可對現有的基 於語義標註的語料庫資源作有益的補充,同時也為進一步開發相關的大型 功能語言學資源乃至語言功能自動分析系統的構建打下基礎。
In this paper, we present our recent experience in constructing a first-of-its-kind functional corpus based on the theoretical framework of Systemic Functional Linguistics. Annotated on selected texts from the Penn Treebank, the corpus was built by a collaborative team on a web-based annotation platform with several advanced features. After a discussion on the background and motivation of the project, we present our solutions to some of the challenges encountered in the collaborative annotation process. With fine-grained annotations of an initial corpus now available, the corpus can serve as a valuable linguistic resource that complements existing semantically annotated corpora and aids in the development of a larger-scale resource crucial for automated systems for analysis of linguistic function.