文章詳目資料

測驗學刊 TSSCI

  • 加入收藏
  • 下載文章
篇名 以可能值方法為基礎之多向度能力值垂直等化探究
卷期 62:2
並列篇名 The Research in Estimating Multidimensional Traits under Vertical Equating Based on Plausible Value Method
作者 吳慧珉郭伯臣許天維陳婉寧
頁次 095-126
關鍵字 大型測驗可能值方法多向度試題反應理論垂直等化能力估計Large-scale assessmentsMIRTplausible value methodtrait estimationvertical equatingTSSCI
出刊日期 201506

中文摘要

現今國際上幾個著名大型測驗均使用可能值方法呈現群體參數,因可能值方法在群體參數的回復性極佳,且大型測驗關注的焦點正是群體參數。建置大型測驗的目的通常是為了長期的教育成效評估,因此,如何檢視學生是否隨著年級不同而在某些能力值上有所不同,便成了一項值得關注的議題。透過垂直等化能使不同年級的受試者分別接受符合於其能力範圍的試題之後,將測量結果建置在同一量尺上,以進行能力高低之比較。本研究以多向度試題反應理論為基礎,使用垂直等化設計,探討不同題數、不同向度數對於能力參數估計的影響,並以不同估計方法與可能值方法進行比較。研究結果顯示,可能值方法在群體標準差的估計上有極佳的精準度,而群體能力平均數的估計則與其他估計法差不多;在多向度垂直等化設計下,每向度所對應的題數較多時則估計的效果較好。

英文摘要

The purpose of large-scale assessment is to monitor group progress. Therefore, group statistics are what the large-scale assessment focus on. Plausible value method is proposed to be a great method that measures population statistics accurately so it is used to provide students’ achievement data by some significant large-scale assessment programs. Vertical equating is the way test publishers used to longitudinally evaluate achievement that spans grade levels. This research is aimed to analysis if: (1) the method that used to estimate parameters; (2) the number of item for each dimension whether or not impact on the recovery of ability parameters of group statistics, based on multidimensional item response theory (MIRT) with the vertical equating design. The result indicates that plausible value method recovers the standard deviation very well but not outstands in recovering the population means. When using MIRT vertical design, parameters are estimated better when the number of items is more.

相關文獻