HyRead Journal 台灣全文資料庫

文章詳目資料

測驗學刊 TSSCI

人文科學/教育

篇名	九種古典測驗理論信度指標精確性之研究
卷期	65:2
並列篇名	A Comparison of Precision of Nine Reliability Estimates Based on Classical Test Theory
作者	蔡佩圜、凃柏原、吳裕益
頁次	217-240
關鍵字	最大信度估計下限、驗證性因素分析、 confirmatory factor analysis 、 the greatest lower bound reliability 、 TSSCI
出刊日期	201806

中文摘要

本研究採用因素結構已知的驗證性因素分析模式來產生模擬資料，探討測驗因素數目、題數、樣本數三個自變項，對g1b、1、2、3 、4、5、h、t等九種信度估計方法的偏誤、絕對偏誤、誤差均方根三個依變項之影響，藉以評估不同信度估計指標之精確性。研究結果顯示：(1)傳統最常使用的信度估計值3 僅適合用來分析單向度測驗，若為多因素測驗，則會明顯低估信度真值；(2) 4及t無論在何種情境其信度估計誤差均極微，建議盡可能採用這兩種信度估計值，當測驗資料之因素結構很明確時，最適合以t來估計整體之信度，若因素結構不明確時，最適合以4來估計整體之信度；(3)除非是分析母群資料，否則g1b有高估信度真值的現象，不適合稱之為最大信度下限；(4) h與t之比值是g因素解釋率占所有共同因素（包括g 與所有f）總解釋率之比率，建議以h與t之比值作為評估測驗是否接近單向度的指標。本研究之分析結果可提供給測驗使用人員依不同測驗情境選擇較適切之信度估計指標。

英文摘要

The purpose of this research is mainly to analyze the accuracy of different reliability index by employing g1b、1、2、3 、4、5、h、t as the major arguments. Confirmatory Factor Analysis (CFA) is utilized for simulating data in this experiment, basically relying on independent variables (the number of test factors, the number of test items, the number of sample sizes) and dependent variable (bias, absolute mean bias, root mean squared error). The statistical results and analyses are described as following: (1) 3 , the most commonly and traditionally used, only suitable for the analysis of one-dimension test, reliability index value will be significantly underestimated if multi-factor test takes place. (2) t、4 display best values of reliability estimation with extreme little error, it is recommended that these two can be used as much as possible. When the structure of factor of the test data is very clear, t is the most suitable role to estimate the overall reliability. On the other hand, if it is not clear, then 4 is the appropriate candidate to do the work. (3) Unless it is for analyzing the parent group data, then g1b shows a high estimated value of reliability which is not proper to name it as the greatest lower bound reliability. (4) The ratio of h to t is the ratio of the explanatory rate of g factor to the total explanatory rate of all common factors (including g and all f). It is recommended that it can be used as an indicator of whether the undergoing test is close to one dimension. The results of this study can provide testing persons with more appropriate estimates of reliability indicators according to different test scenarios.

本卷期文章目次

關鍵知識WIKI