In this study, four approaches to the estimation of interrater reliability are studied: correlation, comparison of means, percentage of agreement, and generalizability theory. For the data, composed of ratings for 43 students on ten items by two raters, the reliability estimates varied because the approaches differ in the range of values they can produce and in their calculation processes. The highest estimate, 0.90, was obtained with G theory. In addition, the correlation between the two raters' scores was positive and high (0.74). The percentage of exact agreement between the two raters was 58.9%. Finally, although there were no statistically significant differences between general mean ...
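To make the contrast between the approaches concrete, here is a minimal sketch of three of the four estimates (correlation, comparison of means, percentage of exact agreement) for two raters. The function names and the toy scores are assumptions for illustration only; they are not the study's data or code, and the G theory estimate is omitted because it needs a variance-component analysis.

```python
from math import sqrt
from statistics import mean, stdev


def pearson_r(x, y):
    """Pearson correlation between two raters' scores."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    ssx = sum((a - mx) ** 2 for a in x)
    ssy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(ssx * ssy)


def exact_agreement(x, y):
    """Percentage of cases where both raters gave the identical score."""
    return 100.0 * sum(a == b for a, b in zip(x, y)) / len(x)


def paired_t(x, y):
    """Paired t statistic for the difference between the raters' means."""
    d = [a - b for a, b in zip(x, y)]
    return mean(d) / (stdev(d) / sqrt(len(d)))


# Hypothetical scores for eight students (illustrative only).
rater_a = [3, 4, 2, 5, 4, 3, 1, 4]
rater_b = [3, 5, 2, 4, 4, 2, 1, 5]

print(round(pearson_r(rater_a, rater_b), 2))   # correlation
print(exact_agreement(rater_a, rater_b))       # percent exact agreement
print(paired_t(rater_a, rater_b))              # paired t statistic
```

On these toy scores the raters' means are identical and the correlation is high, yet they give the same score in only half of the cases. This is the kind of divergence between indices that the abstract reports.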
In this study, interrater reliability was compared based on both classical test theory and generaliz...
Inter-rater reliability coefficients are often reported in performance assessment as a measure of ra...
The kappa statistic is frequently used to test interrater reliability. The importance of rater relia...
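For reference, the kappa statistic for two raters (Cohen's kappa) corrects observed agreement for the agreement expected by chance. The sketch below is a minimal illustration; the function name `cohens_kappa` and the ratings are hypothetical and are not taken from the article.

```python
from collections import Counter


def cohens_kappa(x, y):
    """Cohen's kappa for two raters assigning categorical ratings."""
    n = len(x)
    # Observed proportion of exact agreement.
    p_o = sum(a == b for a, b in zip(x, y)) / n
    # Chance agreement from each rater's marginal category frequencies.
    cx, cy = Counter(x), Counter(y)
    p_e = sum(cx[k] * cy[k] for k in cx) / (n * n)
    return (p_o - p_e) / (1 - p_e)


# Hypothetical binary ratings for ten cases (illustrative only).
rater_a = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
rater_b = [1, 1, 1, 1, 0, 1, 0, 0, 0, 0]

print(round(cohens_kappa(rater_a, rater_b), 2))
```

Here the raters agree on 8 of 10 cases (observed agreement 0.8), while chance agreement from the marginals is 0.5, so kappa is (0.8 - 0.5) / (1 - 0.5) = 0.6.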
The agreement between raters is examined within the scope of the concept of “inter-rater reliability...
The aim of this study is to analyse the effects of the number of raters and the types of rubric on t...
It is erroneous to extend or generalize the inter-rater reliability coefficient estimated from only ...
232 p. Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1985. This thesis identified and st...
This article argues that the general practice of describing interrater reliability as a single, unif...
This article introduces the application of Generalizability Theory to assessing the reliability of m...
This article reports from a study of interrater reliability of constructed response items in standar...
All inter-rater reliabilities were measured using Krippendorff's alpha for ordinal data. *Include...
This paper presents the first meta-analysis for the inter-rater reliability (IRR) of journal peer re...
The statistical methods described in the preceding chapter for controlling for error are applicable ...