Multiple indices have been proposed claiming to measure the amount of agreement between ratings of two or more judges on a multi-item measure. Unfortunately, simulation work based on these indices is lacking; thus we are left with very little understanding of exactly what should be expected of these indices and when they should work. The present investigation seeks to bridge this gap in the literature by comparing several of the more commonly used measures of interrater agreement via an Item Response Theory (IRT) model. The goal is to identify which agreement indices best recover true agreement. In this manuscript, several agreement indices are compared. Among these are the kappa coefficient kappam (Fleiss, 1971); the intraclass correlation,...
The aim of this study is to introduce weighted inter-rater agreement statistics used in ordinal scal...
Agreement between observers (i.e., inter-rater agreement) can be quantified wi...
Agreement among raters is an important issue in medicine, as well as in education and psychology. Th...
In 1960, Cohen introduced the kappa coefficient to measure chance-corrected nominal scale a...
In 1960, Cohen introduced the kappa coefficient to measure chance‐corrected nominal scale agreement ...
The statistical methods described in the preceding chapter for controlling for error are applicable ...
An index for assessing interrater agreement with respect to a single target using a multi-item ratin...
Kappa statistics are used for the assessment of agreement between two or more raters when th...
A latent variable modeling method for evaluation of interrater agreement is outlined. The procedure ...
This paper presents a critical review of some kappa-type indices proposed in the literature to measu...
Some characteristics of Hubert’s Γ, as a measure of nominal scale response agreement, are sh...
The evaluation of agreement among experts in a classification task is crucial in many situations (e....
OBJECTIVE: The overall objective was to unfold the phenomenon of interrater agreement: to identify p...
This research discusses the use of Cohen’s κ (kappa), Brennan and Prediger’s κn, and the c...
Many methods for measuring agreement among raters have been proposed and applied in many domains i...