While each phase of the test development process is crucial to the validity of the examination, one phase tends to stand out among the others: the standard setting process. The standard setting process is a time-consuming and expensive endeavor. While it has received the most attention in the literature among any of the technical issues related to criterion-referenced measurement, little research attention has been given to generalizing the resulting performance standards. This procedure has the potential to improve the standard setting process by limiting the number of items rated and the number of individual rater decisions. The ability to generalize performance standards has profound implications both from a psychometric as well as a pra...
There are at least two concepts of test reliability. For over 50 years, measurement specialists perc...
This study used computer simulation to investigate the relative performance of three rater allocatio...
A new method is proposed to set multiple standards in performance tests. The method combines three s...
While each phase of the test development process is crucial to the validity of the examination, one ...
In recent years, performance assessments have become increasingly popular in medical education. Whil...
The trustworthiness of performance standards influences the credibility of criterion-referenced larg...
Essential for the validity of the judgments in a standard-setting study is that they follow the impl...
In this paper, performance assessments are cast within a sampling framework. A performance assessmen...
Judgmental standard setting methods, such as the W. H. Angoff (1971) method, use item performance es...
Studies of the Angoff method of standard setting suggest that judges agree in their estimates of the...
While a test score may be valid for a group there may sometimes be reason to suspect its validity fo...
Technicai and other standards and criteria required for multiple choice educational achievement test...
ABSTRACT. This review identifies 38 methods for either setting standards or adjusting them based on ...
In testing, setting performance standards involves identifying cut scores that divide examinees into...
In this research, it has been analyzed how variation in facet number affects reliability with the te...
There are at least two concepts of test reliability. For over 50 years, measurement specialists perc...
This study used computer simulation to investigate the relative performance of three rater allocatio...
A new method is proposed to set multiple standards in performance tests. The method combines three s...
While each phase of the test development process is crucial to the validity of the examination, one ...
In recent years, performance assessments have become increasingly popular in medical education. Whil...
The trustworthiness of performance standards influences the credibility of criterion-referenced larg...
Essential for the validity of the judgments in a standard-setting study is that they follow the impl...
In this paper, performance assessments are cast within a sampling framework. A performance assessmen...
Judgmental standard setting methods, such as the W. H. Angoff (1971) method, use item performance es...
Studies of the Angoff method of standard setting suggest that judges agree in their estimates of the...
While a test score may be valid for a group there may sometimes be reason to suspect its validity fo...
Technicai and other standards and criteria required for multiple choice educational achievement test...
ABSTRACT. This review identifies 38 methods for either setting standards or adjusting them based on ...
In testing, setting performance standards involves identifying cut scores that divide examinees into...
In this research, it has been analyzed how variation in facet number affects reliability with the te...
There are at least two concepts of test reliability. For over 50 years, measurement specialists perc...
This study used computer simulation to investigate the relative performance of three rater allocatio...
A new method is proposed to set multiple standards in performance tests. The method combines three s...