Standardized tests are equated and scaled in order that scores on different tests can be compared. If one test yields higher expected scaled scores than another, the scale is biased against those who take the latter test. The amount of bias, defined as the difference between expected values, depends on ability. This paper presents two methods for estimating this relationship and the bias in the scale, using a predictor as the measure of ability. The resulting evaluation is absolute in the sense that the scale is judged according to its own properties and not by comparison with an arbitrarily designated criterion scale. Moreover, there is no need to assume a particular theoretical model to be correct. An application of the meth...
In recent years, many states have adopted Item Response Theory (IRT) based vertically scaled tests d...
Two methods of ‘equating’ tests are compared, one using true scores, the other using equipercentile...
Test publishers generally choose an anchor or scaling test approach to the development of a growth ...
The need to compare students across different test administrations, or perhaps across different test...
Traditionally, error in equating observed scores on two versions of a test is defined as the differe...
Previous research on the application of IRT methodology to vertical test equating has demonstrated ...
The Stocking and Lord (1983) procedure for computing equating coefficients for tests having dichot...
Situations where scale parameters are not nuisance factors to be controlled but outcomes to be expl...
The metric transformations of the ability scales involved in three equating techniques: external anc...
Wright (1977) outlined procedures for equating tests and test scores using the Rasch model. This p...
Test of English as a Foreign Language (TOEFL) test scores were evaluated in terms of scale stability. True score item res...
This book provides an introduction to test equating, scaling, and linking, including those concepts ...
For the external-anchor test equating model, two observed-score methods are derived using the slope...