A sequence of simulations was carried out to aid in the diagnosis and interpretation of equating differences found between random and matched (nonrandom) samples for four commonly used equating procedures: (1) Tucker linear observed-score equating; (2) Levine equally reliable linear observed-score equating; (3) equipercentile curvilinear observed-score equating; and (4) item response theory (IRT) curvilinear true-score equating. The results support the prediction based on theoretical grounds that observed-score equating methods are more affected by sample variation than are true-score equating methods. These results further suggest that matching equating samples on the basis of fallible measures of ability may not be advisable for any conve...
The kernel method of test equating is a unified approach to test equating with some advantages over ...
Item response theory observed-score equating (IRTOSE) is widely used in many testing programs. The a...
Two methods of 'equating' tests are compared, one using true scores, the other using equipercentile...
The application of item response theory (IRT) methodology to test equating has been a rese...
The overall aim of this work is to review test equating methods with a particularly detailed descrip...
This study explored the effect of item difficulty and sample size on the accuracy of equating by usi...
The need to compare students across different test administrations, or perhaps across different test...
The purpose of this paper was to examine alternative techniques for quantifying the errors associat...
This paper discusses the four major types of test equating: (1) mean; (2) linear; (3) equipercentile...
This paper focuses on a discussion of how various equating methods are affected by (1) sampling err...
Test equating is a statistical procedure to ensure that scores from different test forms can be used...