It has been theorized and evidenced that traditional reliabilities calculated on tests consisting of context-dependent item sets yield inflated estimates. However, the degree of inflated reliability for different scoring techniques has not been observed. This study scored three context-dependent item sets as stand-alone items and as separate item sets using three different scoring techniques; number-right, polyweighting and the three-parameter IRT logistic model. Differences in reliability estimates for the stand-alone and item set treatment were determined for each scoring procedure and compared. These three scoring procedures were also compared to determine which procedure yielded the highest reliability and validity estimates and precisi...
The increased use of polytomous item formats has led assessment developers to pay greater attention ...
Lack of sufficient reliability is the primary impediment for generating and reporting subtest scores...
The ultimate goal of psychometric testing is to produce a score by which people can be differentiate...
Among the measurement techniques receiving greater attention is the context-dependent item set or te...
Reliability is usually estimated for a total score, but it can also be estimated for item scores. It...
Reliability is usually estimated for a total score, but it can also be estimated for item scores. It...
This study investigates the usefulness of item-score reliability as a criterion for item selection i...
Procedures based on item response theory (IRT) are widely accepted for solving various measurement p...
In psychology and education, tests are used to measure intelligence and school performance. Test sco...
In this study, it was compared different item scoring methods which consist of unweighted item scori...
Four methods of scoring multiple-choice items were compared: Dichotomous classical (number-correct),...
This study, to assess the robustness of Item Response Theory (IRT) equating to violation of the cond...
A Monte Carlo simulation study investigated the effect of scoring format, item parameterization, thr...
This study investigates the usefulness of item-score reliability as a criterion for item selection i...
Thesis (Ph. D.)--University of Hawaii at Manoa, 1995.Includes bibliographical references (leaves 154...
The increased use of polytomous item formats has led assessment developers to pay greater attention ...
Lack of sufficient reliability is the primary impediment for generating and reporting subtest scores...
The ultimate goal of psychometric testing is to produce a score by which people can be differentiate...
Among the measurement techniques receiving greater attention is the context-dependent item set or te...
Reliability is usually estimated for a total score, but it can also be estimated for item scores. It...
Reliability is usually estimated for a total score, but it can also be estimated for item scores. It...
This study investigates the usefulness of item-score reliability as a criterion for item selection i...
Procedures based on item response theory (IRT) are widely accepted for solving various measurement p...
In psychology and education, tests are used to measure intelligence and school performance. Test sco...
In this study, it was compared different item scoring methods which consist of unweighted item scori...
Four methods of scoring multiple-choice items were compared: Dichotomous classical (number-correct),...
This study, to assess the robustness of Item Response Theory (IRT) equating to violation of the cond...
A Monte Carlo simulation study investigated the effect of scoring format, item parameterization, thr...
This study investigates the usefulness of item-score reliability as a criterion for item selection i...
Thesis (Ph. D.)--University of Hawaii at Manoa, 1995.Includes bibliographical references (leaves 154...
The increased use of polytomous item formats has led assessment developers to pay greater attention ...
Lack of sufficient reliability is the primary impediment for generating and reporting subtest scores...
The ultimate goal of psychometric testing is to produce a score by which people can be differentiate...