This investigation examined the practice of using field-test item calibrations, obtained in advance of the operational administration of a large-scale assessment, for equating and scaling. The effectiveness of this method, often termed “pre-equating,” is explored for a statewide, high-stakes assessment in grades three, five, and seven in the content areas of language arts, mathematics, and social studies. Pre-equated scaling was based on Rasch-model item calibrations from an off-grade field test in which the students tested were one grade higher than the target population. These calibrations were compared with those obtained from post-equating, which used the full statewide population of examinees. Item difficulty estimate...
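As an illustration of the kind of comparison described above, the following is a minimal sketch, not the study's actual procedure. It assumes two sets of Rasch item difficulties (in logits) for the same items, one from a field test and one from the operational administration, and summarizes how far they disagree once a constant mean shift is removed. The function names (`rasch_prob`, `compare_calibrations`) and the difficulty values are hypothetical and chosen only for demonstration.

```python
import math


def rasch_prob(theta, b):
    """Rasch model: probability of a correct response for ability theta and item difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def compare_calibrations(pre, post):
    """Compare two sets of Rasch difficulties (logits) estimated for the same items.

    pre  : difficulties from the field test (pre-equating), hypothetical input
    post : difficulties from the operational data (post-equating), hypothetical input
    """
    if len(pre) != len(post) or len(pre) < 2:
        raise ValueError("need two equal-length lists with at least two items")
    n = len(pre)
    mean_pre = sum(pre) / n
    mean_post = sum(post) / n

    # Constant shift between the two scales (mean/mean linking constant).
    shift = mean_post - mean_pre

    # Item-level displacement that remains after the constant shift is removed.
    displacement = [(q - p) - shift for p, q in zip(pre, post)]
    rmsd = math.sqrt(sum(d * d for d in displacement) / n)

    # Pearson correlation between the two sets of estimates.
    cov = sum((p - mean_pre) * (q - mean_post) for p, q in zip(pre, post))
    ss_pre = sum((p - mean_pre) ** 2 for p in pre)
    ss_post = sum((q - mean_post) ** 2 for q in post)
    correlation = cov / math.sqrt(ss_pre * ss_post)

    return {
        "shift": shift,
        "displacement": displacement,
        "rmsd": rmsd,
        "correlation": correlation,
    }


if __name__ == "__main__":
    # Made-up difficulties for five items; not data from the study.
    pre_b = [-1.0, -0.5, 0.0, 0.5, 1.0]
    post_b = [-0.7, -0.3, 0.3, 0.7, 1.5]
    summary = compare_calibrations(pre_b, post_b)
    for key, value in summary.items():
        print(key, value)
```

A large shift, or large displacements for individual items, would suggest that the field-test calibrations do not transfer cleanly to the operational population. Any threshold for "large" is a policy decision that this sketch does not attempt to set.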
The aim of this study is to introduce the concept of equating, its implications and methods ...
Foreign Language (TOEFL) test scores were evaluated in terms of scale stability. True score item res...
This article describes AYP and some of the psychometric issues it raises. It examines scaling as a m...
The statistical procedure used in adjusting test score difficulties on test forms is known as “equating”...
The need to compare students across different test administrations, or perhaps across different test...
In large-scale assessments, such as state-wide testing programs, national sample-based assessments, ...
This quantitative study examines whether standards-based grade reporting accurately informs student ...
The purpose of this paper was to examine alternative techniques for quantifying the errors associat...
The practice of equating educational and psychological tests to create comparable and interchangeabl...
Common Core State Standards in English Language Arts and Mathematics at grades K to 12 were introduc...
Includes bibliographical references (pages [103]-108). This dissertation investigated the ability of ...
Test publishers generally choose an anchor or scaling test approach to the development of a growth ...
Targeted for policy-makers, this article provides suggestions for ensuring that the benefits of usin...