Videotapes were developed depicting persons performing on two jobs. Fourteen expert judges then carefully assigned multidimensional ratings of effectiveness to each performer. Interjudge agreement among the experts was high (median intraclass r = .97), providing indirect validity evidence for these performance judgments and justifying the use of mean expert ratings as criterion “true scores” of effectiveness. The true scores were next used as criteria against which to judge the differential accuracy (Cronbach, L. J. Processes affecting scores on “understanding of others” and “assumed similarity.” Psychological Bulletin, 1955, 52, 177–193) of each of the 146 college-student raters who viewed the tapes. Scores reflecting halo, leniency/severity,...