In this paper, we develop a new methodology for comparing normalization procedures based on different classification systems. Firstly, a pair of normalization procedures should be compared using their own classification systems for evaluation purposes. Secondly, when the two procedures are noncomparable according to the above test, then evaluation using a third (or more) classification systems may be forthcoming. In the empirical part of the paper we use: (i) the IDCP method for the evaluation of normalization procedures; (ii) two nested classification systems consisting of 219 sub-fields and 19 fields, together with a systematic and a random assignment of articles to sub-fields (or fields) with the aim of maximizing or minimizing di...