AbstractInformation overload is a problem for users of MEDLINE, the database of biomedical literature that indexes over 17 million articles. Various techniques have been developed to retrieve high quality or important articles. Some techniques rely on using the number of citations as a measurement of an article’s importance. Unfortunately, citation information is proprietary, expensive, and suffers from “citation lag.” MEDLINE users have a variety of information needs. Although some users require high recall, many users are looking for a “few good articles” on a topic. For these users, precision is more important than recall. We present and evaluate a method for identifying articles likely to be highly cited by using information available a...