Datasets used for the offline evaluation of recommender systems are collected through user interactions with an already deployed recommender system. However, such datasets can be subject to different types of biases including a system’s popularity bias. In this paper, we focus on assessing the influence of popularity on the offline evaluation of recommendation systems. Our insights from a deeper analysis based on popularity-stratified sampling reveal that the current offline evaluation of recommendation systems are sensitive to popular items, raising questions about conclusions driven from the offline comparison of recommendation models
Most recommender systems are evaluated on how they accurately predict user ratings. However, individ...
Recommender system evaluation usually focuses on the overall effectiveness of the algorithms, either...
Collaborative filtering algorithms have the advantage of not requiring sensitive user or item inform...
Datasets used for the offline evaluation of recommender systems are collected through user interacti...
Recommendation systems are often evaluated based on user’s interactions that were collected from an ...
Popularity is often included in experimental evaluation to provide a reference performance for a rec...
Recommender systems help people find relevant content in a personalized way. One main promise of suc...
In response to the quantity of information available on the Internet, many online service providers ...
International audienceRecommendation systems have been integrated into the majority of large online ...
International audienceRecommendation systems have been integrated into the majority of large online ...
Recently, a few papers report counter-intuitive observations made from experiments on recommender sy...
Recommender systems learn from historical users’ feedback that is often non-uniformly distributed ac...
Abstract. In academic studies, the evaluation of recommender system (RS) algorithms is often limited...
In this paper, we present the results of an empirical evaluation investigating how recommendation a...
Recommender systems learn from historical user-item interactions to identify preferred items for tar...
Most recommender systems are evaluated on how they accurately predict user ratings. However, individ...
Recommender system evaluation usually focuses on the overall effectiveness of the algorithms, either...
Collaborative filtering algorithms have the advantage of not requiring sensitive user or item inform...
Datasets used for the offline evaluation of recommender systems are collected through user interacti...
Recommendation systems are often evaluated based on user’s interactions that were collected from an ...
Popularity is often included in experimental evaluation to provide a reference performance for a rec...
Recommender systems help people find relevant content in a personalized way. One main promise of suc...
In response to the quantity of information available on the Internet, many online service providers ...
International audienceRecommendation systems have been integrated into the majority of large online ...
International audienceRecommendation systems have been integrated into the majority of large online ...
Recently, a few papers report counter-intuitive observations made from experiments on recommender sy...
Recommender systems learn from historical users’ feedback that is often non-uniformly distributed ac...
Abstract. In academic studies, the evaluation of recommender system (RS) algorithms is often limited...
In this paper, we present the results of an empirical evaluation investigating how recommendation a...
Recommender systems learn from historical user-item interactions to identify preferred items for tar...
Most recommender systems are evaluated on how they accurately predict user ratings. However, individ...
Recommender system evaluation usually focuses on the overall effectiveness of the algorithms, either...
Collaborative filtering algorithms have the advantage of not requiring sensitive user or item inform...