Recsys Hub

← Back to Library

Live Session

Hall 406 D

Paper

21 Sep

16:05

SGT

Session 12: Evaluation

Add Session to Calendar 2023-09-21 04:05 pm 2023-09-21 05:25 pm Asia/Singapore Session 12: Evaluation Session 12: Evaluation is taking place on the RecSys Hub. Https://recsyshub.org

Research

What We Evaluate When We Evaluate Recommender Systems: Understanding Recommender Systems’ Performance using Item Response Theory

View on ACM Digital Library

Yang Liu (University of Helsinki), Alan Medlar (University of Helsinki) and Dorota Glowacka (University of Helsinki)

View Paper PDF View Poster

Abstract

Current practices in offline evaluation use rank-based metrics to measure the quality of recommendation lists. This approach has practical benefits as it centers assessment on the output of the recommender system and, therefore, measures performance from the perspective of end-users. However, this methodology neglects how recommender systems more broadly model user preferences, which is not captured by only considering the top-n recommendations. In this article, we use item response theory (IRT), a family of latent variable models used in psychometric assessment, to gain a comprehensive understanding of offline evaluation. We used IRT to jointly estimate the latent abilities of 51 recommendation algorithms and the characteristics of 3 commonly used benchmark data sets. For all data sets, the latent abilities estimated by IRT suggest that higher scores from traditional rank-based metrics do not reflect improvements in modeling user preferences. Furthermore, we show the top-n recommendations with the most discriminatory power are biased towards lower difficulty items, leaving much room for improvement. Lastly, we highlight the role of popularity in evaluation by investigating how user engagement and item popularity influence recommendation difficulty.

Join the Conversation

Head to Slido and select the paper's assigned session to join the live discussion.

Conference Agenda

View Full Agenda →

8:00

SGT

Conference Registration and Badge Pick Up

9:00

SGT

DLP: International Workshop on Deep Learning Practice for High-Dimensional Sparse Data

9:00

SGT

Doctoral Symposium (By Invitation Only)

9:00

SGT

FAccTRec: The 6th Workshop on Responsible Recommendation

9:00

SGT

INRA: International Workshop on News Recommendation and Analytic

9:00

SGT

IntRS’23: 10th Joint Workshop on Interfaces and Human Decision Making for Recommender Systems

9:00

SGT

Tutorial: On Challenges of Evaluating Recommender Systems in Offline Setting

10:30

SGT

Monday AM Coffee Break

11:15

SGT

Tutorial: Customer Lifetime Value Prediction: Towards the Paradigm Shift of Recommender System Objectives

12:35

SGT

Lunch Break (on own)

14:00

SGT

CARS: Workshop on Context-Aware Recommender Systems

14:00

SGT

Tutorial: Recommenders in the Wild / Practical Evaluation Methods

14:00

SGT

fashionXrecsys: Recommender Systems in Fashion & Retail

15:20

SGT

Monday PM Coffee Break

8:00

SGT

Conference Registration and Badge Pick Up

9:00

SGT

BehavRec: Workshop on Recommendations for Behavior Change

9:00

SGT

CONSEQUENCES: The 2nd Workshop on Causality, Counterfactuals and Sequential Decision-Making for Recommender systems

9:00

SGT

MuRS: Music Recommender Systems Workshop

9:00

SGT

NORMalize: The 1st Workshop on Normative Design and Evaluation of Recommender Systems

9:00

SGT

ORSUM: 6th Workshop on Online Recommender Systems and User Modeling

9:00

SGT

PERSPECTIVES: 3rd Workshop: Perspectives on the Evaluation of Recommender Systems

9:00

SGT

RecSys Challenge

9:00

SGT

RecSys in HR: 3rd Workshop on Recommender Systems for Human Resources

9:00

SGT

Tutorial: Trustworthy Recommender Systems: Technical, Ethical, Legal, and Regulatory Perspectives

10:30

SGT

Tuesday AM Coffee Break

14:00

SGT

KaRS: Fifth Knowledge-aware and Conversational Recommender Systems Workshop

14:00

SGT

LERI: Workshop on Learning and Evaluating Recommendations with Impressions

14:00

SGT

RecTour: Workshop on Recommenders in Tourism 2023

14:00

SGT

Tutorial: On Large Language Models for Recommendation

14:00

SGT

VideoRecSys: First Workshop on Large-Scale Video Recommender Systems

15:20

SGT

Tuesday PM Coffee Break

16:05

SGT

Tutorial: User Behavior Modeling with Deep Learning for Recommendation: Recent Advances

18:00

SGT

RecSys Welcome Reception

8:00

SGT

Conference Registration and Badge Pick Up

8:30

SGT

Sponsor Meet-Up: Amazon

8:30

SGT

Wednesday Posters

9:00

SGT

Welcome and Keynote: "From Documents to Dialogues: How LLMs are Shaping the Future of Work"

10:30

SGT

Sponsor Meet-Up: Coolita

10:30

SGT

Wednesday AM Coffee and Posters

11:15

SGT

Session 1: Collaborative filtering 1

11:15

SGT

Session 2: Click-Through Rate prediction

12:35

SGT

Wednesday Lunch (on own) and Posters

14:00

SGT

Session 3: Applications

14:00

SGT

Session 4: Trustworthy Recommendation

15:20

SGT

Sponsor Meet-Up: Huawei

15:20

SGT

Wednesday PM Coffee and Posters

16:05

SGT

Session 5: Sequential Recommendation 1

16:05

SGT

Session 6: Graphs

8:00

SGT

Conference Registration and Badge Pick Up

8:30

SGT

Thursday Posters

8:30

SGT

Women in RecSys Breakfast

8:45

SGT

Sponsor Meet-Up: Google

9:30

SGT

Keynote: "Towards Generative Search and Recommendation"

10:30

SGT

Sponsor Meet-Up: Amazon

10:30

SGT

Thursday AM Coffee and Posters

11:15

SGT

Session 7: Interactive Recommendation 1

11:15

SGT

Session 8: Knowledge and Context

12:35

SGT

Thursday Lunch (on own) and Posters

14:00

SGT

Session 10: Reinforcement Learning

14:00

SGT

Session 9: Collaborative filtering 2

15:20

SGT

Sponsor Meet-Up: Netflix

15:20

SGT

Thursday PM Coffee and Posters

16:05

SGT

Session 11: Sequential Recommendation 2

16:05

SGT

Session 12: Evaluation

18:30

SGT

RecSys Banquet at Marina Bay Sands

8:30

SGT

Conference Registration and Badge Pick Up

8:30

SGT

Friday Posters

9:30

SGT

Keynote: "Recommendation systems: Challenges and solutions"

10:30

SGT

Friday AM Coffee and Posters

10:30

SGT

Sponsor Meet-Up: Netflix

10:30

SGT

Sponsor Meet-Up: TikTok

11:15

SGT

Session 13: Side Information, Items structure and Relations

11:15

SGT

Session 14: Multi-task Recommendation

12:35

SGT

Friday Lunch (on own) and Posters

14:00

SGT

Session 15: Cross-domain Recommendation

14:00

SGT

Session 16: Multimedia Recommendation

15:20

SGT

Friday PM Coffee and Posters

16:05

SGT

Session 17: Interactive Recommendation 2

16:05

SGT

Women in RecSys Journal Paper of the Year Awards

17:25

SGT

Closing Remarks and 2024 Reveal

No items found.