Live Session
Hall 406 D
Paper
22 Sep, 16:05 SGT
Women in RecSys Journal Paper of the Year Awards
Session time: 2023-09-22, 04:05 pm to 05:25 pm (Asia/Singapore). Hosted on the RecSys Hub: https://recsyshub.org
Women in RecSys

A Framework and Toolkit for Testing the Correctness of Recommendation Algorithms

Lien Michiels, Robin Verachtert, Andres Ferraro, Kim Falk and Bart Goethals

Abstract

Evaluating recommender systems adequately and thoroughly is an important task, and significant effort goes into proposing metrics, methods and protocols for doing so. However, there has been little discussion of testing in the recommender systems literature. In this work, we adopt and adapt concepts from the software testing domain, e.g., code coverage, metamorphic testing and property-based testing, to help researchers detect and correct faults in recommendation algorithms. We propose a test suite that can be used to validate the correctness of a recommendation algorithm, and thus identify and correct issues that can affect its performance and behavior. Our test suite contains both black-box and white-box tests at every level of abstraction, i.e., system, integration and unit. To facilitate adoption, we release RecPack Tests, an open-source Python package containing template test implementations. We use it to test four popular Python packages for recommender systems: RecPack, PyLensKit, Surprise and Cornac. Despite the high test coverage of each of these packages, we are still able to uncover undocumented functional requirements and even some bugs. This supports our thesis that testing the correctness of recommendation algorithms can complement traditional methods for evaluating them.
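
The metamorphic and property-based testing ideas named in the abstract can be illustrated with a minimal sketch. It does not use the RecPack Tests API or any of the four packages; the toy popularity recommender and every helper name here (`fit_popularity`, `recommend`, `check_permutation_invariance`, `check_no_seen_items`) are hypothetical and exist only for illustration.

```python
import random


def fit_popularity(interactions):
    """Count how often each item occurs in a list of (user, item) pairs."""
    counts = {}
    for _user, item in interactions:
        counts[item] = counts.get(item, 0) + 1
    return counts


def recommend(counts, seen, k):
    """Return up to k of the most popular items the user has not seen yet."""
    candidates = [item for item in counts if item not in seen]
    # Sort by descending count; break ties by item id so the output is deterministic.
    candidates.sort(key=lambda item: (-counts[item], item))
    return candidates[:k]


def check_permutation_invariance(interactions, seen, k, seed=0):
    """Metamorphic relation: shuffling the training rows must not change the output."""
    shuffled = list(interactions)
    random.Random(seed).shuffle(shuffled)
    original = recommend(fit_popularity(interactions), seen, k)
    permuted = recommend(fit_popularity(shuffled), seen, k)
    return original == permuted


def check_no_seen_items(interactions, seen, k):
    """Property: at most k items are returned and none is in the user's history."""
    recs = recommend(fit_popularity(interactions), seen, k)
    return len(recs) <= k and not set(recs) & set(seen)


# Hypothetical toy data: (user, item) interaction pairs.
interactions = [
    ("u1", "a"), ("u1", "b"), ("u2", "a"),
    ("u3", "c"), ("u3", "a"), ("u2", "b"),
]
```

Neither check needs a ground-truth ranking: each one only asserts a relation that any correct implementation should satisfy, which is what makes this style of test applicable when the "right" recommendation is unknown.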
