Live Session
Hall 405
20 Sep
 
8:30
SGT
Wednesday Posters
Add Session to Calendar 2023-09-20 08:30 am 2023-09-20 05:45 pm Asia/Singapore Wednesday Posters Wednesday Posters is taking place on the RecSys Hub. Https://recsyshub.org
Main Track

Hessian-aware Quantized Node Embeddings for Recommendation

View on ACM Digital Library

Huiyuan Chen (Visa Research), Kaixiong Zhou (Rice University), Kwei-Herng Lai (Rice University), Chin-Chia Michael Yeh (Visa Research), Yan Zheng (Visa Research), Xia Hu (Rice University) and Hao Yang (Visa Research)

View Paper PDFView Poster
Abstract

Graph Neural Networks (GNNs) have achieved state-of-the-art performance in recommender systems. Nevertheless, the process of searching and ranking from a large item corpus usually requires high latency, which limits the widespread deployment of GNNs in industry-scale applications. To address this issue, many methods quantize user/item representations into the binary embedding space to reduce space requirements and accelerate inference. Also, they use the Straight-through Estimator (STE) to prevent zero gradients during back-propagation. However, the STE often causes gradient mismatch problem, leading to sub-optimal results.

In this work, we present the Hessian-aware Quantized GNN (HQ-GNN) as an effective solution for discrete representations of users/items that enable fast retrieval. HQ-GNN is composed of two components: a GNN encoder for learning continuous node embeddings and a quantized module for compressing full-precision embeddings into low-bit ones. Consequently, HQ-GNN benefits from both lower memory requirements and faster inference speeds compared to vanilla GNNs. To address the gradient mismatch problem in STE, we further consider the quantized errors and its second-order derivatives for better stability. The experimental results on several large-scale datasets show that HQ-GNN achieves a good balance between latency and performance.

Join the Conversation

Head to Slido and select the paper's assigned session to join the live discussion.

Conference Agenda

View Full Agenda →
No items found.