Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/149716
Title: Recommendation systems based on extreme multi-label classification
Authors: Chua, Song Ann
Keywords: Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Electrical and electronic engineering
Issue Date: 2021
Publisher: Nanyang Technological University
Source: Chua, S. A. (2021). Recommendation systems based on extreme multi-label classification. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149716
Project: A3044-201
Abstract: This project aims to implement a recommender system using extreme multi-label classification algorithms. In the era of big data, traditional recommender systems are unable to keep up with the scale and size of data available. Extreme multi-label classification can tag a given target with multiple labels that are most relevant to it from an extremely large dataset of labels. This report summarises the design implementation and empirical studies of extreme multi-label classification algorithms for recommendation systems on the MovieLens 1M benchmark dataset. This project studied 2 tree-based extreme multi-label classification algorithms, FastXML and AttentionXML, and implemented them using Python for a movie recommender system. This was to investigate the reformulation of the recommender problem as a multi-label classification task. The dataset was prepared such that each item that can be recommended by the system was treated as a unique label that can be tagged to a user by the classifier. The 2 algorithms were compared based on accuracy as well as computational resources required. The accuracy of AttentionXML was 46.6%, 5% larger than that of FastXML’s accuracy of 41.4%. However, FastXML had a smaller computational requirement than AttentionXML. The memory footprints of AttentionXML’s models were smaller than FastXML’s models. This is because AttentionXML used more computational resources to train a deep model for each layer of its tree, while FastXML used more memory to train a larger tree ensemble to make up for the lower accuracy per tree.
URI: https://hdl.handle.net/10356/149716
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
FYP_Report_Chua_Song_Ann.pdf
  Restricted Access
2.08 MBAdobe PDFView/Open

Page view(s)

119
Updated on May 14, 2022

Download(s)

10
Updated on May 14, 2022

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.