Universiti Teknologi Malaysia Institutional Repository

Improving the accuracy of collaborative filtering recommendations using clustering and association rules mining on implicit data

Najafabadi, M. K. and Mahrin, M. N. and Chuprat, S. and Sarkan, H. M. (2017) Improving the accuracy of collaborative filtering recommendations using clustering and association rules mining on implicit data. Computers in Human Behavior, 67 . pp. 113-128. ISSN 0747-5632

Full text not available from this repository.

Official URL: https://www.scopus.com/inward/record.uri?eid=2-s2....

Abstract

The recommender systems are recently becoming more significant in the age of rapid development of the Internet technology due to their ability in making a decision to users on appropriate choices. Collaborative filtering (CF) is the most successful and most applied technique in the design of recommender systems where items to an active user will be recommended based on the past rating records from like-minded users. Unfortunately, CF may lead to the poor recommendation when user ratings on items are very sparse in comparison with the huge number of users and items in user-item matrix. To overcome this problem, this research applies the users’ implicit interaction records with items to efficiently process massive data by employing association rules mining. It captures the multiple purchases per transaction in association rules, rather than just counting total purchases made. To do this, a modified preprocessing is implemented to discover similar interest patterns among users based on multiple purchases done. In addition, the clustering technique has been employed in our technique to reduce the size of data and dimensionality of the item space as the performance of association rules mining. Then, similarities between items based on their features were computed to make recommendations. The experiments were conducted and the results were compared with basic CF and other extended version of CF techniques including K-Means clustering, hybrid representation, and probabilistic learning by using public dataset, namely, Million Song dataset. The experimental results demonstrated that our technique achieves the better performance when compared to the basic CF and other extended version of CF techniques in terms of Precision, Recall metrics, even when the data is very sparse.

Item Type:Article
Uncontrolled Keywords:Collaborative filtering (CF), rating records
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Advanced Informatics School
ID Code:75957
Deposited By: Widya Wahid
Deposited On:30 May 2018 04:17
Last Modified:30 May 2018 04:17

Repository Staff Only: item control page