Toward the Next Generation of Recommender Systems:

A Survey of the State-of-the-Art and Possible Extensions


Gediminas Adomavicius and Alexander Tuzhilin


IEEE Transactions on Knowledge and Data Engineering

June, 2005


http://ids.csom.umn.edu/faculty/gedas/papers/recommender-systems-survey-2005.pdf





Abstract

In this paper,

  • Firstly, present a comprehensive survey of the state-of-the-art in recommender systems.
  • Describe various ways to extend the capabilities of recommender systems.



Content-Based Methods

Recommend items similar to the ones the user preferred in the past.

  • Memory-Based (heuristic)
    - Cosine similarity, Pearson correlation
  • Model-Based
    - Bayesian classifiers, and various machine learning techniques, including clustering, decision trees, and artificial neural networks


Weak Point
  1. Automatic feature extraction from data can be too hard or unavailable due to the variety of data types, e.g., text, multimedia(graphical images, audio streams, video streams), etc.
  2. Two different items can be represented by the same set of features. Then, they are indistinguishable.
  3. When the recommendation systems can only recommend items that score highly against a user's profile, the user is limited to being recommended items that are similar to those already rated. (Overspecialization)
  4. A new user, having very few ratings, would not be able to get accurate recommendations (New User Problem)


Collaborative Methods

Recommend items that people with similar tastes and preferences liked in the past.

Collaborative recommendations can be grouped into two general classes:

  • Memory-Based (heuristic)
    - Cosine similarity, Pearson correlation coefficient, Intersection, Mean squared difference
  • Model-Based
    - Bayesian model, Probabilistic relational model, Linear regression, Maximum entropy model, Probabilistic Latent Semantic Analysis, etc


Weak Point
  1. The recommender system must first learn the user's preferences from the ratings that the user gives. (New User Problem)
  2. Until the new item is rated by a substantial number of users, the recommender system would not be able to recommend it. (New Item Problem)
  3. The number of ratings already obtained is usually very small compared to the number of ratings that need to be predicted. (Sparsity)



Hybrid Approaches

Methods which combines content-based and collaborative methods.

  • Combining Separate Recommenders
  • Adding Content-Based Characteristics to Collaborative Models
  • Adding Collaborative Characteristics to Content-Based Models
  • Developing a Single Unifying Recommendation Model



Extending Capabilities of Recommender Systems
  • Comprehensive Understanding of Users and Items
  • Extensions for Model-Based Recommendation Techniques
  • Multidimensionality of Recommendations
  • Multcriteria Ratings
  • Nonintrusiveness
  • Flexibility
  • Effectiveness of Recommendations
  • Other Extensions
    - Explainability
    - Trustworthiness
    - Scalability
    - Privacy



References