
Removal of previously liked items #131

Closed
srcolinas opened this issue Jun 28, 2018 · 8 comments

Comments

@srcolinas

srcolinas commented Jun 28, 2018

Hi,
It would be nice to have a way to retrieve recommendations without ignoring previously liked items. I know in the end I would ignore those items, but it would be convenient to compare predictions with other libraries and evaluate performance on some metrics I already have working with those libraries.

My idea so far: use rank_items on the whole list of items and then keep the highest-ranked ones. It feels very inefficient though.
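Roughly what I have in mind, as a sketch (I'm assuming rank_items takes the user id, the user_items matrix and the list of item ids to rank; exact signatures may differ between versions):

```python
import numpy as np
import scipy.sparse as sp
import implicit

# toy item-user interaction matrix (items x users), as fit() expects here
item_user = sp.random(1000, 50, density=0.05, format="csr")

model = implicit.als.AlternatingLeastSquares(factors=32)
model.fit(item_user)

user_items = item_user.T.tocsr()            # users x items, for recommend()/rank_items()
all_items = np.arange(user_items.shape[1])  # rank every item, liked or not

ranked = model.rank_items(0, user_items, all_items)  # (itemid, score) pairs, best first
top_n = ranked[:10]
```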

By the way...great work! Thank you.

@Vslira

Vslira commented Jun 29, 2018

If I understand what you're asking, you can just call model.recommend(userid, Z) where Z is an empty sparse matrix with the same shape as user_items.
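Something like this, for example (a sketch; assuming the recommend(userid, user_items, N=...) signature, with an all-zero csr_matrix standing in for the "empty" matrix):

```python
import scipy.sparse as sp
import implicit

item_user = sp.random(1000, 50, density=0.05, format="csr")  # toy items x users data
model = implicit.als.AlternatingLeastSquares(factors=32)
model.fit(item_user)

user_items = item_user.T.tocsr()     # users x items
Z = sp.csr_matrix(user_items.shape)  # all zeros, same shape as user_items

# with no liked items to filter out, nothing gets removed from the results
recs = model.recommend(0, Z, N=10)
```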

@srcolinas
Author

srcolinas commented Jun 29, 2018

Do you mean a sparse matrix full of zeros? If I do that, I get a score of nan.

@ita9naiwa
Collaborator

@Vslira Passing an empty matrix as the user-item matrix only works for the ALS and BPR models.

@DollarAkshay

DollarAkshay commented Jul 13, 2018

I agree with @srcolinas on this issue. I am trying to implement the evaluation method from the paper (Hu, Koren, Volinsky), so getting all the items in sorted order, including the liked ones, would be very helpful.

Update

I just tried rank_items for a lot of users and it takes a really long time: about 8 seconds to evaluate 100 users (44,699 items), and I have 138k users. So rank_items seems to be really slow. Is there a better way to implement the evaluation method from the paper?
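One thing that might help is scoring users in batches directly from the learned factors instead of calling rank_items once per user. A rough, unbenchmarked sketch (assuming the model exposes user_factors and item_factors as users x factors and items x factors arrays, which is how the ALS model stores them):

```python
import numpy as np

# model: a fitted implicit.als.AlternatingLeastSquares
K = 10
batch = np.arange(100)                                      # first 100 user ids
scores = model.user_factors[batch] @ model.item_factors.T   # shape (100, n_items)

# unordered indices of the K highest scores per user...
top_k = np.argpartition(-scores, K, axis=1)[:, :K]
# ...then sort those K by score, descending
rows = np.arange(len(batch))[:, None]
top_k = top_k[rows, np.argsort(-scores[rows, top_k], axis=1)]
```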

@ita9naiwa
Collaborator

I thought this could be implemented easily, so I created a PR for it (#140).
Please check it out.

@benfred
Owner

benfred commented Jul 16, 2018

I don't think including liked items from the train set when evaluating is a good idea.

The problem here is that if you leave these items in, almost all the returned results will be liked items from the train set - and these will push down the liked items from the test set. This leads to erroneous conclusions: I've seen cases where 90+% of the top 100 results returned by the ALS model are liked items from the train set. This artificially lowers the score of the model and leads to false conclusions about which model is performing better.

@DollarAkshay - evaluation of these models will probably take longer than fitting. In your case it has to score and sort every item for every user.

I would focus on something like P@K or MAP@K instead. The ranking of items far down the list doesn't actually matter that much to the user: whether an item is at position 1K or 10K, it is very doubtful that the user will ever see it. There is also some evidence that metrics which measure early precision tend to lead to better user satisfaction (though I can't find the link at the moment). The nice thing about using P@K is that you can then use one of the approximate MF models to speed things up.
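Something along these lines, for example (a sketch using the evaluation helpers in the library; function names, parameters and matrix orientations may vary between versions):

```python
import scipy.sparse as sp
import implicit
from implicit.evaluation import train_test_split, precision_at_k, mean_average_precision_at_k

user_items = sp.random(500, 1000, density=0.05, format="csr")  # toy users x items data
train, test = train_test_split(user_items, train_percentage=0.8)

model = implicit.als.AlternatingLeastSquares(factors=32)
model.fit(train.T.tocsr())  # fit on items x users, if that's what your version expects

p10 = precision_at_k(model, train, test, K=10)
map10 = mean_average_precision_at_k(model, train, test, K=10)
```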

@srcolinas
Author

srcolinas commented Jul 16, 2018

@benfred I still think we should be able to choose whether or not to remove previously liked items. In particular, the fact that a user liked an item does not mean they have bought it, or that they would not buy it again, so an automated system may still want to recommend it. Moreover, the library is better if users are free to choose whichever evaluation approach they consider most appropriate. PR #140 is a step towards solving this.

@benfred
Owner

benfred commented Jul 25, 2018

Thanks everyone, I've merged #140 - so with the version on master you should be able to do this now.
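On master that looks roughly like this (sketch; the flag name shown, filter_already_liked_items, is the one later releases expose and may differ):

```python
# model: a fitted implicit model; user_items: users x items csr_matrix
recs = model.recommend(userid, user_items, N=10,
                       filter_already_liked_items=False)  # keep liked items in the results
```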

benfred closed this as completed Jul 25, 2018