
How to evaluate the recommender (e.g. P@k) with Implicit #30

Closed
kyowill opened this issue May 16, 2017 · 15 comments

Comments

@kyowill

kyowill commented May 16, 2017

No description provided.

@benfred
Owner

benfred commented May 17, 2017

Unfortunately, there isn't currently any support for evaluating models built into this library. It shouldn't be too hard to add - and is something I'm looking at doing.

@Akarshit

@benfred Hey, I am planning to use this library in production, but before that I want to evaluate the performance of the algorithm. I am using ALS for recommendations.
Could you point me in the right direction for testing this?

@vhfmag

vhfmag commented Jun 22, 2017

How should the evaluation go? I'm also planning to use the lib in production, but it would be great to have a metric of its accuracy before that. Is help needed?

@snexus

snexus commented Jun 23, 2017

Hey @benfred, support for evaluation would be great in order to use this library in production.

Meanwhile, I was thinking about the following evaluation procedure:

  1. Split the data into train and test sets.
  2. Fit the model on the train set.
  3. For every entry in the test set, randomly hide one item.
  4. Use the model to provide recommendations for every entry in the test set, with the hidden item excluded.
  5. If the hidden item is in the top N recommended items, count a true positive.
  6. Calculate some metric, e.g. recall.
  7. Bootstrap steps 3-6 many times to estimate the distribution of the metric of interest.

Do you think this is a feasible approach, or could some simpler evaluation be implemented?
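
A rough sketch of the leave-one-out recall evaluation described above, assuming a scipy.sparse user-item CSR matrix and the implicit >= 0.5 API (where fit() and recommend() take user-item matrices); the function name and defaults are illustrative, not part of the library:

```python
import numpy as np
from implicit.als import AlternatingLeastSquares

def leave_one_out_recall(user_items, factors=64, N=10, seed=42):
    """Hide one item per user, refit, and measure how often it comes back in the top N."""
    rng = np.random.default_rng(seed)
    user_items = user_items.tocsr()
    train = user_items.tolil(copy=True)
    held_out = {}
    for u in range(user_items.shape[0]):
        items = user_items[u].indices
        if len(items) < 2:            # keep at least one item for training
            continue
        hidden = int(rng.choice(items))
        train[u, hidden] = 0
        held_out[u] = hidden
    train = train.tocsr()
    train.eliminate_zeros()

    model = AlternatingLeastSquares(factors=factors)
    model.fit(train)

    hits = 0
    for u, hidden in held_out.items():
        # recommend() filters items already seen in training by default
        ids, _ = model.recommend(u, train[u], N=N)
        hits += int(hidden in ids)
    return hits / len(held_out)
```

Bootstrapping, as in step 7, would amount to repeating this hide-and-score loop with different seeds and looking at the spread of the returned values.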

@Akarshit

@snexus I did the same to evaluate the system.

@snexus

snexus commented Jun 28, 2017

FYI - https://jessesw.com/Rec-System/ contains a good approach for validation. It is easy to adapt to your needs.

@jbochi
Contributor

jbochi commented Jul 8, 2017

I've managed to do grid search and cross-validation using scikit-learn, even though it does not have built-in support for recommenders: scikit-learn/scikit-learn#6142

I had to create a few custom classes:

  • ALSEstimator, which wraps AlternatingLeastSquares and turns it into a scikit-learn Estimator.
  • A cross-validation splitter wrapping PredefinedSplit that holds out p items for each user in every split.
  • A custom scorer that calculates NDCG.

The code is in this gist: https://gist.github.com/jbochi/2e8ddcc5939e70e5368326aa034a144e#file-evaluation-ipynb

Do you guys have any suggestions to improve it?

Would it make sense to add some of this to scikit-learn?
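
For illustration, the first wrapper looks roughly like this (the real code is in the gist above; the class body here is an approximation, not the gist's code). Exposing the hyperparameters in __init__ is what lets GridSearchCV clone and tune the estimator:

```python
from sklearn.base import BaseEstimator
from implicit.als import AlternatingLeastSquares

class ALSEstimator(BaseEstimator):
    """Expose AlternatingLeastSquares hyperparameters so scikit-learn can clone and tune it."""

    def __init__(self, factors=64, regularization=0.01, iterations=15):
        self.factors = factors
        self.regularization = regularization
        self.iterations = iterations

    def fit(self, X, y=None):
        # X is a sparse user-item interaction matrix (implicit >= 0.5 convention)
        self.model_ = AlternatingLeastSquares(
            factors=self.factors,
            regularization=self.regularization,
            iterations=self.iterations,
        )
        self.model_.fit(X)
        return self

    def predict(self, X):
        # Dense score for every user-item pair; a custom NDCG scorer can rank these per user
        return self.model_.user_factors @ self.model_.item_factors.T
```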

@antonioalegria

@jbochi your code is great, but it doesn't seem to work with very sparse datasets where the train and test sets can end up with different users/items (it throws index-out-of-bounds errors). Any ideas here?

@benfred do you expect support for this kind of evaluation soon?

@benfred
Owner

benfred commented Jun 14, 2018

@antonioalegria I've added some basic support for map@k and p@k that you can use in the latest version - there is an example of how to call it here: #108 (comment)

I'm leaving this issue open until I get around to writing some documentation on this =)
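
For reference, calling these metrics looks roughly like this with the current API (implicit >= 0.5, where the matrices are user-item; user_items below stands in for a scipy.sparse interaction matrix):

```python
from implicit.als import AlternatingLeastSquares
from implicit.evaluation import (
    train_test_split,
    precision_at_k,
    mean_average_precision_at_k,
)

# Randomly split the interactions; the returned matrices keep the input's shape
train, test = train_test_split(user_items, train_percentage=0.8)

model = AlternatingLeastSquares(factors=64)
model.fit(train)

print("p@10:  ", precision_at_k(model, train, test, K=10))
print("map@10:", mean_average_precision_at_k(model, train, test, K=10))
```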

@antonioalegria

Thanks @benfred. Does the train/test split deal well with the two sets ending up with different users and items?

@benfred
Owner

benfred commented Jun 19, 2018

@antonioalegria the train_test_split function should handle that (the returned matrices should have the same dimensions as the input - so there shouldn't be any out of bounds errors).
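
In other words, assuming the same user_items matrix as in the example above, the split preserves the full dimensions:

```python
train, test = train_test_split(user_items, train_percentage=0.8)
assert train.shape == test.shape == user_items.shape  # no users or items are dropped
```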

@oliviernguyenquoc

Any plan for a Recall@k metric?

It shouldn't be a lot of work, but I can't understand Cython myself :(

@benfred
Owner

benfred commented Sep 1, 2018

I wasn't planning on adding a recall@k metric - but it shouldn't be difficult I guess (I think it's just replacing this line https://github.com/benfred/implicit/blob/master/implicit/evaluation.pyx#L114 with total += likes.size() ?).
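
A pure-Python sketch of what that change amounts to (assuming the implicit >= 0.5 recommend() signature; this only illustrates the denominator, it is not the library's Cython implementation):

```python
def recall_at_k(model, train_user_items, test_user_items, K=10):
    hits, total = 0, 0
    train = train_user_items.tocsr()
    test = test_user_items.tocsr()
    for u in range(test.shape[0]):
        likes = set(test[u].indices)      # held-out items for this user
        if not likes:
            continue
        ids, _ = model.recommend(u, train[u], N=K)
        hits += len(likes.intersection(ids))
        # p@k divides by min(K, len(likes)); recall@k divides by all held-out items
        total += len(likes)
    return hits / total
```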

@oliviernguyenquoc

@benfred If I understand correctly, that should be it.
If you then divide the results by the size of the test set, it works (but again, Cython is difficult for me to read).

A quick win ;)

Thanks for everything. This library rocks.

thisisjl added a commit to thisisjl/implicit that referenced this issue Feb 15, 2019
@yvonnerahnfeld

> @antonioalegria I've added some basic support for map@k and p@k that you can use in the latest version - there is an example of how to call it here: #108 (comment)
>
> I'm leaving this issue open until I get around to writing some documentation on this =)

Hi, whenever I try the example in #108 I get this error:
index 3953 is out of bounds for axis 0 with size 3953

Is there possibly an error in the example code?
And is there documentation available?
