How to do incremental training? #31

tfzxyinhao · 2017-05-26T14:27:27Z

generator new data and new user every day,how to do incremental training

leodesigner · 2017-05-26T15:37:01Z

You have to maintain your own item/user matrix and manage new/expired users or items.
Then run matrix factorization periodically.

tfzxyinhao · 2017-05-27T02:25:07Z

@leodesigner
thanks for your answer
matrix factorization once again waste time
your mean that can't reuse the last time result of matrix factorization

leodesigner · 2017-05-27T07:36:35Z

You can actually reuse last results as an initialization - this is better than starting from random initialization. In this case only new item/users should be initialized randomly or as an average of similar items/users.
In my case my input matrix is about 10000x10000 and one iteration of the MF algorithm takes about 0.0125s on my server. (I am reusing last results).

tfzxyinhao · 2017-05-27T09:53:54Z

@leodesigner
are you use implicit to do real-time recommend? or it's can do it

leodesigner · 2017-05-27T10:57:59Z

I am using implicit to do calculations once a 10-60 minutes. However the results are used in realtime web app (client browser side item sorting based on cousine distance).

benfred · 2017-05-28T19:09:54Z

Like @leodesigner was saying - this isn't supported in this library right now, but you can build this on top of implicit with some effort.

Adding support for incremental training would be a good feature for this library.

jbochi · 2017-06-23T11:17:23Z

If you update the user_items matrix, you can now recalculate a user factor and get updated recommendations by running model.recommend(userid, user_items, recalculate_user=True).

You can also get recommendations for new users by passing a column vector as user_items and userid=0.

It's still not incremental training because item factors are not recalculated, but maybe it's helpful.

marcusklaas · 2018-03-29T14:31:38Z

Setting recalculate_user=True works, but seems to be quite slow, since it does a complete matrix inversion. In my tests, over 99% of the time would be spent on this operation.

One could speed this up by the conjugate gradient method, detailed here: https://www.benfrederickson.com/fast-implicit-matrix-factorization/.

The relevant part of that page would give something like this.

def factor_user_cg(Cui, X, Y, regularization, cg_steps=3):
	users, factors = X.shape
	# we could cache this
	YtY = Y.T.dot(Y) + regularization * np.eye(factors)

	# random start
	x = np.random.rand(factors) * 0.01

	# calculate residual r = (YtCuPu - (YtCuY.dot(Xu), without computing YtCuY
	r = -YtY.dot(x)
	for i, confidence in nonzeros(Cui, u):
		r += (confidence - (confidence - 1) * Y[i].dot(x)) * Y[i]

	p = r.copy()
	rsold = r.dot(r)

	for _ in range(cg_steps):
		# calculate Ap = YtCuYp - without actually calculating YtCuY
		Ap = YtY.dot(p)
		for i, confidence in nonzeros(Cui, u):
		Ap += (confidence - 1) * Y[i].dot(p) * Y[i]

		# standard CG update
		alpha = rsold / p.dot(Ap)
		x += alpha * p
		r -= alpha * Ap
		rsnew = r.dot(r)
		p = r + (rsnew / rsold) * p
		rsold = rsnew

	return x

benfred added the enhancement label May 28, 2017

MariosGr mentioned this issue Mar 17, 2018

model fit crushes with more than 2^31 positive interactions #86

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to do incremental training? #31

How to do incremental training? #31

tfzxyinhao commented May 26, 2017

leodesigner commented May 26, 2017

tfzxyinhao commented May 27, 2017

leodesigner commented May 27, 2017 •

edited

Loading

tfzxyinhao commented May 27, 2017

leodesigner commented May 27, 2017

benfred commented May 28, 2017

jbochi commented Jun 23, 2017

marcusklaas commented Mar 29, 2018 •

edited

Loading

How to do incremental training? #31

How to do incremental training? #31

Comments

tfzxyinhao commented May 26, 2017

leodesigner commented May 26, 2017

tfzxyinhao commented May 27, 2017

leodesigner commented May 27, 2017 • edited Loading

tfzxyinhao commented May 27, 2017

leodesigner commented May 27, 2017

benfred commented May 28, 2017

jbochi commented Jun 23, 2017

marcusklaas commented Mar 29, 2018 • edited Loading

leodesigner commented May 27, 2017 •

edited

Loading

marcusklaas commented Mar 29, 2018 •

edited

Loading