-
Notifications
You must be signed in to change notification settings - Fork 117
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
test #99
test #99
Conversation
oops meant for forked repo |
Hi Villu! I was going to ask you about this but just trying out a hack for now. We're using scikit xgboost on google's cloud ml-engine (with a single machine) with a ton of data, and we're running into memory problems (since ml-engine right now doesn't offer a super high-mem machine). This was one way to mitigate, and it seems to work in my test. In general at CK (think you visited recently :)) we have a framework for unifying tensorflow and scikit and want to see if we can use float16 for all cases without sacrificing accuracy. Don't think it's urgent or serious, but more of a cost-minimization step. |
By switching from Have you figured out if the XGBoost algorithm is also operating on If XGBoost is treating |
So I'm looking at It seems like when storing label data, xgboost is justing taking whatever it's been passed and not touching it again afterwards- https://github.com/dmlc/xgboost/blob/74009afcacc8ac567b5f00d6f82736189490cb47/python-package/xgboost/core.py#L422 So the solution for me is simply changing our code from Wonder if I should make a comment to the xgboost team... Anyways, thanks for the insightful comment! |
But isn't it the case that when you convert your That is, XGBoost is unable to ingest
The label data type depends on your modeling problem. When dealing with regression-type problems, then it needs to be a floating-point data type (in most cases Anyway, I think that the JPMML-SkLearn library should be able to recognize the half-precision floating point data type. Just opened an issue about it. Please don't delete this PR, because it's a good reference + discussion for future generations. |
No description provided.