
Conversation

hcho3 (Collaborator) commented Nov 9, 2018

See discussion at #3884. The num_feature variable uses a 32-bit integer, which causes XGBoost to fail on datasets with more than 2147483647 features. For now, print a better error message.
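
A minimal sketch of the kind of guard this change adds, assuming a hypothetical ValidateNumFeature helper (the actual check lives in XGBoost's learner code and uses its CHECK/LOG macros, not an exception):

```cpp
#include <cstdint>
#include <limits>
#include <stdexcept>
#include <string>

// Hypothetical guard: reject feature counts that overflow the 32-bit
// num_feature field instead of failing obscurely later. Names here are
// illustrative, not XGBoost's actual API.
void ValidateNumFeature(uint64_t num_col) {
  const uint64_t kMax =
      static_cast<uint64_t>(std::numeric_limits<int32_t>::max());  // 2147483647
  if (num_col > kMax) {
    throw std::invalid_argument(
        "Unfortunately, XGBoost does not support data matrices with more than "
        + std::to_string(kMax) + " features; got " + std::to_string(num_col));
  }
}
```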

hcho3 requested a review from trivialfis on November 9, 2018 03:55
hcho3 (Collaborator, Author) commented Nov 9, 2018

@trivialfis FYI

trivialfis (Member)

@hcho3 What's currently preventing the use of int64_t?

hcho3 (Collaborator, Author) commented Nov 9, 2018

@trivialfis XGBoost saves the trained model as a raw binary dump, so you can't easily add or change fields without breaking backward compatibility. As an aside, someone told me today that the binary dump is also causing endianness problems: a model created on a little-endian machine doesn't load on a big-endian machine. We may want to consider a next-generation model format.
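
A minimal illustration of the endianness problem with raw binary dumps (hypothetical, not XGBoost code): writing a field's bytes directly bakes the writer's byte order into the file.

```cpp
#include <cstdint>
#include <cstdio>

int main() {
  // Writing a field as raw bytes stores it in the host byte order.
  uint32_t num_feature = 123;  // 0x0000007B
  std::FILE* fp = std::fopen("model.bin", "wb");
  std::fwrite(&num_feature, sizeof(num_feature), 1, fp);
  std::fclose(fp);

  // On a little-endian writer, the file holds bytes 7B 00 00 00. A
  // big-endian reader doing the same fread would reconstruct them as
  // 0x7B000000 = 2063597568 -- a corrupted model field.
  uint32_t raw = 0;
  fp = std::fopen("model.bin", "rb");
  std::fread(&raw, sizeof(raw), 1, fp);
  std::fclose(fp);
  std::printf("read back: %u\n", raw);  // 123 only on same-endian hardware
  return 0;
}
```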

trivialfis (Member) left a review comment

LGTM.

hcho3 merged commit e38d5a6 into dmlc:master on Nov 9, 2018
hcho3 deleted the too_large_dim branch on November 9, 2018 08:32
khotilov (Member) commented Nov 9, 2018

@trivialfis Since dense structures are currently used in many places to index features, using such large indices would consume many gigabytes of memory.
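
To put a rough number on that (my arithmetic, not a figure from the thread): any structure allocated densely per feature blows up as soon as the feature count passes the 32-bit limit.

```cpp
#include <cstdint>
#include <cstdio>

int main() {
  // Back-of-the-envelope cost of ONE dense per-feature array once the
  // feature count exceeds INT32_MAX. (Illustrative, not XGBoost internals.)
  const uint64_t kNumFeatures = 1ULL << 31;        // just past 2147483647
  const uint64_t kBytesPerEntry = sizeof(double);  // e.g. one weight per feature
  const double gib = static_cast<double>(kNumFeatures * kBytesPerEntry)
                     / (1024.0 * 1024.0 * 1024.0);
  std::printf("%.0f GiB per dense per-feature array\n", gib);  // prints 16 GiB
  return 0;
}
```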

lock bot locked the conversation as resolved and limited it to collaborators on Feb 7, 2019