Skip to content

Conversation

Innixma
Copy link
Contributor

@Innixma Innixma commented Apr 25, 2025

Issue #, if available:

Resolves #4930

Description of changes:

  • Improve CatBoost and XGBoost memory estimates to make out-of-memory errors rarer.
  • CatBoost previously underestimated the histogram size with depth<7
  • XGBoost previously underestimated the memory size for multiclass tasks and in some cases could go OOM at the end of fit during bagging if the model artifacts are large. Now we early stop due to memory earlier and account for the eventual model size in estimating memory.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Copy link
Contributor

@prateekdesai04 prateekdesai04 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

Copy link

Job PR-5090-c893128 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-5090/c893128/index.html

@Innixma Innixma merged commit d5da8d8 into autogluon:master Apr 25, 2025
25 checks passed
Copy link

Job PR-5090-312ce7d is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-5090/312ce7d/index.html

FireballDWF pushed a commit to FireballDWF/autogluon that referenced this pull request Apr 26, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

[BUG][tabular] Memory Miscalculation in AutoGluon When Using CatBoost with Ray Parallelism
3 participants