Skip to content

Conversation

wkuopt
Copy link
Contributor

@wkuopt wkuopt commented Jan 16, 2021

PULL REQUEST TEMPLATE

  • Fixed Dummify for satisfying sklearn's pipeline structure.
  • Added an unit test for testing the complete pycaret preprocessing pipeline.
  • Added Preprocess_Path_One_Sklearn for testing: This is a clone of Preprocess_Path_One with modification on the pipeline for testing purposes. It will be removed when the pipeline is completely refactored.


# Append classifier to preprocessing pipeline. Now we have a full prediction pipeline.
clf = Pipeline(steps=[('preprocessor', pycaret_preprocessor),
('classifier', xgb.XGBClassifier())],
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we use something from sklearn instead? Ideally, we would have our own mock classes for all of this, though (though IIRC sklearn provides something like that, doesn't it?)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You mean like using a classifier from sklearn instead of XGBoost? Yes I can do that. I think as long as the implementation supports the sklearn interface, we can use them in a sklearn pipeline.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, I feel that using something like RandomForest will be a better idea, since XGboost may lag behind any API changes

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the comment! I have changed XGBoost to random forest.

@Yard1 Yard1 merged commit 298c401 into pycaret:preprocessing-refactoring Jan 17, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants