Skip to content

[BUG]: 'pandas_profiling' is now 'ydata-profiling'. ModuleNotFoundError: 'pandas_profiling' is a soft dependency and not included in the pycaret installation. #3645

@beingmechon

Description

@beingmechon

pycaret version checks

Issue Description

The latest release of pycaret (version 3.0.4) still references the old pandas-profiling library in source code. According to the official documentation, pandas-profiling has been converted to ydata-profiling.

To ensure compatibility and utilize the updated functionality of ydata-profiling, it is necessary to update the source code to use ydata-profiling instead of pandas-profiling.

Reproducible Example

from pycaret.datasets import get_data
from pycaret.regression import setup
dataset = get_data("diamond")
dataset1 = get_data("diamond", profile=True)

Expected Behavior

It should give profiling report but it doesn't.

Actual Results

---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
<ipython-input-41-f9ca271b0078> in <cell line: 1>()
----> 1 dataset1 = get_data("diamond", profile=True)

1 frames
/usr/local/lib/python3.10/dist-packages/pycaret/utils/_dependencies.py in _check_soft_dependencies(package, severity, extra, install_name)
    150         if severity == "error":
    151             logger.exception(f"{msg}")
--> 152             raise ModuleNotFoundError(msg)
    153         elif severity == "warning":
    154             logger.warning(f"{msg}")

ModuleNotFoundError: 
'pandas_profiling' is a soft dependency and not included in the pycaret installation. Please run: `pip install pandas-profiling` to install.
Alternately, you can install this by running `pip install pycaret[analysis]`

Installed Versions

System: python: 3.10.12 (main, Jun 7 2023, 12:45:35) [GCC 9.4.0] executable: /usr/bin/python3 machine: Linux-5.15.109+-x86_64-with-glibc2.31

PyCaret required dependencies:
pip: 23.1.2
setuptools: 67.7.2
pycaret: 3.0.4
IPython: 7.34.0
ipywidgets: 7.7.1
tqdm: 4.65.0
numpy: 1.22.4
pandas: 1.5.3
jinja2: 3.1.2
scipy: 1.10.1
joblib: 1.3.1
sklearn: 1.2.2
pyod: 1.1.0
imblearn: 0.10.1
category_encoders: 2.6.1
lightgbm: 3.3.5
numba: 0.56.4
requests: 2.27.1
matplotlib: 3.7.1
scikitplot: 0.3.7
yellowbrick: 1.5
plotly: 5.13.1
plotly-resampler: Not installed
kaleido: 0.2.1
schemdraw: 0.15
statsmodels: 0.13.5
sktime: 0.20.1
tbats: 1.1.3
pmdarima: 2.0.3
psutil: 5.9.5
markupsafe: 2.1.3
pickle5: Not installed
cloudpickle: 2.2.1
deprecation: 2.1.0
xxhash: 3.2.0
wurlitzer: 3.0.3

PyCaret optional dependencies:
shap: 0.42.1
interpret: 0.4.2
umap: 0.5.3
pandas_profiling: Not installed
explainerdashboard: Installed but version unavailable
autoviz: 0.1.730
fairlearn: 0.7.0
deepchecks: 0.17.3
xgboost: 1.7.6
catboost: Not installed
kmodes: Not installed
mlxtend: 0.22.0
statsforecast: Not installed
tune_sklearn: Not installed
ray: Not installed
hyperopt: 0.2.7
optuna: Not installed
skopt: Not installed
mlflow: Not installed
gradio: Not installed
fastapi: Not installed
uvicorn: Not installed
m2cgen: Not installed
evidently: Not installed
fugue: Not installed
streamlit: Not installed
prophet: 1.1.4

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions