[ENH] Adding CNNRegressor and BaseDeepRegressor #2902

AurumnPegasus · 2022-06-30T14:22:02Z

Reference Issues/PRs

See aso #2894.

What does this implement/fix? Explain your changes.

Implements BaseDeepRegressor, which is a base class for Deep-Learning based Regression models. Also implements CNNRegressor using the base class.

Does your contribution introduce a new dependency? If yes, which one?

No.

What should a reviewer concentrate their feedback on?

I am a bit unsure about what all is required in BaseDeepRegressor. Once that is finalised, implementing CNNRegressor should be easy addition to it.

PR checklist

For all contributions

I've added unit tests and made sure they pass locally.
The PR title starts with either [ENH], [MNT], [DOC], or [BUG] indicating whether the PR topic is related to enhancement, maintenance, documentation, or bug.

AurumnPegasus · 2022-07-04T15:25:29Z

About the tests that fail:

AssertionError: 
Arrays are not almost equal to 6 decimals

Mismatched elements: 5 / 5 (100%)
Max absolute difference: 0.002462
Max relative difference: 0.00493456
 x: array([0.5     , 0.501392, 0.499438, 0.5     , 0.501738], dtype=float32)
 y: array([0.499986, 0.49893 , 0.499301, 0.500023, 0.500554], dtype=float

If the error is caused by the difference, I think we should introduce a threshold here. Since CNN / most neural have randomly initialised initial state, it is not uncommon to get different results (and in this case its a very minor difference).

Just a note, there are 2 tests which are failing, and both of them are related to idempotent matrices. It feels odd that only those two cases are failing, but I do not understand why its just those two matrices.

AurumnPegasus · 2022-07-11T15:48:52Z

Something I found out about reproduciblity in keras.
official docs, stack overflow answer

In short, to be absolutely sure that you will get reproducible results with your python script on one computer's/laptop's CPU then you will have to do the following:

Set PYTHONHASHSEED environment variable at a fixed value

Set python built-in pseudo-random generator at a fixed value

Set numpy pseudo-random generator at a fixed value

Set tensorflow pseudo-random generator at a fixed value

Configure a new global tensorflow session

I can test this out to see if it also works on windows 🤞🏽
Though I do think we need to redo our tests since:

Moreover, when running on a GPU, some operations have non-deterministic outputs, in particular tf.reduce_sum(). This is due to the fact that GPUs run many operations in parallel, so the order of execution is not always guaranteed. Due to the limited precision of floats, even adding several numbers together may give slightly different results depending on the order in which you add them. You can try to avoid the non-deterministic operations, but some may be created automatically by TensorFlow to compute the gradients

So when we do implement the added functionality of GPU compute, the test is meant to fail.

TonyBagnall · 2022-07-12T16:33:47Z

hi, sorry for the delay in looking at this. I have looked back at CNNClassifier, and basically I found the same. keras is non deterministic, and even if you could fix all seeds and reproduce results, fixing a global variable like numpy.random in one classifier seems like it may have nasty ramifications down the line.

My decision was to just not do the correctness tests for CNNClassifier. Instead, I have run the whole lot on the UCR data and can show no significant difference to the published results with a fixed version of sktime. If we ever have a correctness problem, we can compare back to this. I can do the same for the Regressor, and the other deep learners. Note I also excluded these two test

# test fail with deep problem with pickling inside tensorflow.
"CNNClassifier": [
    "test_fit_idempotent",
    "test_persistence_via_pickle",
],

I suggest you do the same for regressors

TonyBagnall

we might need to look for a tidier way of excluding classifiers/regressors, since when we port in more the config will start to bloat, but that is beyond the scope of this PR.

…into cnnregressor

AurumnPegasus · 2022-07-13T09:44:37Z

@TonyBagnall My commit seems to fail the test for soft dependencies

RuntimeError: Estimator CNNRegressor does not require soft dependencies according to tags, but raises ModuleNotFoundError on __init__. Any required soft dependencies should be added to the "python_dependencies" tag, and python version bouds should be added to the "python_version" tag. Exception text: tensorflow

I have added the required dependency in test_softdeps.py but it still is not reflecting that. Any idea what could be going wrong?

TonyBagnall · 2022-07-13T10:15:18Z

Estimator CNNRegressor does not require soft dependencies according to tags

so just looking at CNNClassifier, which does run, the BaseDeepClassifier has these tags,
_tags = {
"X_inner_mtype": "numpy3D",
"capability:multivariate": True,
"python_dependencies": "tensorflow",
}
I've not seen the python_dependencies tag before. Does the base regressor have it? If not, try adding it. Ask franz if you want an overview of tags logic.

…into cnnregressor

fkiraly

Looks ok, great!

One comment: if you wrote this alone, there is no need to credit @TonyBagnall or @james-large as authors, kindly remove that.

AurumnPegasus · 2022-07-15T03:54:16Z

Done, @fkiraly

fkiraly

Looks good!

added base regressor for sktime#2894

8ab25f7

fkiraly assigned AurumnPegasus Jul 1, 2022

AurumnPegasus added 2 commits July 4, 2022 18:34

Changed base class (sktime#2902)

1e7a245

Added CNN Regressor class sktime#2902

6feb03b

AurumnPegasus marked this pull request as ready for review July 6, 2022 10:52

AurumnPegasus requested review from fkiraly and aiwalter as code owners July 6, 2022 10:52

AurumnPegasus force-pushed the cnnregressor branch from aa59470 to fabfc6a Compare July 6, 2022 11:11

fixed branch

fabfc6a

fkiraly self-assigned this Jul 8, 2022

modified initial random seed

44d070b

excluded tests

5a28e6a

TonyBagnall previously approved these changes Jul 13, 2022

View reviewed changes

AurumnPegasus added 2 commits July 13, 2022 14:19

redid changes

0b0faec

Merge branch 'main' of https://github.com/alan-turing-institute/sktime …

287ff13

…into cnnregressor

AurumnPegasus dismissed TonyBagnall’s stale review via 287ff13 July 13, 2022 08:50

AurumnPegasus force-pushed the cnnregressor branch from 97cc27e to 287ff13 Compare July 13, 2022 08:50

AurumnPegasus added 2 commits July 13, 2022 14:39

fixed dependancy issues

167c446

changed soft dependencies test

6898fd9

another try for dependanct test

566f535

Tony Bagnall and others added 4 commits July 13, 2022 19:18

Merge branch 'main' into cnnregressor

815b70c

added dependancy tag

e82e5d6

Merge branch 'cnnregressor' of https://github.com/AurumnPegasus/sktime …

cb1906e

…into cnnregressor

Merge branch 'main' of https://github.com/alan-turing-institute/sktime …

156a449

…into cnnregressor

TonyBagnall previously approved these changes Jul 14, 2022

View reviewed changes

Merge branch 'main' into cnnregressor

a9521f9

fkiraly requested changes Jul 14, 2022

View reviewed changes

AurumnPegasus dismissed TonyBagnall’s stale review via cac67a9 July 15, 2022 03:53

updated authors

cac67a9

fkiraly approved these changes Jul 23, 2022

View reviewed changes

fkiraly merged commit 694c449 into sktime:main Jul 23, 2022

AurumnPegasus mentioned this pull request Aug 26, 2022

[ENH] Migrating Estimators from sktime-dl #3351

Open

27 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[ENH] Adding CNNRegressor and BaseDeepRegressor #2902

[ENH] Adding CNNRegressor and BaseDeepRegressor #2902

Uh oh!

AurumnPegasus commented Jun 30, 2022

Uh oh!

AurumnPegasus commented Jul 4, 2022

Uh oh!

AurumnPegasus commented Jul 11, 2022

Uh oh!

TonyBagnall commented Jul 12, 2022

Uh oh!

TonyBagnall left a comment

Uh oh!

AurumnPegasus commented Jul 13, 2022

Uh oh!

TonyBagnall commented Jul 13, 2022

Uh oh!

fkiraly left a comment

Uh oh!

AurumnPegasus commented Jul 15, 2022

Uh oh!

fkiraly left a comment

Uh oh!

Uh oh!

Uh oh!

[ENH] Adding CNNRegressor and BaseDeepRegressor #2902

[ENH] Adding CNNRegressor and BaseDeepRegressor #2902

Uh oh!

Conversation

AurumnPegasus commented Jun 30, 2022

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Does your contribution introduce a new dependency? If yes, which one?

What should a reviewer concentrate their feedback on?

PR checklist

For all contributions

Uh oh!

AurumnPegasus commented Jul 4, 2022

Uh oh!

AurumnPegasus commented Jul 11, 2022

Uh oh!

TonyBagnall commented Jul 12, 2022

Uh oh!

TonyBagnall left a comment

Choose a reason for hiding this comment

Uh oh!

AurumnPegasus commented Jul 13, 2022

Uh oh!

TonyBagnall commented Jul 13, 2022

Uh oh!

fkiraly left a comment

Choose a reason for hiding this comment

Uh oh!

AurumnPegasus commented Jul 15, 2022

Uh oh!

fkiraly left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!