Fix Epoch Number for Console Logging #411

ostamand · 2019-10-06T17:28:31Z

Fix epoch number in console logging.

Before:

After:

Scitator · 2019-10-07T04:49:00Z

Hi, thanks for PR!

Nevertheless, working with checkpoint indices, we also need to logs correctness. In your example, best model saved as train.1.pth, that means checkpoint at 1 epoch.... but from logs perspective it's already 2nd epoch – a little bit confusing :)

Scitator

ostamand · 2019-10-07T13:11:37Z

You are right. I missed that. Since all the logs are epoch zero based, I propose that we keep it as is but update the total number of epochs in the console progress bar. To keep it consistent.

Let me know if you would prefer a full refactoring of the epoch number so that it starts at 1 everywhere & I will do that instead.

Scitator · 2019-10-09T05:30:58Z

Conceptually, I think it's much better to make full refactoring and save model after N epoch as checkpoint.N.pth. It's much easier to understand :)

Looks like you need to refactor here and here... at least I hope so :)
We still need to check logs, metrics and overall correctness.

ostamand · 2019-10-09T17:56:27Z

@Scitator let me know what you think of this:

Added epoch_log & stage_epoch_log properties to RunnerState to keep it cleaner.

Multi-Stage Training Example

Stage 1:

[...]

Stage 2:

Loading from Stage 2 Checkpoint:

Looking at the plots

Stage 1:

Stage 2:

Loading from Stage 2 Checkpoint:

ostamand · 2019-10-09T18:13:45Z

Was not expecting travis to fail for such a small change. Let me look into it first. Sorry.

Scitator · 2019-10-13T17:08:13Z

Hah, @ostamand looks like you also need to rewrite tests a bit :)
We have check like train.1.loss < train.0.loss.... but with your new indexing... we need +1 there

ostamand · 2019-10-15T13:03:16Z

@Scitator Cool thanks for the tests commit! I was about to take a look at it. Let me know if there is any other change you want me to do?

Fix epoch number logging

5533cab

Scitator self-requested a review October 7, 2019 04:49

Scitator requested changes Oct 7, 2019

View reviewed changes

Keep zero based epoch

dee1e77

Add epoch property for log

be12607

Fix codestyle

68e3c1d

tests update

0bcce21

Scitator merged commit 7f1244c into catalyst-team:master Oct 15, 2019

mcollins42 mentioned this pull request Nov 6, 2019

Update google-cloud-automl to latest version and fix broken test for Catalyst Kaggle/docker-python#637

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix Epoch Number for Console Logging #411

Fix Epoch Number for Console Logging #411

Uh oh!

ostamand commented Oct 6, 2019

Uh oh!

Scitator commented Oct 7, 2019

Uh oh!

Scitator left a comment

Uh oh!

ostamand commented Oct 7, 2019 •

edited

Loading

Uh oh!

Scitator commented Oct 9, 2019

Uh oh!

ostamand commented Oct 9, 2019

Uh oh!

ostamand commented Oct 9, 2019

Uh oh!

Scitator commented Oct 13, 2019

Uh oh!

ostamand commented Oct 15, 2019

Uh oh!

Uh oh!

Uh oh!

Fix Epoch Number for Console Logging #411

Fix Epoch Number for Console Logging #411

Uh oh!

Conversation

ostamand commented Oct 6, 2019

Uh oh!

Scitator commented Oct 7, 2019

Uh oh!

Scitator left a comment

Choose a reason for hiding this comment

Uh oh!

ostamand commented Oct 7, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Scitator commented Oct 9, 2019

Uh oh!

ostamand commented Oct 9, 2019

Uh oh!

ostamand commented Oct 9, 2019

Uh oh!

Scitator commented Oct 13, 2019

Uh oh!

ostamand commented Oct 15, 2019

Uh oh!

Uh oh!

ostamand commented Oct 7, 2019 •

edited

Loading