Making millisecond dimension grouping less error prone #20262

snake14 · 2023-01-24T22:00:59Z

Description:

Some customers were occasionally seeing errors from millisecond dimensions being grouped. This checks if the value is numeric and casts it as a float. If the value isn't numeric, a warning is logged.

Review

core/Columns/Dimension.php

… with things

snake14 · 2023-01-25T21:58:13Z

@tsteur I was just able to reproduce the error using an automated test, so I'm pretty confident in the fix now. Should I remove the is_numeric check and logging or do you think it might still be helpful?

tsteur · 2023-01-25T23:42:27Z

No big preference @snake14 . It may be useful in the future should a non-numeric value be passed there then we'll be able to better understand what's happening I suppose. Haven't tried to test this though what would happen.

tests/PHPUnit/Integration/Columns/DimensionTest.php

AltamashShaikh

@snake14 Added 1 comment to add testcases for other locale too

snake14 · 2023-01-29T21:37:18Z

No big preference @snake14 . It may be useful in the future should a non-numeric value be passed there then we'll be able to better understand what's happening I suppose. Haven't tried to test this though what would happen.

Thank you @tsteur . I went ahead and updated the tests and confirmed what I expected as far as the behaviour if non-numeric values are found. It logs a warning and casts the value as a float. So something like abc123 would be 0 and 123abc becomes 123.0. It's not ideal, but it's better than an error that prevents archiving. This is also an extreme edge case since the issue being fixed was with the use of the number_format function. So, I don't think we ever pass in non-numeric values for this type.

AltamashShaikh

Looks good to me 👍
lets wait for core team to check if they think this can cause any issue

tests/PHPUnit/Integration/Columns/DimensionTest.php

sgiehl · 2023-02-02T14:37:28Z

@AltamashShaikh @snake14 Were you able to reproduce that problem locally? Your "fix" actually only solves the resulting problem, but it doesn't seem to fix the root cause of the issue.
If e.g. a already formatted number is being used at that point, casting it to float might actually change the result.
If we e.g. have a number like 100000.25 and it might already have been formatted with english format, that would be 100,000.25. Casting that to float results in 100, which is obviously incorrect.

snake14 · 2023-02-02T19:52:10Z

@AltamashShaikh @snake14 Were you able to reproduce that problem locally? Your "fix" actually only solves the resulting problem, but it doesn't seem to fix the root cause of the issue. If e.g. a already formatted number is being used at that point, casting it to float might actually change the result. If we e.g. have a number like 100000.25 and it might already have been formatted with english format, that would be 100,000.25. Casting that to float results in 100, which is obviously incorrect.

@sgiehl Yes. I was able to reproduce the error locally using the new test cases that I wrote. The issue was that we're using the number_format() function and then performing calculations on the results. That meant that anything larger than 999 would contain a comma and not be a valid 'numeric' value. Agreed. Casting to a float isn't ideal, but that's why I log the actual value before that. However, there isn't any evidence that the system has had any non-numeric values, so the logging and casting is simply a precaution.

sgiehl · 2023-02-03T10:11:29Z

@snake14 Sure. But logging an incorrect value before using it, doesn't make the value correct again. It will only help use to see how often that actually happens.
The imho correct solution, would be to narrow down why a number formatting is already applied before we use the values for calculating. Maybe doing a number_format at this or somewhere else isn't correct and should actually be a round?

snake14 · 2023-02-06T20:12:08Z

@snake14 Sure. But logging an incorrect value before using it, doesn't make the value correct again. It will only help use to see how often that actually happens. The imho correct solution, would be to narrow down why a number formatting is already applied before we use the values for calculating. Maybe doing a number_format at this or somewhere else isn't correct and should actually be a round?

@sgiehl If you think it's better, I can remove the logging and cast since they're not necessary. As far as the use of number_format, I think it's been using that for quite some time. That said, in past projects, I believe that I've seen more reliable results from using number_format than round, but I can switch it over if you prefer?

sgiehl · 2023-02-07T08:42:49Z

@sgiehl If you think it's better, I can remove the logging and cast since they're not necessary. As far as the use of number_format, I think it's been using that for quite some time. That said, in past projects, I believe that I've seen more reliable results from using number_format than round, but I can switch it over if you prefer?

The groupValue value code is there since the release of CustomReports plugin. It is actually not even used in core.
number_format being more reliable than round sounds rather strange to me. Both actually should internally do the same, but number_format additionally applies some formatting and converts the result to a string.

As I tried to point out before: The problem is not necessarily the number_format done in the groupValue method. Even though the error is thrown at that point it is not the correct approach to simply try fixing the error in that method.
The problem might be somewhere else, as an already formatted value should not be provided to the groupValue method.
So as long as you are not able to point out why an already formatted value is passed to that method you won't be able to provide a proper fix for it.

Note: Once you figured that out we can change the number_format to round in core, as imho there should not be any formatting applied while archiving/calculating numbers. That is part of presentation layer (e.g. API or UI).
But we should not cast the value there, as this might change it's value, which would be unexpected.

tsteur · 2023-02-07T19:04:08Z

fyi @sgiehl I don't think it ever retrieves a formatted value as it's called in archiving. It's more an issue that the number is a string instead of an integer and the method doesn't like it.

sgiehl · 2023-02-07T19:09:03Z

@tsteur No. The method does not have a problem with a string if it contains a well formed float or integer. See https://3v4l.org/0DgX0

tsteur · 2023-02-07T19:19:31Z

@sgiehl got it. The problem is the * 1000 and not the number_format https://3v4l.org/XlNkd

snake14 · 2023-02-07T20:26:26Z

@sgiehl got it. The problem is the * 1000 and not the number_format https://3v4l.org/XlNkd

That's correct. The error was being thrown by trying to multiply by a formatted number and not a valid float. If the number was greater than 999, like 2000, it was trying to multiply 2,000.00 against 1000, which would throw the error. That's why I removed the thousands separator, so that it would be a valid float value of 2000.00 instead.

sgiehl · 2023-02-09T13:59:51Z

Please replace the number_format with a round. As the result is used for calculating we should use a method that returns a numeric value and not use a method that is meant to format a number to be used in a textual presentation.
Even if PHP is not a type safe language yet, I would like to try getting Matomo more and more into handling variables more type safe.

Regarding the type cast to float and the additional logging and the tests: As it turned out that the incoming value does not seem to be the problem, we could actually consider to remove that again. As archiving should never provide invalid values this shouldn't cause any problems. If you think it's worth to keep that, we should create a follow up issue to check if anything was logged within the next weeks/months and remove it again if nothing is logged.

Personally I think it would be better to remove it again. The only case I can currently image that a wrong value would be provided is during development (like providing an incorrect type for a dimension). And in that case I prefer to have an error that aborts everything instead of having a warning logged, that can be easily overseen/ignored.

snake14 · 2023-02-12T20:38:54Z

Please replace the number_format with a round. As the result is used for calculating we should use a method that returns a numeric value and not use a method that is meant to format a number to be used in a textual presentation. Even if PHP is not a type safe language yet, I would like to try getting Matomo more and more into handling variables more type safe.

Regarding the type cast to float and the additional logging and the tests: As it turned out that the incoming value does not seem to be the problem, we could actually consider to remove that again. As archiving should never provide invalid values this shouldn't cause any problems. If you think it's worth to keep that, we should create a follow up issue to check if anything was logged within the next weeks/months and remove it again if nothing is logged.

Personally I think it would be better to remove it again. The only case I can currently image that a wrong value would be provided is during development (like providing an incorrect type for a dimension). And in that case I prefer to have an error that aborts everything instead of having a warning logged, that can be easily overseen/ignored.

Thanks @sgiehl . I made the changes you recommended 👍

Making millisecond dimension grouping less error prone

274e297

snake14 commented Jan 24, 2023

View reviewed changes

core/Columns/Dimension.php Outdated Show resolved Hide resolved

snake14 added the Needs Review PRs that need a code review label Jan 24, 2023

AltamashShaikh reviewed Jan 25, 2023

View reviewed changes

core/Columns/Dimension.php Outdated Show resolved Hide resolved

Code review change

1aa18c8

snake14 requested a review from AltamashShaikh January 25, 2023 20:36

snake14 added 2 commits January 26, 2023 09:55

Realised that it was the thousands separator in number_format messing…

05842fb

… with things

Created a some tests to reproduce the issue to verify the fix

1961334

AltamashShaikh reviewed Jan 26, 2023

View reviewed changes

tests/PHPUnit/Integration/Columns/DimensionTest.php Show resolved Hide resolved

AltamashShaikh requested changes Jan 26, 2023

View reviewed changes

Improving the new tests a little more

3472fde

snake14 requested a review from AltamashShaikh January 30, 2023 00:30

AltamashShaikh approved these changes Feb 1, 2023

View reviewed changes

tests/PHPUnit/Integration/Columns/DimensionTest.php Show resolved Hide resolved

AltamashShaikh requested a review from sgiehl February 1, 2023 02:44

Making some recommended changes from code review

abe6c05

Removed unused references

a580931

remove useless logging checks from tests

99426d8

sgiehl approved these changes Feb 13, 2023

View reviewed changes

sgiehl merged commit 2cb7c73 into 4.x-dev Feb 13, 2023

sgiehl deleted the l3-398-ms-dimension-group-error branch February 13, 2023 08:37

sgiehl added this to the 4.14.0 milestone Mar 20, 2023

Uh oh!

Making millisecond dimension grouping less error prone #20262

Making millisecond dimension grouping less error prone #20262

Uh oh!

Conversation

snake14 commented Jan 24, 2023

Description:

Review

Uh oh!

Uh oh!

Uh oh!

snake14 commented Jan 25, 2023

Uh oh!

tsteur commented Jan 25, 2023

Uh oh!

Uh oh!

AltamashShaikh left a comment

Choose a reason for hiding this comment

Uh oh!

snake14 commented Jan 29, 2023

Uh oh!

AltamashShaikh left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sgiehl commented Feb 2, 2023

Uh oh!

snake14 commented Feb 2, 2023

Uh oh!

sgiehl commented Feb 3, 2023

Uh oh!

snake14 commented Feb 6, 2023

Uh oh!

sgiehl commented Feb 7, 2023

Uh oh!

tsteur commented Feb 7, 2023

Uh oh!

sgiehl commented Feb 7, 2023

Uh oh!

tsteur commented Feb 7, 2023

Uh oh!

snake14 commented Feb 7, 2023

Uh oh!

sgiehl commented Feb 9, 2023

Uh oh!

snake14 commented Feb 12, 2023

Uh oh!

Uh oh!