Issue #13043: Make references optional for link and linkplain tags #16458

mahfouz72 · 2025-03-03T01:29:53Z

Diff Regression config: https://gist.githubusercontent.com/mohitsatr/f5a524a06294d907c84b9fc5b68661dc/raw/604872961df92db35e66c26218dc74c61bb10097/config.xml

mahfouz72 · 2025-03-03T01:30:22Z

Github, generate report

github-actions · 2025-03-03T02:42:35Z

https://checkstyle-diff-reports.s3.us-east-2.amazonaws.com/00579f4_2025024203/reports/diff/index.html

mahfouz72 · 2025-03-03T17:26:20Z

It looks like Kleene Star is a time killer. we have a lott of them :)

Parse time increased by approximately 5.15 times.
lookahead burden increased by approximately 2.1 times
the prediction time becomes 90%

Before

After

nrmancuso · 2025-03-03T20:07:46Z

It looks like Kleene Star is a time killer. we have a lott of them :)

It all depends on the context, when the grammar is as poorly implemented as this one, any change that makes matching rules more ambiguous will have a compounding negative impact on performance.

mahfouz72 · 2025-03-04T12:30:33Z

mahfouz72 · 2025-03-04T13:48:47Z

src/main/resources/com/puppycrawl/tools/checkstyle/grammar/javadoc/JavadocParser.g4

+            | LINK_LITERAL (WS | NEWLINE | LEADING_ASTERISK)+ reference
+                    (WS | NEWLINE)* ((WS | NEWLINE) description)?
+            | LINK_LITERAL (WS | NEWLINE | LEADING_ASTERISK)* ((WS | NEWLINE) description)?
+            | LINKPLAIN_LITERAL (WS | NEWLINE | LEADING_ASTERISK)+ reference
+                    (WS | NEWLINE)* ((WS | NEWLINE) description)?
+            | LINKPLAIN_LITERAL (WS | NEWLINE | LEADING_ASTERISK)* ((WS | NEWLINE) description)?


My old code that uses reference? and (WS | NEWLINE | LEADING_ASTERISK)* creates an explosion of possibilities.

I tried to guide the parser and separate the rule for reference and non-reference to explore one path at a time to eliminate some ambiguity and avoid a lot of backtracking. I don't know if this is good, as this becomes very verbose

Screenshot shows that this rule separation improves performance (compare it to #16458 (comment) and #16458 (comment))

@nrmancuso @rnveach @romani please share your opinion does this make sense?

I think it makes sense. It would be good to see how we look on memory usage while parsing some larger, nested html Javadocs. If we are hurting more for time than memory, I would suggest that we start extracting subrules to be reused where appropriate. This will help to make this grammar less buggy and ease maintenance.

We should optimize time, not a memory. With reasonable limits. Nowadays memory is cheeper.

…in tags

mahfouz72 · 2025-03-13T21:15:18Z

@nrmancuso @romani @rnveach Are we good here?

rnveach · 2025-03-13T22:00:38Z

src/main/resources/com/puppycrawl/tools/checkstyle/grammar/javadoc/JavadocParser.g4

-                ((WS | NEWLINE) description)?
+            | LINK_LITERAL (WS | NEWLINE | LEADING_ASTERISK)+ reference
+                    (WS | NEWLINE)* ((WS | NEWLINE) description)?
+            | LINK_LITERAL (WS | NEWLINE | LEADING_ASTERISK)* ((WS | NEWLINE) description)?


Since these 2 are so similar, is there any benefit if we combine them so they aren't fully separate entries? I assume this won't really improve performance, so I am ok either way.

| LINK_LITERAL (WS | NEWLINE | LEADING_ASTERISK)* ( (WS | NEWLINE | LEADING_ASTERISK)+ reference (WS | NEWLINE)* )? ( (WS | NEWLINE) description)?

At the least, maybe we should consider in the future making WS | NEWLINE | LEADING_ASTERISK and WS | NEWLINE its own group to somehow make this easier to read.

Since these 2 are so similar, is there any benefit if we combine them so they aren't fully separate entries?

I don't think so. I tried several options the one with 2 separate entries always leads to less parse time and IMO it is better in terms of readability.

At the least, maybe we should consider in the future making WS | NEWLINE | LEADING_ASTERISK and WS | NEWLINE its own group to somehow make this easier to read.

Yes, There should be a clean way to rewrite all this rules to improve the readability of this parse.

romani

ok to merge.

nrmancuso

Thanks!

mahfouz72 force-pushed the allow-empty-ref branch from 753d984 to 00579f4 Compare March 3, 2025 01:32

mahfouz72 mentioned this pull request Mar 3, 2025

Issue #16005: fix parse-error if @see ends with dot #16355

Merged

mahfouz72 force-pushed the allow-empty-ref branch from 00579f4 to 4a75878 Compare March 4, 2025 13:34

mahfouz72 commented Mar 4, 2025

View reviewed changes

Issue checkstyle#13043: Make references optional for link and linkpla…

a7db524

…in tags

mahfouz72 force-pushed the allow-empty-ref branch from 4a75878 to a7db524 Compare March 13, 2025 20:34

rnveach approved these changes Mar 13, 2025

View reviewed changes

romani approved these changes Mar 14, 2025

View reviewed changes

romani assigned nrmancuso Mar 14, 2025

nrmancuso approved these changes Mar 15, 2025

View reviewed changes

nrmancuso merged commit 2a50408 into checkstyle:master Mar 15, 2025
113 checks passed

nrmancuso mentioned this pull request Mar 15, 2025

Make references optional for link and linkplain tags #13043

Closed

mahfouz72 deleted the allow-empty-ref branch May 9, 2025 09:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Issue #13043: Make references optional for link and linkplain tags #16458

Issue #13043: Make references optional for link and linkplain tags #16458

Uh oh!

mahfouz72 commented Mar 3, 2025 •

edited

Loading

Uh oh!

mahfouz72 commented Mar 3, 2025

Uh oh!

github-actions bot commented Mar 3, 2025

Uh oh!

mahfouz72 commented Mar 3, 2025

Uh oh!

nrmancuso commented Mar 3, 2025

Uh oh!

mahfouz72 commented Mar 4, 2025

Uh oh!

mahfouz72 Mar 4, 2025

Uh oh!

nrmancuso Mar 4, 2025

Uh oh!

romani Mar 4, 2025 •

edited

Loading

Uh oh!

mahfouz72 commented Mar 13, 2025

Uh oh!

rnveach Mar 13, 2025

Uh oh!

mahfouz72 Mar 14, 2025

Uh oh!

romani left a comment

Uh oh!

nrmancuso left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Issue #13043: Make references optional for link and linkplain tags #16458

Issue #13043: Make references optional for link and linkplain tags #16458

Uh oh!

Conversation

mahfouz72 commented Mar 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mahfouz72 commented Mar 3, 2025

Uh oh!

github-actions bot commented Mar 3, 2025

Uh oh!

mahfouz72 commented Mar 3, 2025

Before

After

Uh oh!

nrmancuso commented Mar 3, 2025

Uh oh!

mahfouz72 commented Mar 4, 2025

Uh oh!

mahfouz72 Mar 4, 2025

Choose a reason for hiding this comment

Uh oh!

nrmancuso Mar 4, 2025

Choose a reason for hiding this comment

Uh oh!

romani Mar 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mahfouz72 commented Mar 13, 2025

Uh oh!

rnveach Mar 13, 2025

Choose a reason for hiding this comment

Uh oh!

mahfouz72 Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

romani left a comment

Choose a reason for hiding this comment

Uh oh!

nrmancuso left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

mahfouz72 commented Mar 3, 2025 •

edited

Loading

romani Mar 4, 2025 •

edited

Loading