Fix Swift runtime ANTLRInputStream that can’t read Unicode scalars #3025

niw · 2021-01-03T18:06:29Z

Problems

ANTLRInputStream in Swift runtime is using array of Character as internal representation to get Unicode code point as Int to supply it to the lexer. However, Swift Character is not representing Unicode code point but representing Unicode grapheme, therefore if the original String contains characters that build up from multiple Unicode code points such as Family emoji (👨‍👩‍👧‍👦, which is represented in a single Character in Swift but actually U+1F468 U+200D U+1F469 U+200D U+1F467 U+200D U+1F466), ANTLRInputStream will not able to get each Unicode code point.

Solution

Use array of UnicodeScalar instead.
Add unit tests to ensure ANTLRInputStream can read each unicode code point.

Turns out, using `UnicodeScalarView` is extremely slow.

hanjoes · 2021-10-11T18:56:09Z

will try to merge it as part of #3301

niw force-pushed the fix_swift_input_stream branch from 5918123 to 86c89b4 Compare January 4, 2021 21:34

niw mentioned this pull request Jan 7, 2021

Travis continuous integration is no longer an option; need new location #3029

Closed

niw added 3 commits January 7, 2021 18:37

Add niw to contributors.

baae110

Add unit tests.

67ed431

Use UnicodeScalarView instead of array of Character.

65aacab

niw force-pushed the fix_swift_input_stream branch from 86c89b4 to 65aacab Compare January 8, 2021 02:39

Use array of UnicodeScalar instead.

f54c0e7

Turns out, using `UnicodeScalarView` is extremely slow.

hanjoes mentioned this pull request Oct 11, 2021

Preparing for 4.9.3 release #3301

Closed

parrt added the target:swift label Oct 11, 2021

parrt added this to the 4.9.3 milestone Oct 11, 2021

Merge branch 'master' into fix_swift_input_stream

00e5fae

parrt merged commit c293e23 into antlr:master Oct 11, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix Swift runtime ANTLRInputStream that can’t read Unicode scalars #3025

Fix Swift runtime ANTLRInputStream that can’t read Unicode scalars #3025

niw commented Jan 3, 2021 •

edited

Loading

Uh oh!

hanjoes commented Oct 11, 2021

Uh oh!

Uh oh!

Fix Swift runtime ANTLRInputStream that can’t read Unicode scalars #3025

Fix Swift runtime ANTLRInputStream that can’t read Unicode scalars #3025

Conversation

niw commented Jan 3, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hanjoes commented Oct 11, 2021

Uh oh!

Uh oh!

niw commented Jan 3, 2021 •

edited

Loading