Skip to content

Conversation

tats-u
Copy link
Contributor

@tats-u tats-u commented Sep 4, 2024

Fixes #189

Also, han with Ideographic Variation Sequence (U+E0100–U+E01EF) is now treated as CJK.

E.g. 竈門禰󠄀(U+79B0 U+E0100)豆子

@tats-u tats-u changed the title Add symbols in Katakana block and IVS Add symbols in Katakana block Jan 11, 2025
src/index.ts Outdated
@@ -23,6 +23,7 @@ const cjkPunctuations = unicode({
'Kanbun',
'CJK_Strokes',
'Enclosed_CJK_Letters_And_Months',
'Katakana',
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hello @tats-u, I just got access to this repo, I'll take care of this PR.

There seems more than just U+30FB and U+30A0, are you sure there should all treat as punctuations?

https://en.wikipedia.org/wiki/Katakana_(Unicode_block)

Copy link
Contributor Author

@tats-u tats-u Mar 8, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As you may suspect, many of them are just letters. I have to modify the code.

Copy link
Contributor Author

@tats-u tats-u Mar 9, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@fisker I remembered I've thought all of the non-punctuation characters in the Katakana block are included in cjkLetters because of .subtract(cjkLetters). I don't think I have to remove this line. I don't think I have to remove this line now.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All right, I'll take a look tomorrow.

Copy link
Contributor Author

@tats-u tats-u left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Snapshot was taken as intended.

# Conflicts:
#	tests/__snapshots__/index.test.ts.snap
@fisker fisker merged commit c66a3d1 into ikatyang-collab:main Mar 10, 2025
3 checks passed
tats-u

This comment was marked as outdated.

@tats-u
Copy link
Contributor Author

tats-u commented Mar 10, 2025

I checked the new diff too. LGTM. Thank you for the merging.

@tats-u tats-u deleted the katakana-ivs branch March 10, 2025 14:14
@fisker
Copy link
Collaborator

fisker commented Mar 10, 2025

I'll improve the snapshot a little bit, and release a new version.

@fisker
Copy link
Collaborator

fisker commented Mar 10, 2025

Released v3.3.0, would you like to upgrade it in Prettier?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

False-negative: Katakana Middle Dot (・ / U+30FB) & Katakana-Hiragana Double Hyphen (゠ / U+30A0)
2 participants