Skip to content

False-negative: Katakana Middle Dot (・ / U+30FB) & Katakana-Hiragana Double Hyphen (゠ / U+30A0) #189

@tats-u

Description

@tats-u

Version: 3.1.0

U+30FB is frequently used as the equivalent meaning to , &, in Japanese.

ドン・キホーテ (Don Quijote)
2・3番目 (2nd & 3rd)
何・・・? (What...?)

U+30A0 can be used as the equivalent meaning to - in European people names in Japanese.

サン=サーンス (Saint-Saëns)

> cjk.all().toRegExp().test("\u30fb")
false
> cjk.all().toRegExp().test("\u30a0")
false

They should be true in .all() and .punctuations().

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions