Jump to content

CJK Symbols and Punctuation

From Wikipedia, the free encyclopedia
CJK Symbols and Punctuation
RangeU+3000..U+303F
(64 code points)
PlaneBMP
ScriptsHan (15 char.)
Hangul (2 char.)
Common (43 char.)
Inherited (4 char.)
Assigned64 code points
Unused0 reserved code points
Unicode version history
1.0.0 (1991)56 (+56)
1.0.1 (1992)56 (+0)
1.1 (1993)57 (+1)
3.0 (1999)61 (+4)
3.2 (2002)64 (+3)
Unicode documentation
Code chart ∣ Web page
Note: [1][2]
In Unicode 1.0.1, during the process of unifying with ISO 10646, the "IDEOGRAPHIC DITTO MARK" (仝) was unified with the unified ideograph at U+4EDD, allowing the Japanese Industrial Standard symbol to be moved from U+32FF in the Enclosed CJK Letters and Months block to the vacated code point at U+3004.[3]

CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages. It also contains one Chinese character.

Block

[edit]
CJK Symbols and Punctuation[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+300x ID
 SP 
U+301x
U+302x
U+303x  〾 
Notes
1.^ As of Unicode version 15.1

The block has variation sequences defined for East Asian punctuation positional variants.[4][5] They use U+FE00 VARIATION SELECTOR-1 (VS01) and U+FE01 VARIATION SELECTOR-2 (VS02):

Variation sequences for punctuation alignment
U+ 3001 3002 Description
base code point
base + VS01 、︀ 。︀ corner-justified form
base + VS02 、︁ 。︁ centered form

Orientation

[edit]

Quotation marks and other punctuation have expected differences in behaviour in vertical and horizontal text. The quotation marks 「...」, 『...』 and 〝...〟 rotate 90 degrees, as follows:

Expected behaviour of CJK quotation marks in vertical and horizontal text. The red registration corners mark the glyph metrics and show how the glyph aligns within the em-box of a CJK character.

See also General Punctuation, for variation selectors and CJK behaviour of the Latin quotation marks ‘...’ and “...”.

Chinese character

[edit]

The CJK Symbols and Punctuation block contains one Chinese character: U+3007 IDEOGRAPHIC NUMBER ZERO. Although it is not covered under "Unified Ideographs", it is treated as a CJK character for all other intents and purposes.[6]

Emoji

[edit]

The CJK Symbols and Punctuation block contains two emoji: U+3030 and U+303D.[7][8]

The block has four standardized variants defined to specify emoji-style (U+FE0F VS16) or text presentation (U+FE0E VS15) for the two emoji, both of which default to a text presentation.[9]

Emoji variation sequences
U+ 3030 303D
base code point
base+VS15 (text) 〰︎ 〽︎
base+VS16 (emoji) 〰️ 〽️

History

[edit]

In Unicode 1.0.1, two changes were made to this block in order to make Unicode 1.0.1 a proper subset of ISO 10646:[10][11][12]


The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Symbols and Punctuation block:

See also

[edit]

References

[edit]
  1. ^ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. ^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. ^ "Unicode 1.0.1 Addendum" (PDF). The Unicode Standard. 1992-11-03. Retrieved 2016-07-09.
  4. ^ Lunde, Ken (2018-01-21). "L2/17-436: Proposal to add standardized variation sequences for fullwidth East Asian punctuation" (PDF).
  5. ^ "Unicode Character Database: Standardized Variation Sequences". The Unicode Consortium.
  6. ^ GB/T 15835-2011《出版物上数字用法》. China Guojia Biaozhun. https://journals.usst.edu.cn/uploadfile/file/GBT%2015835-2011%E3%80%8A%E5%87%BA%E7%89%88%E7%89%A9%E4%B8%8A%E6%95%B0%E5%AD%97%E7%94%A8%E6%B3%95%E3%80%8B.pdf
  7. ^ "UTR #51: Unicode Emoji". Unicode Consortium. 2023-09-05.
  8. ^ "UCD: Emoji Data for UTR #51". Unicode Consortium. 2023-02-01.
  9. ^ "UTS #51 Emoji Variation Sequences". The Unicode Consortium.
  10. ^ "Unicode 1.0.1 Addendum" (PDF). The Unicode Standard. 1992-11-03. Retrieved 2016-07-09.
  11. ^ "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  12. ^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.