Ideographic Description Characters


Ideographic Description Characters is a Unicode block containing graphic characters used for describing CJK ideographs. They are used in Ideographic Description Sequences to provide a description of an ideograph, in terms of what other ideographs make it up and how they are laid out relative to one another. An IDS provides the reader with a description of an ideograph that cannot be represented properly, usually because it is not encoded in Unicode; rendering systems are not intended to automatically compose the pieces into a complete ideograph, and the descriptions are not standardized.
U+2FF0 to U+2FFB were introduced from GBK; U+2FFC to U+2FFF were devised later and introduced in Unicode 15.1.

Ideographic Description Sequences

Ideographic Description Sequences are sequences of characters that represent a Chinese character structure as defined by the Unicode standard.
Below are the 16 characters as defined by Unicode in this block:
UnicodeSymbolMeaningExample 1IDSExample 2IDS
U+2FF0Two components combined left to right⿰木目?⿰丨㇍
U+2FF1Two components combined above to below⿱木口?⿱?丶
U+2FF2Three components combined left to middle and right⿲彳氵亍?⿲丿夕乚
U+2FF3Three components combined above to middle and below⿳亠口小?⿳亼目口
U+2FF4One component fully wrapping another component⿴囗口?⿴㐁人
U+2FF5One component surround three sides of another component ⿵几皇?⿵齊虫
U+2FF6One component surround three sides of another component ⿶凵㐅⿶乂丶
U+2FF7One component surround three sides of another component ⿷匚斤?⿷虎九
U+2FF8One component surround top and left side of another component⿸疒丙?⿸耂火
U+2FF9One component surround top and right side of another component⿹戈廾?⿹或壬
U+2FFAOne component surround bottom and left side of another component⿺走召?⿺礼分
U+2FFBTwo components overlapped⿻工从?⿻木⿻コ一
U+2FFCOne component surround three sides of another component ⿼叉丶?⿼コ二
U+2FFDOne component surround bottom and right side of another component⿽水丶⿽⺀十
U+2FFEHorizontal reflection⿾卍?⿾正
U+2FFF⿿Rotation?⿿凹?⿿予

Two other related ideographic description characters are not encoded in this Unicode block, but of which may be used in ideographic description sequences:
UnicodeSymbolBlockMeaningExample 1IDSExample 2IDS
U+303ECJK Symbols and PunctuationVariant but not equivalent〾胶 ?〾爫
U+31EFCJK StrokesSubtraction㇯兵丶?㇯豕一

This is the syntax of IDS in EBNF:

IDS := Ideographic | Radical | CJK_Stroke | Private Use | U+FF1F | IDS_UnaryOperator IDS | IDS_BinaryOperator IDS IDS | IDS_TrinaryOperator IDS IDS IDS
CJK_Stroke := U+31C0 | U+31C1 |... | U+31E3
IDS_UnaryOperator := U+2FFE | U+2FFF | U+303E
IDS_BinaryOperator := U+2FF0 | U+2FF1 | U+2FF4 |... | U+2FFD | U+31EF
IDS_TrinaryOperator:= U+2FF2 | U+2FF3

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Ideographic Description Characters block: