GB 18030-2022 PDF English
US$5005.00 · In stock · Download in 9 secondsGB 18030-2022: Information technology - Chinese coded character set Delivery: 9 seconds. True-PDF full-copy in English & invoice will be downloaded + auto-delivered via email. See step-by-step procedureStatus: Valid GB 18030: Evolution and historical versions
Standard ID | Contents [version] | USD | STEP2 | [PDF] delivery | Name of Chinese Standard | Status |
GB 18030-2022 | English | 5005 |
Add to Cart
|
0-9 seconds. Auto-delivery
|
Information technology - Chinese coded character set
| Valid |
GB 18030-2005 | English | 4690 |
Add to Cart
|
0-9 seconds. Auto-delivery
|
Information technology -- Chinese coded character set
| Obsolete |
GB 18030-2000 | English | RFQ |
ASK
|
3 days
|
Information technology-Chinese ideograms coded character set for information interchange-Extension for the basic set
| Obsolete |
Excerpted PDFs (Download full copy in 9 seconds upon purchase)PDF Preview: GB 18030-2022
GB 18030-2022: Information technology - Chinese coded character set---This is an excerpt. Full copy of true-PDF in English version (including equations, symbols, images, flow-chart, tables, and figures etc.), auto-downloaded/delivered in 9 seconds, can be purchased online: https://www.ChineseStandard.net/PDF.aspx/GB18030-2022
PEOPLE’S REPUBLIC OF CHINA
ICS 35.040
CCS L 71
Replacing GB 18030-2005
Information technology - Chinese coded character set
Issued on: JULY 19, 2022
Implemented on: AUGUST 01, 2023
Issued by. State Administration for Market Regulation;
Standardization Administration of the People's Republic of China.
Table of Contents
Foreword... i
1 Scope... 0
2 Normative references... 0
3 Terms and definitions... 0
4 Repertoire... 1
5 Overall structure... 2
6 Sequence of characters... 4
7 Code point allocation... 4
8 Explanation of some characters and codes... 7
9 Implementation level... 7
Annex A (normative) Character table of double-byte... 9
Annex B (normative) Ideographic descriptors... 91
Annex C (normative) Character table of four-byte... 92
Annex D (informative) Explanation of some characters and codes... 546
Annex E (informative) Code positions of Chinese characters in "General Standard
Chinese Character List"... 549
Bibliography... 742
1 Scope
This document specifies the hexadecimal representation of Chinese graphic characters
and their binary codes used in information technology.
This document applies to the processing, exchange, storage, transmission, presentation,
input and output of Chinese and other graphic character information.
This document is applicable to technical products with information processing and
exchange functions of Chinese and other text and graphic characters, including but not
limited to the software products represented by input methods, optical character
recognition (OCR), editing and proofreading, machine translation, speech synthesis,
text transcription, intelligent writing, etc., as well as the hardware products represented
by computers, communication terminal equipment, e-book readers, learning machines,
etc.
2 Normative references
The following referenced documents are indispensable for the application of this
document. For dated references, only the edition cited applies. For undated references,
the latest edition of the referenced document (including any amendments) applies.
GB/T 2312-1980, Code of Chinese graphic character set for information
interchange - Primary set
GB/T 11383-1989, Information process in 8-bit code for information interchange -
Structure and rules for implementation
GB/T 13000, Information technology - Universal multiple - Octet coded character
set (UCS)
3 Terms and definitions
For the purposes of this document, the following terms and definitions apply.
3.1 character
An element in a collection of elements used to organize, control, or represent data.
3.2 coded character
Character (3.1) and its coded representation.
3.3 private use area
An area that can be specified by the user of a product conforming to this document.
3.4 repertoire
A specified set of characters (3.1) represented by a coded character (3.2) set.
3.5 reserved zone
Areas reserved for future specified by this document.
4 Repertoire
4.1 Overview
The characters included in this document are coded in single-byte, double-byte or four-
byte.
4.2 Part of single-byte
In this document, the part of single-byte includes all 128 characters from 0x00 to 0x7F
of GB/T 11383-1989.
4.3 Part of double-byte
The part of double-byte includes all graphic characters in GB/T 2312-1980, CJK unified
Chinese characters and some graphic characters in GB/T 13000.The characters in the
part of double-byte are in accordance with the provisions in Annex A. Among them, the
graphics, code positions and functions of ideographic descriptors shall comply with the
provisions of Annex B.
NOTE. GB/T 13000 uniformly encodes Chinese characters used in China, Japan, South Korea,
Vietnam and other countries and regions. Chinese characters with unique abstract glyphs are
assigned a separate code position. Chinese characters with different sources but the same abstract
glyphs are given a common code position. The encoded Chinese characters are called CJK unified
Chinese characters (CJK Unified Ideographs), where CJK means China, Japan, and Korea.
4.4 Part of four-byte
The part of four-byte includes 66 CJK unified Chinese characters (9FA6~9FEF,
excluding 9FB4~9FBB) in GB/T 13000 other than the above-mentioned double-byte
characters, CJK unified Chinese character extension A, CJK unified Chinese character
extension B, CJK unified Chinese character extension C, CJK unified Chinese character
extension D, CJK unified Chinese character extension E, CJK unified Chinese character
extension F and the characters of ethnic minorities that have been coded in GB/T 13000.
The characters in the part of four-byte follow the provisions of Annex C.
5 Overall structure
In the text, all numbers marked with 0x are in hexadecimal. Those not marked with 0x
are in decimal. All coded representations in the appendix are expressed in hexadecimal.
All other numbers are expressed in decimal.
The part of single-byte adopts the encoding structure of GB/T 11383-1989.Use code
points 0x00~0x7F.
6 Sequence of characters
6.1 Sequence of characters in part of single-byte
All characters in the part of single-byte are arranged in the order of the corresponding
characters in GB/T 11383-1989.
6.2 Sequence of characters in part of double-byte
See Annex A for the sequence of characters in the part of double-byte.
6.3 Sequence of characters in part of four-byte
There is a total of 50400 code points from 0x81308130 to 0x8439FE39.The characters
corresponding to all basic multilingual plane of GB/T 13000 not included in the part of
double-byte shall be arranged in the order of the corresponding characters of basic
multilingual plane in GB/T 13000.
A total of 1058400 code points from 0x90308130 to 0xE339FE39 are used for the 16
auxiliary planes corresponding to GB/T 13000.The sequence of character arrangement
is completely in accordance with the corresponding code point sequence of the 16
auxiliary planes of GB/T 13000.
The sequence of characters in the part of four-byte shall comply with Annex C.
7 Code point allocation
7.1 Code point allocation for part of single-byte
The code points of the part of single-byte are allocated according to the rules of GB/T
11383-1989.See Figure 2 for the allocation of single-byte code points.
Figure 2 -- Code point map for zone of single byte
7.2 Code point allocation for part of double-byte
The code point arrangement of the part of double-byte is divided into two parts.
0x8140~0xFE7E and 0x8180~0xFEFE, a total of 23940 code points. See Figure 3 and
Table 2 for the allocation of double-byte code points.
8 Explanation of some characters and codes
Compared with GB 18030-2005, the glyphs at some code positions and/or the
corresponding GB/T 13000 code positions have been adjusted in this document (see
Annex D).
9 Implementation level
9.1 General
This document specifies three implementation levels. System software products that
meet the corresponding implementation level shall provide input and output functions
for all characters within the corresponding implementation level.
9.2 Implementation level 1
Implementation level 1 supports CJK unified Chinese characters (i.e.,
0x82358F33~0x82359636) and CJK unified Chinese character extension A (i.e.,
0x8139EE39~0x82358738) of the single-byte coded part, double-byte coded part and
four-byte coded part of this document.
Any product to which this document applies shall meet the requirements for
implementation level 1.
NOTE. According to the needs of software applications, implementation level 1 can also choose to
support any one or more non-Chinese characters listed in Table 3.
9.3 Implementation level 2
Implementation level 2 contains implementation level 1.In addition, implementation
level 2 also supports encoded Chinese characters that are not included in
implementation level 1 in the "General Standard Chinese Character List". See Annex E
for the code positions and glyphs of the Chinese characters included in the "General
Standard Chinese Character List" in this document.
The system software and supporting software shall meet the requirements for
implementation level 2.
NOTE. System software and supporting software include but not limited to operating system,
database management system, and middleware (see GB/T 36475 for information on software
product classification).
9.4 Implementation level 3
Implementation level 3 contains implementation level 2.In addition, implementation
level 3 also supports all Chinese characters specified in this document and Kangxi
radicals in Table 3.
Products used for government services and public services shall meet the requirements
of level 3.
NOTE. Government services and public service industries include but are not limited to railway
transportation, road transportation, water transportation, air transportation, multimodal
transportation and transportation agency, postal services, monetary and financial services, insurance,
land management, health, national institutions, social security, etc. (see GB/T 4754 for industry
classification information).
Annex A
(normative)
Character table of double-byte
A.1 Content
This table gives the glyphs and codes of double-byte coded characters. At the same time,
the GB/T 13000 code position corresponding to this character is given.
A.2 Description
Annex C
(normative)
Character table of four-byte
C.1 Contents
This table gives the glyphs and codes of Chinese characters and some minority
languages. At the same time, the GB/T 13000 code position corresponding to this
character is also given. The fonts of other parts are omitted.
C.2 Description
Annex E
(informative)
Code positions of Chinese characters in "General Standard Chinese Character List"
This document contains all 8105 standard Chinese characters in the "General Standard Chinese Character List"
approved and released by the State Council of the People's Republic of China in 2013.Their code positions and font
styles are shown in Table E.1.The arrangement order of the Chinese characters in the table is consistent with the order of
the standard Chinese characters in the "General Standard Chinese Character List". The Chinese characters with serial
number 1~3500 are first-level Chinese characters. The Chinese characters with serial numbers 3501~6500 are second-
level Chinese characters. The Chinese characters with serial numbers 6501~8105 are third-level Chinese characters.
...... Source: Above contents are excerpted from the full-copy PDF -- translated/reviewed by: www.ChineseStandard.net / Wayne Zheng et al.
Tips & Frequently Asked QuestionsQuestion 1: How long will the true-PDF of English version of GB 18030-2022 be delivered?Answer: The full copy PDF of English version of GB 18030-2022 can be downloaded in 9 seconds, and it will also be emailed to you in 9 seconds (double mechanisms to ensure the delivery reliably), with PDF-invoice. Question 2: Can I share the purchased PDF of GB 18030-2022_English with my colleagues?Answer: Yes. The purchased PDF of GB 18030-2022_English will be deemed to be sold to your employer/organization who actually paid for it, including your colleagues and your employer's intranet. Question 3: Does the price include tax/VAT?Answer: Yes. Our tax invoice, downloaded/delivered in 9 seconds, includes all tax/VAT and complies with 100+ countries' tax regulations (tax exempted in 100+ countries) -- See Avoidance of Double Taxation Agreements (DTAs): List of DTAs signed between Singapore and 100+ countriesQuestion 4: Do you accept my currency other than USD?Answer: Yes. www.ChineseStandard.us -- GB 18030-2022 -- Click this link and select your country/currency to pay, the exact amount in your currency will be printed on the invoice. Full PDF will also be downloaded/emailed in 9 seconds. Question 5: Should I purchase the latest version GB 18030-2022?Answer: Yes. Unless special scenarios such as technical constraints or academic study, you should always prioritize to purchase the latest version GB 18030-2022 even if the enforcement date is in future. Complying with the latest version means that, by default, it also complies with all the earlier versions, technically.
How to buy and download a true PDF of English version of GB 18030-2022?A step-by-step guide to download PDF of GB 18030-2022_EnglishStep 1: Visit website https://www.ChineseStandard.net (Pay in USD), or https://www.ChineseStandard.us (Pay in any currencies such as Euro, KRW, JPY, AUD). Step 2: Search keyword "GB 18030-2022". Step 3: Click "Add to Cart". If multiple PDFs are required, repeat steps 2 and 3 to add up to 12 PDFs to cart. Step 4: Select payment option (Via payment agents Stripe or PayPal). Step 5: Customize Tax Invoice -- Fill up your email etc. Step 6: Click "Checkout". Step 7: Make payment by credit card, PayPal, Google Pay etc. After the payment is completed and in 9 seconds, you will receive 2 emails attached with the purchased PDFs and PDF-invoice, respectively. Step 8: Optional -- Go to download PDF. Step 9: Optional -- Click Open/Download PDF to download PDFs and invoice. See screenshots for above steps: Steps 1~3 Steps 4~6 Step 7 Step 8 Step 9
|