ZYF001-2018: General specifications for corpora
Status: Valid
Basic data
| Field | Value |
| --- | --- |
| Standard ID | ZYF 001-2018 (ZYF001-2018) |
| Description (Translated English) | General specifications for corpora |
| Sector / Industry | Chinese Industry Standard |
| Classification of Chinese Standard | A14 |
| Classification of International Standard | 01.040 |
| Word Count Estimation | 18,145 |
| Date of Issue | 2018 |
| Date of Implementation | 2019-01-01 |
This is a draft version for illustration, not a final translation.
ZYF001-2018
General specifications for corpora
Corpus General Technical Specification
China Language Service Industry Standard
Specifications for the Language Service Industry in China
Released 2018-11-19
Implemented 2019-01-01
Issued by: Translators Association of China
Contents
Foreword
Introduction
1 Scope
2 Normative references
3 Terms and definitions
4 Construction and processing
4.1 Corpus construction process
4.2 Corpus design
4.3 Corpus collection
4.4 Corpus preprocessing
4.5 Corpus annotation
4.6 Corpus alignment
4.7 Corpus generation
5 Management and maintenance
6 Trading and sharing
6.1 Classification of corpus texts
6.2 Classification of corpora
6.3 Corpus description
6.4 Corpus evaluation
6.5 Corpus price and delivery
Appendix A (Informative) Reference evaluation standards for bilingual parallel corpora
References
Foreword
This specification was drafted in accordance with the rules given in GB/T 1.1-2009.
This specification was proposed by, and is under the jurisdiction of, the Translators Association of China.
This specification was drafted by: Shanghai Jiao Tong University, Peking University, Beijing Foreign Studies University, Chinese Academy of Foreign Languages, Tongji University Science, Southeast University, Nanjing Normal University, Zhejiang University, Zhejiang University of Finance and Economics, Chinese Academy of Social Sciences, Chinese Institute of Standardization, Chinese Translation Yutong Technology Co., Ltd., Alibaba (China) Network Technology Co., Ltd., Huawei Technologies Co., Ltd., Express Network Technology Co., Ltd., Beijing Zhongyi Tiankai Education Service Co., Ltd., Chengdu Youyi Information Technology Co., Ltd., Hangzhou Chinese Language Technology Co., Ltd., Suzhou Lianyue Technology Co., Ltd., Beijing Sibi Rui Translation Co., Ltd.
The main drafters of this specification: Hu Kaibao, Yang Ping, Zhang Xuetao, Luo Huifang, Chen Shengquan, Wu Yongbo, Xie Ning, Peng Chengchao, Xu Wensheng, Li Aijun, Liang Hongli, Wang Haitao, Wang Haibo, Li Jie, Pan Yicen, He Zhengyu, Liu Siyuan, Cai Fangren, Yu Jingsong, Gao Zhijun, Zhang Wei, Cheng Le, Yan Zhijun, Li Changbao, Yi Mianzhu, Bi Yude, Guo Qing, Guan Xinchao, Tian Xujun, Li Chan, Li Xiaoqian, Hu Ang, Shan Weiqing, Ren Caiqi.
Introduction
The Translators Association of China is a national organization for the language service industry, covering translation and localization services, language teaching and training, language technology tool development, and language-related consulting services. Formulating language service standards and promoting the orderly and healthy development of the industry is one of the association's tasks.
As China's inbound foreign investment and outbound direct investment enter a new stage, and as the national "Belt and Road" initiative and the economic and cultural "going global" strategy are further implemented, language services, especially translation, have become both a key point and a bottleneck in carrying out these policies. Demand for language services is growing rapidly, machine translation based on neural network artificial intelligence technology is developing quickly, and an innovative language service model that combines humans and machines is quietly emerging. The research, construction, trading, and sharing of corpora are key to this model.
Establishing general technical specifications for corpora that can serve academic research as well as language services and machine translation serves the following purposes:
(1) Responding to national policies such as the "Belt and Road" initiative and economic and cultural "going global"; serving major vertical fields such as national communication, foreign publicity, culture, economy and trade, and security; and promoting the building of national discourse power and language soft power.
(2) Formulating and implementing corpus industry standards, on the basis of which an independent and controllable Chinese corpus sharing and trading platform can be established.
(3) Advocating "safe corpus big data" and, on the premise of fully respecting copyright, using Internet thinking and artificial intelligence to help transform the language service model.
Corpus General Technical Specification
1 Scope
This specification describes and specifies the construction and processing, management and maintenance, and trading and sharing of corpora. Further specifications on corpora will be released subsequently as a series.
By implementing this specification, a corpus provider can demonstrate whether its corpus conforms to the specification and meets basic usage requirements; corpus users can determine whether a corpus is suitable for a specific purpose.
2 Normative references
The following documents are essential for the application of this document. For dated references, only the edition cited applies to this document. For undated references, the latest edition (including all amendments) applies to this document.
GB/T 19682-2005 Target text quality requirements for translation services
ISO 639 Codes for the representation of names of languages
ISO 3166 Codes for the representation of names of countries and their subdivisions
3 Terms and definitions
The following terms and definitions apply to this document.
3.1
Corpus text
Samples of language material or of language in use
3.2
Corpus
Electronic database composed of naturally occurring corpus texts (3.1) collected according to a certain sampling method
Note: A corpus is a data collection that is selected and arranged in an orderly manner according to a certain purpose and method.
3.3
Token
All the word forms in the corpus (3.2)
Example: go, goes, went, and going are 4 English word forms.
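As a rough illustration of the definition in 3.3, the sketch below counts word forms in a short English text. The regular-expression tokenizer and the function name `count_tokens` are assumptions made for this example only; the specification does not prescribe a tokenization method, and real corpora usually need a language-specific tokenizer.

```python
import re

# Assumption: a token is a maximal run of ASCII letters, optionally joined
# by internal apostrophes or hyphens (e.g. "don't", "well-known").
TOKEN_PATTERN = re.compile(r"[A-Za-z]+(?:['-][A-Za-z]+)*")


def count_tokens(text: str) -> int:
    """Return the number of running word forms (tokens) found in text."""
    return len(TOKEN_PATTERN.findall(text))


sample = "go, goes, went, going"
print(count_tokens(sample))  # prints 4, matching the example in 3.3
```

Every occurrence is counted, so a word form that appears twice contributes two tokens.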
3.4
Sentence pair
A complete sentence together with the translation corresponding to its content
Note: A sentence pair can be n-to-n, where n is a non-negative integer.
3.5
Storage size
The capacity of the corpus (3.2), that is, the size of the corpus
Note: Corpora (3.2) for academic research usually measure size by the total number of tokens (3.3); corpora (3.2) for artificial intelligence and language services usually measure size in sentence pairs (3.4).
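The two units of measurement in the note above can be sketched as follows. The `SentencePair` class, the two helper functions, and the whitespace tokenization are hypothetical choices for this example; the specification does not define a data model or a counting procedure.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SentencePair:
    """Hypothetical container for one aligned unit (see 3.4)."""
    source: List[str]  # source-language sentence(s)
    target: List[str]  # corresponding translated sentence(s)


def size_in_sentence_pairs(pairs: List[SentencePair]) -> int:
    """Size as typically counted for AI and language-service corpora."""
    return len(pairs)


def size_in_tokens(pairs: List[SentencePair]) -> int:
    """Size as typically counted for academic research corpora.

    Assumption: tokens are whitespace-separated strings on both sides.
    """
    return sum(
        len(sentence.split())
        for pair in pairs
        for sentence in pair.source + pair.target
    )


corpus = [
    SentencePair(["He went home."], ["Il est rentré."]),
    SentencePair(["Go."], ["Va."]),
]
print(size_in_sentence_pairs(corpus))  # prints 2
print(size_in_tokens(corpus))          # prints 8
```

Holding lists on both sides lets one pair carry several sentences, which is one way to represent the n-to-n case mentioned in 3.4.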