|
US$319.00 · In stock Delivery: <= 3 days. True-PDF full-copy in English will be manually translated and delivered via email. CYT101.4-2014: Specification of content resource processing in press and publication. Part 4: Data processing quality Status: Valid
| Standard ID | Contents [version] | USD | STEP2 | [PDF] delivered in | Standard Title (Description) | Status | PDF |
| CY/T 101.4-2014 | English | 319 |
Add to Cart
|
3 days [Need to translate]
|
Specification of content resource processing in press and publication. Part 4: Data processing quality
| Valid |
CY/T 101.4-2014
|
PDF similar to CYT101.4-2014
Basic data | Standard ID | CY/T 101.4-2014 (CY/T101.4-2014) | | Description (Translated English) | Specification of content resource processing in press and publication. Part 4: Data processing quality | | Sector / Industry | Press & Publication Industry Standard (Recommended) | | Classification of Chinese Standard | L70 | | Classification of International Standard | 35.240.30 | | Word Count Estimation | 12,163 | | Date of Issue | 1/29/2014 | | Date of Implementation | 1/29/2014 | | Quoted Standard | GB/T 16159; GB/T 5271.14 | | Regulation (derived from) | News-Broadcasting-Press-Letter 2014 [11] | | Issuing agency(ies) | State Press and Publication Administration | | Summary | This standard specifies the data quality structure and requirements, quality accuracy assessment and quality inspection report of digital publishing of news publishing content resources. This standard is applicable to the digital processing of news publishing content resources and the production and application of electronic resources. |
CYT101.4-2014: Specification of content resource processing in press and publication. Part 4: Data processing quality ---This is a DRAFT version for illustration, not a final translation. Full copy of true-PDF in English version (including equations, symbols, images, flow-chart, tables, and figures etc.) will be manually/carefully translated upon your order.
Specification of content resource processing in press and publication.Part 4.Data processing quality
News and publication content resource processing specifications
Part 4.Data Processing Quality
Released on.2014-01-29
2014-01-29 Implementation
Press and Publication Industry Standards of the People's Republic of China
release
State Administration of Press, Publication, Radio, Film and Television of the People's Republic of China
Foreword I
1 Scope 1
2 Normative references 1
3 Terms and definitions 1
4 Data quality structure and requirements 2
4.1 Data Quality Structure 2
4.2 The basic process of quality assessment 2
4.3 Completeness 2
4.4 Normative 3
4.5 Effectiveness 3
4.6 Accuracy 3
5 Quality accuracy assessment 3
5.1 Quality accuracy assessment principle 3
5.2 Evaluation method 3
5.3 Error rate calculation 3
5.4 Testing and sampling scope 3
5.5 Quality accuracy requirements and error statistical methods 3
6 Quality accuracy test report 5
Appendix A (informative appendix) Sample 6 of Quality Accuracy Test Report
Reference 7
Table of contents
CY/T 101 "Regulations for the Processing of News and Publication Content Resources" is divided into the following 10 parts.
──Part 1.Processing technical terms;
──Part 4.Data processing quality;
──Part 5.Data Management;
──Part 6.Data Management;
──Part 7.Data Delivery;
──Part 8.Book processing;
──Part 9.Newspaper Processing;
──Part 10.Periodical Processing.
This part is part 4 of CY/T 101.
Appendix A of this section is an informative appendix.
This part was proposed by the Science and Technology Department of the State Administration of Press, Publication, Radio, Film and Television of the People's Republic of China.
This part is under the jurisdiction of the National Press and Publication Information Standardization Technical Committee.
Drafting organizations of this section. Founder International Software Co., Ltd., Beijing Tuoba Excellent Information Technology Research Institute, Information Center of the General Administration of Press and Publication.
The main drafters of this section. Zhao Haitao, Zhou Changling, An Xiumin, Liu Chengyong, Cai Jingsheng, Zhou Weiguo, Wu Zhiqiang, Zhang Mo.
Foreword
──Part 2.Data processing and application mode;
──Part 3.Data processing specifications;
11 Scope
This part of CY/T 101 specifies the data quality structure and requirements for the digital processing of news and publication content resources, and the assessment of quality accuracy
And quality inspection reports.
This section applies to the digital processing of news and publication content resources and the production and application of electronic resources.
2 Normative references
The following documents are indispensable for the application of this document. For dated reference documents, only the dated version applies to this document.
For undated reference documents, the latest version (including all amendments) is applicable to this document.
GB/T 16159 Basic Rules of Chinese Phonetic Orthography
GB/T 5271.14 Information Technology Vocabulary Part 14.Reliability, Maintainability and Availability
3 Terms and definitions
The terms and definitions defined in GB/T 5271.14 and CY/T 101.1-2014 and the following terms and definitions apply to this document.
3.1
Materials
A general term for printed materials, archived films, or original typesetting data of news and publications.
[CY/T 101.1-2014, 4.1.2]
3.2
Finished data
All data processing procedures have been completed and meet the pre-set specifications and quality requirements, and the final data form delivered can be realized.
[CY/T 101.1-2014, 7.1.1]
3.3
Error
The difference between a calculated, observed or measured value or condition, a prescribed or theoretically correct value or condition.
[GB/T 5271.14-2008, 14.01.08]
3.4
Fixed-layout document
Layout document
A file that is generated after typesetting and contains all the data needed for solidification and presentation of the layout.
[CY/T 101.1-2014, 6.3.10]
3.5
Reflowing document
Streaming document
According to the logical sequence of the content, the content presents a file that can adapt to changes in the screen or window of the terminal device.
[CY/T 101.1-2014, 6.3.11]
3.6
Imaged fixed-layout document imaged fixed-layout document
News Publication Content Resource Processing Specification Part 4.Data Processing Quality
Image layout file
Through the scanning method, a collection of image files that are completely consistent with the original processing object layout is generated, and packaged into an independent and complete browsable
Digital layout file (including bookmark information and the link relationship between bookmark information and page of the layout file).
[CY/T 101.1-2014, 6.3.12]
3.7
Vectorized dual-layer fixed-layout document
Double Layout Document
On the basis of a single-layer image format file, a text layer of transparent font mode corresponding to the image layer is generated at the same time, which can support selection and copying.
Beihe finds the layout file.
[CY/T 101.1-2014, 6.3.13]
3.8
Vectorized fixed-layout document
Vector typography file
According to the text position of the original processing object, the text adopts vector characters, decorative pictures, artistic characters, shading, lines, charts and formulas
Layout files displayed in image format.
[CY/T 101.1-2014, 6.3.14]
4 Data quality structure and requirements
4.1 Data quality structure
Data quality should include the completeness, standardization, validity and accuracy of the data, as shown in Figure 1.
Figure 1 Data quality structure
4.2 The basic process of quality assessment
Assess the completeness, standardization and validity of the finished product data. After all these three aspects meet the quality requirements, then assess the finished product data.
accuracy.
4.3 Completeness
4.3.1 Type complete
The type of finished product data should be consistent with the requirements of the data processing target, and no omissions and errors are allowed.
4.3.2 Complete content
The content range and quantity of the finished product data should be consistent with the requirements of the data processing goal, and errors such as omissions and disorder are not allowed.
4.3.3 Complete quality management documents
The complete product data quality management document should include.
a) Quality inspection plan;
b) Quality inspection report.
Data quality
Integrity, Normative, Effectiveness and Accuracy
34.4 Normative
4.4.1 Data format
The data format of the finished product data should be consistent with the data processing requirements, usually the following format.
a) Use lossless compression TIFF format for long-term preservation of images;
b) Generally publish application images using JPEG format;
c) The content structured document adopts XML1.0 and above, and the structured specification description file adopts XSD1.0 and above;
d) The layout document adopts PDF and other formats;
e) Streaming documents use formats such as Epub.
4.4.2 Data file naming
The naming of the finished product data should be consistent with the data processing requirements, and the naming method is composed of unique ID information and data type category information.
4.4.3 Data storage
The storage of finished product data should be consistent with the requirements of data processing. Usually, the basic unit of the processing object is the storage folder.
Classified storage of all kinds of finished product data of this processing object. Books are stored in books, and newspapers and periodicals are stored on schedule.
4.5 Effectiveness
The data of the finished product should be able to be read through the relevant software and system, and no errors such as data damage, abnormal error reporting, failure to open, etc. are allowed. read
The output data should be complete, and unusable errors such as coding confusion and image distortion are not allowed.
4.6 Accuracy
The quality and accuracy of the finished product data should be consistent with the data processing requirements, including.
a) The accuracy of the text;
b) Image accuracy;
c) Accuracy of content structure;
d) Accuracy of layout documents;
e) Streaming file accuracy.
5 Quality accuracy assessment
5.1 Quality accuracy assessment principles
5.1.1 Basic principles
The basis for determining data quality should be established on the basis of the data used in data processing, that is, errors, omissions, and smoothness in the original data.
Quality problems such as sequence reversal are not corrected during data processing, and are not counted as data processing quality errors.
5.2 Evaluation method
Data processing requirements should specify the error rate indicators of different types of finished product data, and adopt sampling testing and other methods for quality accuracy
assessment. If the actual error rate is not higher than the error rate index of the finished product data, it is deemed to meet the quality accuracy target, and vice versa.
5.3 Error rate calculation
The standard unit of testing is generally based on the value of one thousand, ten thousand, and one hundred thousand. The error rate calculation formula is.
Error rate = number of errors in the detection standard unit/detection standard unit
5.4 Testing sampling scope
The sampling test range should not be less than 20 times of the test standard unit.
5.5 Quality accuracy requirements and error statistical methods
5.5.1 Text accuracy
5.5.1.1 Text accuracy requirements
Adopt content index level, full text standard level, format reconstruction level (double-layer format files processed by full text basic level are not included), version
The text quality evaluation standard unit of the finished product data containing text content generated by the processing method such as type complex level is 10,000 characters.
The quantity should comply with relevant publication quality management regulations.
5.5.1.2 Statistical methods for text errors
Error rate statistics methods include.
a) The calculation method for text errors is as follows.
1) Back cover, copyright page, text, table of contents, publication instructions (or legends), preface (or preface), postscript (or epilogue), notes, index,
General typos, extra characters, multiple characters, missing characters, and inverted characters in charts, appendices, references, etc., each place is counted as 1 error;
2) If the same typo appears repeatedly, each page is counted as 1 error, and the whole book is counted as 4 errors at most. Too many, missing 1 to 5 words,
Each place is counted as 1 error, and more than 5 words are counted as 4 errors;
3) The text errors on the front cover page are counted as 2 errors in each place; if the relevant text is inconsistent, one error will be counted as 1 error; foreign language,
Minority scripts and international phonetic symbols are based on words. No matter how many errors occur, they are counted as 1 error.
4) If the Chinese pinyin does not meet the relevant regulations, a corresponding Chinese character or phrase is used as the unit, and each place is counted as 1 error;
5) Simplified characters and traditional characters are mixed, and each place is counted as 0.5 errors; if the same error is more than 3 in the whole book, it is counted as 1.5 errors;
Errors are not counted if the content itself is needed or the original paper book is mixed with simplified and traditional.
b) The calculation method of punctuation marks and other symbols is as follows.
1) The general misuse, omission, and multiple use of punctuation marks are counted as 0.1 errors;
2) If the decimal point is mistaken as a midpoint, or the midpoint is mistaken as a decimal point, and the colon is mistaken as a ratio sign, or the ratio sign is mistaken as a colon, every
Every place is counted as 0.1 errors;
3) The dash is mistaken for a word line or a half word line, and each place is counted as 0.1 error. If the punctuation marks are mistaken at the beginning or end of the line, press 0.1 for each place
Error count;
4) Errors in the symbols of legal units of measurement, scientific symbols in various disciplines of science and technology, musical score symbols, etc., each place is counted as 0.5 errors;
The same error will not be double-counted in the same area, and the book will count up to 1.5 errors.
c) The text errors in the same position are in different finished data such as metadata sets, content structured data, layout files and streaming files
Repeated occurrences in, uniformly count as 1 error.
5.5.2 Image accuracy
5.5.2.1 Image accuracy requirements
The standard unit of image quality evaluation is 1000 pages, and the image error rate is required to be less than one thousandth.
5.5.2.2 Statistical method of image error
Taking the page as the basic detection unit, any one or several errors on the page will be counted as 1 error. The types of errors include.
a) Specification errors such as file format error, image resolution error, color mode error, compression algorithm error, etc.;
b) The image size is inconsistent with the original version;
c) The color is distorted, the image is too thick or too light;
d) The horizontal tilt is greater than 0.5 degrees;
e) The file is damaged.
5.5.3 Accuracy of content structure
5.5.3.1 Content structured accuracy requirements
The standard unit of content structured quality assessment is 10,000 characters, and the content structured error rate is required to be less than three ten thousandths.
5.5.3.2 Content structured error statistics method
Content unindexed, indexed error, structured name error, structured level error, etc. are counted as 1 error;
5.5.4 Accuracy of Association Relationship
5.5.4.1 Accuracy requirements of association relationship
The quality evaluation standard unit of the association relationship is 1000 link points, and the error rate is required to be less than three thousandths.
5.5.4.2 Statistical methods of association errors
Missing or pointing errors in an association relationship are errors. An association relationship error is counted as 1 error. The association relationship includes.
5a) The link relationship between the table of contents and the chapters of the text;
b) The hierarchical relationship of the table of contents;
c) The reference point of the footnote and the reference relationship of the footnote;
d) The relationship between the reference point of the illustration and the reference of the illustration;
e) The relationship between the reference point of the table and the reference of the table;
f) The citation relationship between the reference point and the reference;
g) Image citation...
Tips & Frequently Asked Questions:Question 1: How long will the true-PDF of CYT101.4-2014_English be delivered?Answer: Upon your order, we will start to translate CYT101.4-2014_English as soon as possible, and keep you informed of the progress. The lead time is typically 1 ~ 3 working days. The lengthier the document the longer the lead time. Question 2: Can I share the purchased PDF of CYT101.4-2014_English with my colleagues?Answer: Yes. The purchased PDF of CYT101.4-2014_English will be deemed to be sold to your employer/organization who actually pays for it, including your colleagues and your employer's intranet. Question 3: Does the price include tax/VAT?Answer: Yes. Our tax invoice, downloaded/delivered in 9 seconds, includes all tax/VAT and complies with 100+ countries' tax regulations (tax exempted in 100+ countries) -- See Avoidance of Double Taxation Agreements (DTAs): List of DTAs signed between Singapore and 100+ countriesQuestion 4: Do you accept my currency other than USD?Answer: Yes. If you need your currency to be printed on the invoice, please write an email to [email protected]. In 2 working-hours, we will create a special link for you to pay in any currencies. Otherwise, follow the normal steps: Add to Cart -- Checkout -- Select your currency to pay.
|