GB/T 42755-2023 Artificial intelligence - Code of practice for data labeling of machine learning
1 Scope
This document specifies the process framework for data labeling for machine learning in the field of artificial intelligence.
This document applies to data labeling for machine learning in the field of artificial intelligence and to related research, development and application.
2 Normative references
The following documents contain requirements which, through reference in this text, constitute provisions of this document. For dated references, only the edition cited applies. For undated references, the latest edition (including any amendments) applies.
GB/T 35274-2017 Information security technology - Security capability requirements for big data services
GB/T 37973-2019 Information security technology - Big data security management guide
3 Terms and definitions
For the purposes of this document, the following terms and definitions apply.
3.1
data labeling
process of assigning target variables and values to data samples
3.2
labeling task
activity of labeling data according to the labeling task description
3.3
data labeler
person or organization that undertakes the data labeling task
3.4
data user
person or organization that puts forward data labeling requirements
3.5
data labeling administrator
person or organization that manages the evaluation, distribution, delivery, acceptance and quality control of the data labeling task
3.6
labeling tool
all process-related tools, including those used by the data labeler to perform data labeling, by the data labeling administrator to manage data labeling, and by the data user to accept data labeling
3.7
labeling task description
written expression used by the data user to clearly indicate the labeling task to the data labeling administrator and data labeler
Note: The labeling task description usually includes the description of the labeling task to be performed, labeling method, positive and negative examples, acceptance method and acceptance index, etc.
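The following sketch is an illustration only and is not part of the standard. It shows one way the items listed in the Note to 3.7, and the definition of data labeling in 3.1, might be represented as records. All class names, field names and example values are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class LabelingTaskDescription:
    """Hypothetical record mirroring the items listed in the Note to 3.7."""
    task: str                                   # labeling task to be performed
    method: str                                 # labeling method
    positive_examples: List[str] = field(default_factory=list)
    negative_examples: List[str] = field(default_factory=list)
    acceptance_method: str = ""
    acceptance_index: str = ""

    def missing_items(self) -> List[str]:
        """Return the names of the items that are still empty."""
        return [name for name, value in vars(self).items() if not value]


@dataclass
class LabeledSample:
    """Data labeling per 3.1: a target variable and value assigned to a data sample."""
    sample: str
    target_variable: str
    value: str


# Example values below are invented for illustration.
description = LabelingTaskDescription(
    task="Mark whether each image contains a cat",
    method="single-choice classification",
    positive_examples=["cat_001.jpg"],
)
print(description.missing_items())
# prints ['negative_examples', 'acceptance_method', 'acceptance_index']

labeled = LabeledSample(sample="cat_001.jpg", target_variable="contains_cat", value="yes")
```

A check such as `missing_items` is one possible way for a data labeling administrator to confirm that a description is complete before a task is distributed; the standard itself does not prescribe such a check.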
4 Data labeling process
Data labeling involves the data user, data labeling administrator and data labeler. The main process includes three stages: labeling task preparation, labeling task performing and labeling result output. See Figure 1 for the data labeling process.
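As a non-normative illustration, the three stages can be sketched as an ordered enumeration. The items under each stage are taken from the headings of clauses 5.1 to 7.3; the names `Stage`, `STAGE_ITEMS` and `next_stage` are hypothetical, and the strictly linear ordering is an assumption made for this sketch.

```python
from enum import Enum
from typing import Optional


class Stage(Enum):
    PREPARATION = "labeling task preparation"
    PERFORMING = "labeling task performing"
    OUTPUT = "labeling result output"


# Items per stage, taken from the headings of clauses 5.1 to 7.3.
STAGE_ITEMS = {
    Stage.PREPARATION: ["labeling task", "labeling personnel", "labeling environment"],
    Stage.PERFORMING: ["process control", "quality assurance", "management mechanism"],
    Stage.OUTPUT: ["internal quality inspection", "data delivery", "post maintenance"],
}


def next_stage(current: Stage) -> Optional[Stage]:
    """Return the stage that follows `current`, or None after the last stage."""
    order = list(Stage)  # Enum members iterate in definition order
    position = order.index(current)
    return order[position + 1] if position + 1 < len(order) else None


for stage in Stage:
    print(stage.value, "->", ", ".join(STAGE_ITEMS[stage]))
```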
Figure 1 Data labeling process framework
Contents of GB/T 42755-2023
Foreword
1 Scope
2 Normative references
3 Terms and definitions
4 Data labeling process
5 Labeling task preparation
5.1 Labeling task
5.2 Labeling personnel
5.3 Labeling environment
6 Labeling task performing
6.1 Process control
6.2 Quality assurance
6.3 Management mechanism
7 Labeling result output
7.1 Internal quality inspection
7.2 Data delivery
7.3 Post maintenance