Commit a81c4d2

Merge pull request #23078 from Juliana-Morais/master
updated images and steps
2 parents 1af6f27 + f3279f7 commit a81c4d2

9 files changed

+49
-49
lines changed

tutorials/cp-aibus-dox-ui-schema-custom/cp-aibus-dox-ui-schema-custom.md

Lines changed: 9 additions & 9 deletions
Original file line number | Diff line number | Diff line change
@@ -7,20 +7,20 @@ primary_tag: topic>machine-learning
77
---
88

99
# Create Custom Schema for Custom Documents
10-
<!-- description --> Create a custom schema for custom documents (that are not supported out of the box) to extract information from similar documents using the Document Information Extraction service.
10+
<!-- description --> Create a custom schema for custom documents (which are not supported out of the box) to extract information from similar documents using the Document Information Extraction service.
1111

1212
## You will learn
1313
- How to create a custom schema for custom documents
1414
- How to add standard and custom data fields for the header information of custom documents
1515

1616
## Intro
17-
The core functionality of Document Information Extraction is to automatically extract structured information from documents using machine learning. The service supports extraction from the following standard document types out of the box: invoices, payment advices and purchase orders.
17+
The core functionality of Document Information Extraction is to automatically extract structured information from documents using machine learning. The service supports extraction from the following standard document types out of the box: invoices, payment advices, and purchase orders.
1818

1919
You can also use the [Schema Configuration](https://help.sap.com/viewer/5fa7265b9ff64d73bac7cec61ee55ae6/SHIP/en-US/3c7862e30fc2488ea95f58f1d77e424e.html) and [Template](https://help.sap.com/viewer/5fa7265b9ff64d73bac7cec61ee55ae6/SHIP/en-US/1eeb08998f49409681c06a01febc3172.html) features to extract information from custom documents that are different from the standard document types. You can customize the information extracted from custom document types by creating a custom schema and adding the specific information that you have in your documents.
2020

2121
In this tutorial, we'll use power of attorney documents as an example of a custom document type that is not supported by Document Information Extraction out of the box. A power of attorney document is a legal instrument authorizing one to act as the attorney or agent for another person in specified or all legal or financial matters.
2222

23-
If you are new to the Document Information Extraction UI, try out first the tutorial: [Use Machine Learning to Extract Information from Documents with Document Information Extraction UI](cp-aibus-dox-ui).
23+
If you are new to the Document Information Extraction UI, first try out the tutorial: [Use Machine Learning to Extract Information from Documents with Document Information Extraction UI](cp-aibus-dox-ui).
2424

2525
---
2626

@@ -30,7 +30,7 @@ If you are new to the Document Information Extraction UI, try out first the tuto
3030
1. Open the Document Information Extraction UI, as described in the tutorial: [Use Trial to Set Up Account for Document Information Extraction and Go to Application](cp-aibus-dox-booster-app) or [Use Free Tier to Set Up Account for Document Information Extraction and Go to Application](cp-aibus-dox-free-booster-app).
3131

3232

33-
>If you **HAVE NOT** just used the **Set up account for Document Information Extraction** booster to create a service instance for Document Information Extraction, and subscribe to the Document Information Extraction UI, observe the following:
33+
>If you **HAVE NOT** just used the **Set up account for Document Information Extraction** booster to create a service instance for Document Information Extraction and subscribe to the Document Information Extraction UI, observe the following:
3434
3535
>- To access the [Schema Configuration](https://help.sap.com/viewer/5fa7265b9ff64d73bac7cec61ee55ae6/SHIP/en-US/3c7862e30fc2488ea95f58f1d77e424e.html) and [Template](https://help.sap.com/viewer/5fa7265b9ff64d73bac7cec61ee55ae6/SHIP/en-US/1eeb08998f49409681c06a01febc3172.html) features, ensure that you use the `blocks_of_100` plan to create the service instance for Document Information Extraction Trial.
3636
@@ -51,7 +51,7 @@ If you are new to the Document Information Extraction UI, try out first the tuto
5151

5252
<!-- border -->![Access Schema Configuration](access-schema-configuration.png)
5353

54-
Here, you find the SAP schemas. The Document Information Extraction UI includes preconfigured SAP schemas for the following standard document types: purchase order, payment advice, and invoice. In addition, there’s an SAP schema for custom documents (`SAP_OCROnly_schema`). You can't delete SAP schemas. You can use them as they're, you can edit them directly, or create copies and adapt the list of fields according to your needs.
54+
Here, you find the SAP schemas. The Document Information Extraction UI includes preconfigured SAP schemas for the following standard document types: purchase order, payment advice, and invoice. In addition, there’s an SAP schema for custom documents (`SAP_OCROnly_schema`). You can't delete SAP schemas. You can use them as they are, you can edit them directly, or create copies and adapt the list of fields according to your needs.
5555

5656
<!-- border -->![Access Schema Configuration](sap-schemas.png)
5757

@@ -98,13 +98,13 @@ As your first header field, add the shipper number of your power of attorney doc
9898

9999
1. Enter an appropriate name for your field, `shipperNumber`, for example.
100100

101-
2. Select `string` for the `Data Type`. Note that a shipper number is a `string`, even though it consists of numbers, as it is an arbitrary combination of numbers without meaning. In contrast, price is an example for the data type `number`.
101+
2. Select `string` for the `Data Type`. Note that a shipper number is a `string`, even though it consists of numbers, as it is an arbitrary combination of numbers without meaning. In contrast, price is an example of the data type `number`.
102102

103-
3. Click **Add** to create the header field.
103+
3. Select `default` for the `Setup Type` and click **Add** to create the header field.
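
Step 2's distinction between `string` and `number` can be illustrated with a minimal Python sketch. The values below are hypothetical and are not taken from the tutorial's sample documents; the point is only that an identifier loses information when treated as a number, while a price is something you calculate with.

```python
from decimal import Decimal

# Hypothetical example values (assumptions for illustration only).
shipper_number_text = "004217"  # identifier: an arbitrary digit sequence
price_text = "19.90"            # quantity: a value used in arithmetic

# Treating the identifier as a number silently drops the leading zeros,
# so converting back does not reproduce the original text.
as_number = int(shipper_number_text)
round_trip = str(as_number)

# A price, in contrast, is meaningful as a number.
price = Decimal(price_text)
total_for_three = price * 3

print(round_trip)
print(total_for_three)
```

This is why identifiers such as shipper numbers are usually best kept as `string` fields, even when every character is a digit.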
104104

105105
<!-- border -->![Create Number](add-number.png)
106106

107-
The field now displays in your list of header fields where you find all the information again that you have just entered. You can edit or delete the field by clicking the respective icons on the right.
107+
The field now displays in your list of header fields, where you again find all the information that you have just entered. You can edit or delete the field by clicking the respective icons on the right.
108108

109109
<!-- border -->![View Number](added-number.png)
110110

@@ -114,7 +114,7 @@ Click **Add** again to open the `Add Data Field` dialog.
114114

115115
2. Select `string` for the `Data Type`.
116116

117-
3. Click **Add** to create the field.
117+
3. Select `default` for the `Setup Type` and click **Add** to create the field.
118118

119119
<!-- border -->![Create Name](add-name.png)
120120


tutorials/cp-aibus-dox-ui-schema/cp-aibus-dox-ui-schema.md

Lines changed: 15 additions & 15 deletions
Original file line number | Diff line number | Diff line change
@@ -16,9 +16,9 @@ author_profile: https://github.com/Juliana-Morais
1616
- How to add standard and custom data fields for the header and line item information of purchase order documents
1717

1818
## Intro
19-
The core functionality of Document Information Extraction is to automatically extract structured information from documents using machine learning. The service supports extraction from the following standard document types out of the box: invoices, payment advices and purchase orders. You can customize the information extracted from these document types by creating a custom schema and adding the specific information that you have in your documents. Additionally, you can add completely new document types.
19+
The core functionality of Document Information Extraction is to automatically extract structured information from documents using machine learning. The service supports extraction from the following standard document types out of the box: invoices, payment advices, and purchase orders. You can customize the information extracted from these document types by creating a custom schema and adding the specific information that you have in your documents. Additionally, you can add completely new document types.
2020

21-
If you are new to the Document Information Extraction UI, try out first the tutorial: [Use Machine Learning to Extract Information from Documents with Document Information Extraction UI](cp-aibus-dox-ui).
21+
If you are new to the Document Information Extraction UI, first try out the tutorial: [Use Machine Learning to Extract Information from Documents with Document Information Extraction UI](cp-aibus-dox-ui).
2222

2323
---
2424

@@ -28,7 +28,7 @@ If you are new to the Document Information Extraction UI, try out first the tuto
2828
1. Open the Document Information Extraction UI, as described in the tutorial: [Use Trial to Set Up Account for Document Information Extraction and Go to Application](cp-aibus-dox-booster-app) or [Use Free Tier to Set Up Account for Document Information Extraction and Go to Application](cp-aibus-dox-free-booster-app).
2929

3030

31-
>If you **HAVE NOT** just used the **Set up account for Document Information Extraction** booster to create a service instance for Document Information Extraction, and subscribe to the Document Information Extraction UI, observe the following:
31+
>If you **HAVE NOT** just used the **Set up account for Document Information Extraction** booster to create a service instance for Document Information Extraction and subscribe to the Document Information Extraction UI, observe the following:
3232
3333
>- To access the [Schema Configuration](https://help.sap.com/viewer/5fa7265b9ff64d73bac7cec61ee55ae6/SHIP/en-US/3c7862e30fc2488ea95f58f1d77e424e.html) and [Template](https://help.sap.com/viewer/5fa7265b9ff64d73bac7cec61ee55ae6/SHIP/en-US/1eeb08998f49409681c06a01febc3172.html) features, ensure that you use the `blocks_of_100` plan to create the service instance for Document Information Extraction Trial.
3434
@@ -49,7 +49,7 @@ If you are new to the Document Information Extraction UI, try out first the tuto
4949

5050
<!-- border -->![Access Schema Configuration](access-schema-configuration.png)
5151

52-
Here, you find the SAP schemas. The Document Information Extraction UI includes preconfigured SAP schemas for the following standard document types: purchase order, payment advice, and invoice. In addition, there’s an SAP schema for custom documents (`SAP_OCROnly_schema`). You can't delete SAP schemas. You can use them as they're, you can edit them directly, or create copies and adapt the list of fields according to your needs.
52+
Here, you find the SAP schemas. The Document Information Extraction UI includes preconfigured SAP schemas for the following standard document types: purchase order, payment advice, and invoice. In addition, there’s an SAP schema for custom documents (`SAP_OCROnly_schema`). You can't delete SAP schemas. You can use them as they are, edit them directly, or create copies and adapt the list of fields according to your needs.
5353

5454
<!-- border -->![Access Schema Configuration](sap-schemas.png)
5555

@@ -87,11 +87,11 @@ Now, your schema shows up in the list. Access the schema by clicking on the row.
8787

8888
A schema defines a list of header fields and line item fields that represent the information you want to extract from a document.
8989

90-
Header fields represent information that are specific to your document and only occur one time. Those may include the document number, any sender information or the total amount of the order. In contrast, line item fields represent the products that you ordered where each line is one product, often with a certain quantity attached. Thus, the line item fields extract the information for each product in your order. Those may include the article number, the price and the quantity.
90+
Header fields represent information that is specific to your document and only occurs one time. This may include the document number, any sender information, or the total amount of the order. In contrast, line item fields represent the products that you ordered, where each line is one product, often with a certain quantity attached. Thus, the line item fields extract the information for each product in your order. Those may include the article number, the price and the quantity.
9191

92-
Document Information Extraction already contains an amount of fields it can extract. See [here](https://help.sap.com/viewer/5fa7265b9ff64d73bac7cec61ee55ae6/SHIP/en-US/b1c07d0c51b64580881d11b4acb6a6e6.html) which header fields are supported and [here](https://help.sap.com/viewer/5fa7265b9ff64d73bac7cec61ee55ae6/SHIP/en-US/ff3f5efe11c14744b2ce60b95d210486.html) which line item fields are supported. Additionally, you can define custom fields. In the next step, you'll learn about both.
92+
Document Information Extraction already includes a number of fields that it can extract. See [here](https://help.sap.com/viewer/5fa7265b9ff64d73bac7cec61ee55ae6/SHIP/en-US/b1c07d0c51b64580881d11b4acb6a6e6.html) which header fields are supported and [here](https://help.sap.com/viewer/5fa7265b9ff64d73bac7cec61ee55ae6/SHIP/en-US/ff3f5efe11c14744b2ce60b95d210486.html) which line item fields are supported. Additionally, you can define custom fields. In the next step, you'll learn about both.
9393

94-
The image below shows an example purchase order. All the fields that you define in your schema in this tutorial are highlighted. All information outside of the table that occur once are header fields. All information within the table occur per product and are line item fields. You can of course extend or reduce the information that you want to extract.
94+
The image below shows an example purchase order. All the fields that you define in your schema in this tutorial are highlighted. The header fields represent all information outside of the table that occurs once. The line item fields represent all information within the table, which occurs per product. You can, of course, extend or reduce the information that you want to extract.
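
The difference between header fields and line item fields can be sketched as a small data structure. This is an illustrative local model only, not the service's response format; the class name `ExtractedDocument` and all sample values are assumptions, and the line item keys simply mirror the article number, price, and quantity mentioned above.

```python
from dataclasses import dataclass, field
from decimal import Decimal
from typing import Dict, List


@dataclass
class ExtractedDocument:
    # Header fields occur once per document.
    header: Dict[str, str]
    # Line item fields repeat: one dictionary per product row.
    line_items: List[Dict[str, str]] = field(default_factory=list)


# Hypothetical sample values for illustration.
doc = ExtractedDocument(
    header={"purchaseOrderNumber": "PO-0001"},
    line_items=[
        {"articleNumber": "A-100", "price": "2.50", "quantity": "4"},
        {"articleNumber": "B-200", "price": "10.00", "quantity": "1"},
    ],
)


def line_total(item: Dict[str, str]) -> Decimal:
    """Price times quantity for a single line item."""
    return Decimal(item["price"]) * Decimal(item["quantity"])


order_total = sum((line_total(item) for item in doc.line_items), Decimal("0"))
print(len(doc.line_items), order_total)
```

Each header value appears once, while the list grows with the number of rows in the document's table.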
9595

9696
<!-- border -->![Purchase Order Example](purchase-order-example.png)
9797

@@ -105,21 +105,21 @@ To define your first header field, click **Add** to the right of the headline `H
105105

106106
<!-- border -->![Add Header Field](add-header-field.png)
107107

108-
For each field, you have to enter a name, a data type and optionally a default extractor and a description. The potential data types are `string`, `number`, `date`, `discount` and `currency`. To use one of the included standard fields of Document Information Extraction, select them for the default extractor.
108+
For each field, you have to enter a name, a data type, a setup type, and optionally a default extractor and a description. The potential data types are `string`, `number`, `date`, `discount` and `currency`. To use one of the included standard fields of Document Information Extraction, select them for the default extractor.
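
As a rough mental model of the dialog, the field definition described above can be written as a small Python data class. This is a sketch under stated assumptions: `SchemaField` and `ALLOWED_DATA_TYPES` are hypothetical names, the validation rules are inferred from the paragraph above, and none of this represents the service's API or storage format.

```python
from dataclasses import dataclass
from typing import Optional

# Data types listed in the tutorial text.
ALLOWED_DATA_TYPES = {"string", "number", "date", "discount", "currency"}


@dataclass(frozen=True)
class SchemaField:
    """Hypothetical local model of one entry in the Add Data Field dialog."""
    name: str
    data_type: str
    setup_type: str
    default_extractor: Optional[str] = None  # optional: a standard field
    description: Optional[str] = None        # optional

    def __post_init__(self) -> None:
        if not self.name:
            raise ValueError("name is required")
        if self.data_type not in ALLOWED_DATA_TYPES:
            raise ValueError(f"unsupported data type: {self.data_type!r}")
        if not self.setup_type:
            raise ValueError("setup type is required")


# A field backed by a standard extractor, as in the steps that follow.
purchase_order_number = SchemaField(
    name="purchaseOrderNumber",
    data_type="string",
    setup_type="default",
    default_extractor="documentNumber",
)

# A custom field: the name is illustrative; no default extractor is set.
custom_field = SchemaField(
    name="purchaseOrderStatus",
    data_type="string",
    setup_type="default",
)

print(purchase_order_number.default_extractor, custom_field.default_extractor)
```

A field with a default extractor reuses one of the included standard fields, while a field without one is a custom field.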
109109

110110
As your first header field, add the number of your purchase order which identifies your document.
111111

112112
1. Enter an appropriate name for your field, `purchaseOrderNumber`, for example.
113113

114-
2. Select `string` for the `Data Type`. Note that a document number is a `string`, even though it consists of numbers, as it is an arbitrary combination of numbers without meaning. In contrast, price is an example for the data type `number`.
114+
2. Select `string` for the `Data Type`. Note that a document number is a `string`, even though it consists of numbers, as it is an arbitrary combination of numbers without meaning. In contrast, price is an example of the data type `number`.
115115

116-
3. As all business documents have a unique identification, Document Information Extraction already includes a standard field. Select `documentNumber` for the `Default Extractor`.
116+
3. As all business documents have a unique identification, Document Information Extraction already includes a standard field. Select `default` for the `Setup Type` and then select `documentNumber` for the `Default Extractor`.
117117

118118
4. Click **Add** to create the header field.
119119

120120
<!-- border -->![Create Purchase Order Number](add-document-number.png)
121121

122-
The field now displays in your list of header fields where you find all the information again that you have just entered. You can edit or delete the field by clicking the respective icons on the right.
122+
The field now displays in your list of header fields, where you again find all the information that you have just entered. You can edit or delete the field by clicking the respective icons on the right.
123123

124124
<!-- border -->![View Purchase Order Number](added_purchase-order-number.png)
125125

@@ -131,11 +131,11 @@ Click **Add** again to open the dialog.
131131

132132
2. Select `string` for the `Data Type`.
133133

134-
3. As Document Information Extraction offers no equivalent field, leave the default extractor blank. Click **Add** to create the field.
134+
3. As Document Information Extraction offers no equivalent field, select `default` for the `Setup Type` but leave the default extractor blank. Click **Add** to create the field.
135135

136136
<!-- border -->![Create Purchase Order Status](add-purchase-order-status.png)
137137

138-
You have now created your first custom field. Go ahead and create the list of header fields as shown in the table and image below. Pay attention which fields have a default extractor and which do not. Feel free to extend or reduce the list of header fields.
138+
You have now created your first custom field. Go ahead and create the list of header fields as shown in the table and image below. Pay attention to which fields have a default extractor and which do not. Feel free to extend or reduce the list of header fields.
139139

140140
| Field Name | Data Type | Default Extractor
141141
| :------------------- | :---------- | :-----------------
@@ -174,15 +174,15 @@ In the dialog proceed as follows:
174174

175175
2. Select `string` for the `Data Type`.
176176

177-
3. Leave the default extractor blank and click **Add** to create the field.
177+
3. Select `default` for the `Setup Type`, leave the default extractor blank, and click **Add** to create the field.
178178

179179
<!-- border -->![Add SKU Number](add-sku-number.png)
180180

181181
The field now displays in your list of line item fields where you find all the information again that you have just entered.
182182

183183
<!-- border -->![View SKU Number](added-sku-number.png)
184184

185-
You have now created your first line item field. Go ahead and create the list of line item fields as shown in the table and image below. Pay attention which fields have a default extractor and which do not. Feel free to extend or reduce the list of line item fields.
185+
You have now created your first line item field. Go ahead and create the list of line item fields as shown in the table and image below. Pay attention to which fields have a default extractor and which do not. Feel free to extend or reduce the list of line item fields.
186186

187187
| Field Name | Data Type | Default Extractor
188188
| :------------------- | :---------- | :-----------------
