Annex: Dataset fields detail
Explanation of the processing that each field of the data model undergoes on its way to a clean PPD
Introduction
The following tables explain the processing that each field undergoes within this flow:
AURA DATASET → PPD_RAW → PPD_CLEAN
- Each cell of a table describes the processing applied to the data field before it reaches the stage named by the table column.
- For example, the field GLOBAL_AURA_ID is hashed before it is stored in PPD_RAW. The hashed value is then passed on to PPD_CLEAN without any further processing.
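The hashing example can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the actual Aura implementation: the annex does not name the hash function, so salted SHA-256 is an assumption, and the names `SALT`, `hash_identifier`, `to_ppd_raw` and `to_ppd_clean` are hypothetical.

```python
import hashlib

# Assumption: salted SHA-256. The annex does not say which hash function is used.
SALT = b"example-salt"  # hypothetical value, for illustration only


def hash_identifier(value: str, salt: bytes = SALT) -> str:
    """Return a hex digest that replaces the original identifier."""
    return hashlib.sha256(salt + value.encode("utf-8")).hexdigest()


def to_ppd_raw(record: dict) -> dict:
    """Hash GLOBAL_AURA_ID before the record is stored in PPD_RAW."""
    raw = dict(record)
    raw["GLOBAL_AURA_ID"] = hash_identifier(record["GLOBAL_AURA_ID"])
    return raw


def to_ppd_clean(raw_record: dict) -> dict:
    """Pass the already hashed value on to PPD_CLEAN without further processing."""
    return dict(raw_record)


source = {"GLOBAL_AURA_ID": "user-123"}  # made-up sample value
raw = to_ppd_raw(source)
clean = to_ppd_clean(raw)
```

After both steps, `clean["GLOBAL_AURA_ID"]` holds the same digest as `raw["GLOBAL_AURA_ID"]`, and the original identifier no longer appears in either record.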
Tables used in the Active Listening process are described in the following sections. They belong to the Aura Entities database.
- Columns “FIELD” and “DESCRIPTION”: instances managed by the OB
- Columns “PPD RAW” and “PPD CLEAN”: instances managed by Aura Global Team
MESSAGE dataset
Message dataset (stored in local Kernel)
- Columns “FIELD” and “DESCRIPTION”: instances managed by the OB
- Columns “PPD RAW” and “PPD CLEAN”: instances managed by Aura Global Team
| # | FIELD | DESCRIPTION | PPD RAW | PPD CLEAN |
|---|---|---|---|---|
| 1 | USER_ID | Unique user ID in the OB systems | NOT transferred | NOT transferred |
| 2 | MSG_DT | Timestamp of the data | ||
| 3 | MSG_ID | Unique ID of the message | NOT transferred | |
| 4 | ACTION_CD | Code of the action that produces the data | NOT transferred | |
| 5 | AURA_ID | User logging ID in Aura. The user will have a new Aura_id each time she logs in Aura. | Hashed | |
| 6 | PHONE_ID | Phone number of the user | NOT transferred | NOT transferred |
| 7 | CHANNEL_CD | Code of the channel where the action happened | ||
| 8 | SUBSCRIPTION_CD | Code of the subscription type of the user in the OB | NOT transferred | |
| 9 | DOMAIN_CD | Code of the domain where the action happened | NOT transferred | |
| 10 | CATEGORY_CD | Code of the category where the action happened | NOT transferred | |
| 11 | COUNTRY_CD | Code of the country | NOT transferred | |
| 12 | CORR_ID | Correlator ID of the request that produces this data | ||
| 13 | IS_CACHED | Shows if the entity content was already cached or not | NOT transferred | |
| 14 | STATUS_CD | Status code of the action, if meaningful | ||
| 15 | REASON | Result of the action in error case, code of the error | NOT transferred | |
| 16 | VERSION_ID | Aura version that produces this data | ||
| 17 | LANG_CD | Language configured by the user for communication | NOT transferred | |
| 18 | TZ_CD | Timezone where the communication happened | NOT transferred | |
| 19 | DURATION_NU | Duration in milliseconds of the action | ||
| 20 | MESSAGE | Content of the message | Anonymized | |
| 21 | DIALOG_ID | Id of the dialog where the message happens | ||
| 22 | CONVERSATION_ID | Id of the conversation where the message happens | NOT transferred | |
| 23 | WIN_RECOGNIZER_CD | Code of the recognizer that wins for this message | NOT transferred | |
| 24 | WIN_RECOGNIZER_SCORE_NU | Score of the recognizer that wins for this message | NOT transferred | |
| 25 | INTENT | Selected intent | ||
| 26 | ENTITIES | List of entities determined by the recognizer | ||
| 27 | MODALITY_CD | How does the user communicate with Aura | ||
| 28 | AURA_ID_GLOBAL | Identifies the same user_id logged with the same authentication method | Hashed | |
| 29 | ACCOUNT_NUMBER | Unique account number of the user | NOT transferred | NOT transferred |
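The MESSAGE table can be read as a per-field policy with one treatment per stage. The sketch below encodes a few of its rows and applies them to a made-up record. It rests on several assumptions: an empty cell is read as "passed on unchanged", SHA-256 stands in for the unspecified hash function, and `anonymize` is a placeholder because the annex does not describe the anonymization method. `MESSAGE_POLICY`, `hash_value`, `anonymize` and `apply_stage` are hypothetical names.

```python
import hashlib

# Treatment per stage (PPD RAW, PPD CLEAN), transcribed from a few MESSAGE rows.
# "" stands for an empty table cell, read here as "passed on unchanged" (assumption).
MESSAGE_POLICY = {
    "USER_ID": ("NOT transferred", "NOT transferred"),
    "MSG_DT": ("", ""),
    "AURA_ID": ("Hashed", ""),
    "PHONE_ID": ("NOT transferred", "NOT transferred"),
    "CHANNEL_CD": ("", ""),
    "MESSAGE": ("Anonymized", ""),
}


def hash_value(value: str) -> str:
    """Assumption: SHA-256. The annex does not name the hash function."""
    return hashlib.sha256(value.encode("utf-8")).hexdigest()


def anonymize(text: str) -> str:
    """Placeholder: the real anonymization method is not described in this annex."""
    return "<anonymized>"


def apply_stage(record: dict, stage: int) -> dict:
    """Apply the treatment of one stage (0 = PPD RAW, 1 = PPD CLEAN) to a record."""
    out = {}
    for field, value in record.items():
        treatment = MESSAGE_POLICY[field][stage]
        if treatment == "NOT transferred":
            continue  # the field is dropped at this stage
        if treatment == "Hashed":
            value = hash_value(value)
        elif treatment == "Anonymized":
            value = anonymize(value)
        out[field] = value
    return out


# Made-up sample record, for illustration only.
record = {
    "USER_ID": "u-1",
    "MSG_DT": "2020-01-01T10:00:00Z",
    "AURA_ID": "aura-42",
    "PHONE_ID": "600000000",
    "CHANNEL_CD": "app",
    "MESSAGE": "call my mother",
}
ppd_raw = apply_stage(record, 0)
ppd_clean = apply_stage(ppd_raw, 1)
```

With this policy, `ppd_raw` keeps only MSG_DT, AURA_ID (hashed), CHANNEL_CD and MESSAGE (anonymized), and `ppd_clean` is identical to `ppd_raw` because none of the remaining fields has a PPD CLEAN treatment.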
RECOGNIZER dataset
Recognizer dataset (stored in local Kernel)
- Columns “FIELD” and “DESCRIPTION”: instances managed by the OB
- Columns “PPD RAW” and “PPD CLEAN”: instances managed by Aura Global Team
| # | FIELD | DESCRIPTION | PPD RAW | PPD CLEAN |
|---|---|---|---|---|
| 1 | USER_ID | Unique user ID in the OB systems | NOT transferred | NOT transferred |
| 2 | RECOGNIZER_DT | Timestamp of the data | ||
| 3 | RECOGNIZER_ID | Unique ID of the recognizer | ||
| 4 | ACTION_CD | Code of the action that produces the data | NOT transferred | |
| 5 | AURA_ID | User logging ID in Aura. The user will have a new Aura_id each time she logs in Aura. | Hashed | |
| 6 | PHONE_ID | Phone number of the user | NOT transferred | NOT transferred |
| 7 | CHANNEL_CD | Code of the channel where the action happened | ||
| 8 | DOMAIN_CD | Code of the domain where the action happened | NOT transferred | |
| 9 | CATEGORY_CD | Code of the category where the action happened | NOT transferred | |
| 10 | COUNTRY_CD | Code of the country | NOT transferred | |
| 11 | CORR_ID | Correlator ID of the request that produces this data | ||
| 12 | IS_CACHED | Shows if the entity content was already cached or not | NOT transferred | |
| 13 | STATUS_CD | Status code of the action, if meaningful | ||
| 14 | REASON | Result of the action in error case, code of the error | ||
| 15 | VERSION_ID | Aura version that produces this data | ||
| 16 | LANG_CD | Language configured by the user for communication | NOT transferred | |
| 17 | TZ_CD | Timezone where the communication happened | NOT transferred | |
| 18 | DURATION_NU | Duration in milliseconds of the action | ||
| 19 | SCORE_NU | Score returned by the recognizer | ||
| 20 | INPUT | User input sent to the recognizer. Null if incoming message is an AuraCommand | Anonymized | |
| 21 | OUTPUT | Complete output generated by the recognizer | ||
| 22 | INTENT | Intent returned by the recognizer | ||
| 23 | ENTITIES | Entities returned by the recognizer due to the intent | ||
| 24 | COMMON_THRESHOLD_NU | Common threshold used to determine the best answer of all recognizers | NOT transferred | |
| 25 | THRESHOLD | Specific threshold of the specific recognizer being executed | NOT transferred | |
| 26 | EXPECTED_INTENT | Intent expected to be returned by the recognizer | NOT transferred | |
| 27 | EXPECTED_ENTITIES | Entities expected to be returned by the recognizer due to the intent | NOT transferred | |
| 28 | AURA_ID_GLOBAL | Identifies the same user_id logged with the same authentication method | Hashed | |
| 29 | ACCOUNT_NUMBER | Unique account number of the user | NOT transferred | NOT transferred |
API dataset
API request dataset (stored in local Kernel)
- Columns “FIELD” and “DESCRIPTION”: instances managed by the OB
- Columns “PPD RAW” and “PPD CLEAN”: instances managed by Aura Global Team
| # | FIELD | DESCRIPTION | PPD RAW | PPD CLEAN |
|---|---|---|---|---|
| 1 | USER_ID | Unique user ID in the OB systems | NOT transferred | NOT transferred |
| 2 | REQUEST_DT | Timestamp of the data | ||
| 3 | REQUEST_ID | Unique ID of the request | ||
| 4 | ACTION_CD | Code of the action that produces the data | NOT transferred | |
| 5 | AURA_ID | User logging ID in Aura. The user will have a new Aura_id each time she logs in Aura | Hashed | NOT transferred |
| 6 | PHONE_ID | Phone number of the user | NOT transferred | NOT transferred |
| 7 | CHANNEL_CD | Code of the channel where the action happened | NOT transferred | |
| 8 | DOMAIN_CD | Code of the domain where the action happened | NOT transferred | |
| 9 | CATEGORY_CD | Code of the category where the action happened | NOT transferred | |
| 10 | COUNTRY_CD | Code of the country | NOT transferred | |
| 11 | CORR_ID | Correlator ID of the request that produces this data | ||
| 12 | IS_CACHED | Shows if the entity content was already cached or not | NOT transferred | NOT transferred |
| 13 | STATUS_CD | Status code of the API request | ||
| 14 | REASON | Result of the action in error case, code of the error | ||
| 15 | VERSION_ID | Aura version that produces this data | NOT transferred | |
| 16 | LANG_CD | Language configured by the user for communication | NOT transferred | |
| 17 | TZ_CD | Timezone where the communication happened | ||
| 18 | DURATION_NU | Duration in milliseconds of the action | ||
| 19 | HOST | Host of the API | ||
| 20 | PATH | Specific path of the API being called | NOT transferred | |
| 21 | HTTP_STATUS | HTTP status of the server response | NOT transferred | |
| 22 | RESPONSE | Response body | Anonymized | |
| 23 | AURA_ID_GLOBAL | Identifies the same user_id logged with the same authentication method | Hashed | NOT transferred |
| 24 | ACCOUNT_NUMBER | Unique account number of the user | NOT transferred | NOT transferred |
| 25 | REQUEST | Request body |
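The "NOT transferred" markings of the API table also lend themselves to a simple check: a record at a given stage should not contain any field excluded at or before that stage. The following sketch is one possible way to express that check and is not part of the described process. The set names and `leaked_fields` are hypothetical, and the sample records are made up.

```python
# Fields marked "NOT transferred" in the PPD RAW column of the API table.
API_NOT_IN_RAW = {
    "USER_ID", "ACTION_CD", "PHONE_ID", "CHANNEL_CD", "DOMAIN_CD",
    "CATEGORY_CD", "COUNTRY_CD", "IS_CACHED", "VERSION_ID", "LANG_CD",
    "PATH", "HTTP_STATUS", "ACCOUNT_NUMBER",
}

# Fields marked "NOT transferred" in the PPD CLEAN column of the API table.
API_NOT_IN_CLEAN = {
    "USER_ID", "AURA_ID", "PHONE_ID", "IS_CACHED",
    "AURA_ID_GLOBAL", "ACCOUNT_NUMBER",
}

# A field dropped before PPD_RAW cannot reappear in PPD_CLEAN.
API_EXCLUDED_FROM_CLEAN = API_NOT_IN_RAW | API_NOT_IN_CLEAN


def leaked_fields(record: dict, excluded: set) -> list:
    """Return the sorted names of fields that should not be present in the record."""
    return sorted(set(record) & excluded)


# Made-up records, for illustration only.
raw_record = {
    "REQUEST_DT": "2020-01-01T10:00:00Z",
    "AURA_ID": "<hash>",
    "HOST": "api.example.com",
    "PATH": "/example",  # should have been dropped before PPD_RAW
}
clean_record = {
    "REQUEST_DT": "2020-01-01T10:00:00Z",
    "HOST": "api.example.com",
}

raw_leaks = leaked_fields(raw_record, API_NOT_IN_RAW)
clean_leaks = leaked_fields(clean_record, API_EXCLUDED_FROM_CLEAN)
```

Here `raw_leaks` reports PATH, while `clean_leaks` is empty. The REQUEST field is left out of both sets because its row in the table does not state a treatment.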