Annex: Dataset fields detail

Explanation of the process that each field of the data model is going through towards a clean PPD

Introduction

The objective of the following tables is to explain the process that each field is going through within this flow:

AURA DATASET PPD_RAW PPD_CLEAN


  • Each cell of the table explains the process that the data field is undergoing in this specific moment before it gets to the concrete stage (table column).

  • For example, the field GLOBAL_AURA_ID is undergoing a “hashing” before it gets stored in PPD_RAW. After this, the “hashed data” is progressed without any further processing to PPD_CLEAN.

Tables used in the Active Listening process are described in the following sections. They belong to the Aura Entities database.

  • Columns “FIELD” and “DESCRIPTION”: instances managed by the OB

  • Columns “PPD RAW” and “PPD CLEAN”: instances managed by Aura Global Team

MESSAGE dataset

Message dataset (stored in local Kernel)

  • COLUMNS “field” and “description”: instances managed by the OB

  • COLUMNS “PPD raw” and “PPD clean”: instances managed by Aura Global Team

# FIELD                 DESCRIPTION   PPD RAW       PPD CLEAN
1 USER_ID               Unique user ID in the OB systems NOT
transferred
 NOT
transferred
2 MSG_DT                 Timestamp of the data                                
3 MSG_ID                 Unique ID of the message                         NOT
transferred
4 ACTION_CD             Code of the action that produces the data                     NOT
transferred
5 AURA_ID               User logging ID in Aura. The user will have a new Aura_id each time she logs in Aura. Hashed              
6 PHONE_ID               Phone number of the user    NOT
transferred
NOT
transferred             
7 CHANNEL_CD             Code of the channel where the action happened                                  
8 SUBSCRIPTION_CD       Code of the subscription type of the user in the OB           NOT
transferred
9 DOMAIN_CD             Code of the domain where the action happened                 NOT
transferred
10 CATEGORY_CD           Code of the category where the action happened               NOT
transferred
11 COUNTRY_CD             Code of the country NOT
transferred
12 CORR_ID               Correlator ID of the request that produces this data                                    
13 IS_CACHED             Shows if the entity content was already cached or not     NOT
transferred
14 STATUS_CD             Status code of the action, if meaningful                               
15 REASON                 Result of the action in error case, code of the error   NOT
transferred
16 VERSION_ID             Aura version that produces this data                                
17 LANG_CD               Language configured by the user for communication   NOT
transferred
18 TZ_CD                 Timezone where the communication happened NOT transferred
19 DURATION_NU           Duration in milliseconds of the action                                
20 MESSAGE               Content of the message   Anonymized                  
21 DIALOG_ID             Id of the dialog where the message happens                              
22 CONVERSATION_ID       Id of the conversation where the message happens               NOT
transferred
23 WIN_RECOGNIZER_CD     Code of the recognizer that wins for this message             NOT
transferred
24 WIN_RECOGNIZER_SCORE_NU Score of the recognizer that wins for this message             NOT
transferred
25 INTENT                 Selected intent                                  
26 ENTITIES               List of entities determined by the recognizer                                
27 MODALITY_CD           How does the user communicate with Aura                                   
28 AURA_ID_GLOBAL         Identifies the same user_id logged with the same authentication method Hashed    
29 ACCOUNT_NUMBER         Unique account number of the user               NOT
transferred             
NOT
transferred

RECOGNIZER dataset

Recognizer dataset stored in local Kernel

  • Columns “FIELD” and “DESCRIPTION”: instances managed by the OB

  • Columns “PPD RAW” and “PPD CLEAN”: instances managed by Aura Global Team

# FIELD                   DESCRIPTION       PPD RAW       PPD CLEAN
1 USER_ID                 Unique user ID in the OB systems   NOT transferred NOT transferred             
2 RECOGNIZER_DT           Timestamp of the data                              
3 RECOGNIZER_ID           Unique ID of the recognizer                              
4 ACTION_CD               Code of the action that produces the data                 NOT transferred
5 AURA_ID                 User logging ID in Aura. The user will have a new Aura_id each time she logs in Aura. Hashed              
6 PHONE_ID               Phone number of the user   NOT transferred NOT transferred 
7 CHANNEL_CD             Code of the channel where the action happened                              
8 DOMAIN_CD               Code of the domain where the action happened     NOT transferred      
9 CATEGORY_CD             Code of the category where the action happened     NOT transferred  
10 COUNTRY_CD             Code of the country                NOT transferred         
11 CORR_ID                 Correlator ID of the request that produces this data                            
12 IS_CACHED               Shows if the entity content was already cached or not NOT transferred     
13 STATUS_CD               Status code of the action, if meaningful                            
14 REASON                 Result of the action in error case, code of the error                              
15 VERSION_ID             Aura version that produces this data                              
16 LANG_CD                 Language configured by the user for communication NOT transferred             
17 TZ_CD                   Timezone where the communication happened    NOT transferred            
18 DURATION_NU             Duration in milliseconds of the action                              
19 SCORE_NU               Score returned by the recognizer                                  
20 INPUT                   User input sent to the recognizer. Null if incoming message is an AuraCommand Anonymized              
21 OUTPUT                 Complete output generated by the recognizer                            
22 INTENT                 Intent returned by the recognizer                               
23 ENTITIES               Entities returned by the recognizer due to the intent                            
24 COMMON_THRESHOLD_NU     Common threshold used to determine the best answer of all recognizers                 NOT transferred
25 THRESHOLD               Specific threshold of the specific recognizer being executed  NOT transferred            
26 EXPECTED_INTENT         Intent expected to be returned by the recognizer  NOT transferred            
27 EXPECTED_ENTITIES       Entities expected to be returned by the recognizer due to the intent             NOT transferred
28 AURA_ID_GLOBAL         Identifies the same user_id logged with the same authentication method Hashed              
29 ACCOUNT_NUMBER         Unique account number of the user    NOT transferred NOT transferred             

This Markdown table can be directly used in your GitHub Markdown files.  

API dataset

API request dataset (stored in local Kernel)

  • Columns “FIELD” and “DESCRIPTION”: instances managed by the OB

  • Columns “PPD RAW” and “PPD CLEAN”: instances managed by Aura Global Team

#   FIELD            DESCRIPTION     PPD RAW       PPD CLEAN
1   USER_ID               Unique user ID in the OB systems                                   NOT transferred NOT transferred             
2   REQUEST_DT           Timestamp of the data                                                                        
3   REQUEST_ID           Unique ID of the request                                                                      
4   ACTION_CD             Code of the action that produces the data                                       NOT transferred
5   AURA_ID               User logging ID in Aura. The user will have a new Aura_id each time she logs in Aura Hashed NOT transferred
6   PHONE_ID             Phone number of the user                                           NOT transferred NOT transferred
7   CHANNEL_CD           Code of the channel where the action happened                                   NOT transferred
8   DOMAIN_CD             Code of the domain where the action happened                                   NOT transferred
9   CATEGORY_CD           Code of the category where the action happened                                 NOT transferred
10 COUNTRY_CD           Code of the country                                                             NOT transferred
11 CORR_ID               Correlator ID of the request that produces this data                                        
12 IS_CACHED             Shows if the entity content was already cached or not             NOT transferred NOT transferred             
13 STATUS_CD             Status code of the API request                                                                
14 REASON               Result of the action in error case, code of the error                                        
15 VERSION_ID           Aura version that produces this data                               NOT transferred
16 LANG_CD               Language configured by the user for communication                    NOT transferred          
17 TZ_CD                 Timezone where the communication happened                                      
18 DURATION_NU           Duration in milliseconds of the action                                                        
19 HOST                 Host of the API                                                                              
20 PATH                 Specific path of the API being called                               NOT transferred           
21 HTTP_STATUS           HTTP status of the server response                                  NOT transferred            
22 RESPONSE             Response body                                                     Anonymized                  
23 AURA_ID_GLOBAL       Identifies the same user_id logged with the same authentication method Hashed NOT transferred
24 ACCOUNT_NUMBER       Unique account number of the user         NOT transferred NOT transferred             
25 REQUEST               Request body