<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Aura – </title>
    <link>/docs/developers-workspace/monitoring/</link>
    <description>Recent content on Aura</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>en</language>
    
	  <atom:link href="/docs/developers-workspace/monitoring/index.xml" rel="self" type="application/rss+xml" />
    
    
      
        
      
    
    
    <item>
      <title>Docs: </title>
      <link>/docs/developers-workspace/monitoring/aura-analytics/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/docs/developers-workspace/monitoring/aura-analytics/</guid>
      <description>
        
        
        &lt;h1 id=&#34;aura-analytics-11&#34;&gt;Aura Analytics 1.1.&lt;/h1&gt;


&lt;div class=&#34;pageinfo pageinfo-primary&#34;&gt;
&lt;p&gt;Description of &lt;strong&gt;Aura Analytics 1.1&lt;/strong&gt;, the monitoring dataflow that allows active listening in Aura&lt;/p&gt;

&lt;/div&gt;

&lt;h2 id=&#34;introduction&#34;&gt;Introduction&lt;/h2&gt;
&lt;p&gt;This document describes a joint dataflow between LCDO OB teams and the Aura Global Team for processing &lt;strong&gt;Aura log files created in the production environment&lt;/strong&gt; (i.e., coming from actual Aura users) in order to create PPDs (Privacy-Preserving Datasets). This whole process is known as &lt;strong&gt;Active Listening&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;Among other elements, the dataflow produces an analytics component, named &lt;strong&gt;Aura Analytics Dashboard&lt;/strong&gt;, that can be used to gather statistics on the production system and to analyze users&amp;rsquo; behavior. This document describes the latest version of this dashboard, 1.1.&lt;/p&gt;
&lt;p&gt;The main objectives of the unified dataflow are:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Consolidate the processing of Aura logs into a framework.&lt;/li&gt;
&lt;li&gt;Provide LCDOs and Aura Global Team with a unified common source for analytics, in a privacy-preserving way.&lt;/li&gt;
&lt;li&gt;Enable extensibility of the dataflow.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In this framework, the current document provides:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The available &lt;a href=&#34;#aura-analytics-versions&#34;&gt;data analytics versions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#prerequisites-and-recommended-tools&#34;&gt;Prerequisites and recommended tools&lt;/a&gt; for its use&lt;/li&gt;
&lt;li&gt;Overall description of &lt;a href=&#34;../../../docs/developers-workspace/monitoring/aura-analytics/architecture/&#34;&gt;&lt;strong&gt;Aura Analytics&lt;/strong&gt; architecture&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Description of the &lt;a href=&#34;../../../docs/developers-workspace/monitoring/aura-analytics/ob-analytics/&#34;&gt;OB Analytics subsystem&lt;/a&gt; that enables the management by OBs&lt;/li&gt;
&lt;li&gt;Current &lt;a href=&#34;../../../docs/developers-workspace/monitoring/aura-analytics/data-model/&#34;&gt;&lt;strong&gt;Aura Analytics&lt;/strong&gt; data model&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The target audience of this document includes the following roles both in LCDO Teams and Aura Global Team:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Data Scientists and Product teams that wish to access Aura log information and perform analytics on it.&lt;/li&gt;
&lt;li&gt;Operation teams, for the architectural description and the requirements on OB environments.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;aura-analytics-versions&#34;&gt;Aura Analytics versions&lt;/h2&gt;
&lt;h3 id=&#34;release-10&#34;&gt;Release 1.0.&lt;/h3&gt;
&lt;p&gt;The first release, 1.0, sets up the basic paths, deploys the PPD infrastructure and produces:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Version 1.0. of the OB Analytics system, which includes the OB Dashboard.&lt;/li&gt;
&lt;li&gt;The first version of pre-processed datasets (clean PPDs) for training and analytics at Aura Global.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;As mentioned, this version enables OBs to go further by:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;Enhancing the OB Dashboard with new visualizations, as they see fit (given that panels and dashboards can be exported and imported, it is possible to share new ones across all OBs as they are developed).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Processing the PPD files as desired (they are standard CSV files, which can be ingested in alternative platforms). Restrictions on them are softer than on the original logs because they have been anonymized, although they are still subject to management precautions (a code of conduct is being prepared for that).&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;release-11&#34;&gt;Release 1.1.&lt;/h3&gt;
&lt;p&gt;Version 1.1. introduces the following changes:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The table of data has been enlarged with these new fields: &lt;code&gt;AURA_ID&lt;/code&gt;, &lt;code&gt;STATUS_CD&lt;/code&gt;, &lt;code&gt;sesId&lt;/code&gt;, &lt;code&gt;sesSize&lt;/code&gt;, &lt;code&gt;sesDuration&lt;/code&gt;.&lt;/li&gt;
&lt;li&gt;An expanded list of test users is used, so that the &lt;code&gt;userType&lt;/code&gt; column contains more identifications.&lt;/li&gt;
&lt;li&gt;The code for data ingestion into a local Kibana, which previously consisted of a single Python script, has been turned into a full installable Python package, due to its increasing complexity.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;prerequisites-and-recommended-tools&#34;&gt;Prerequisites and recommended tools&lt;/h2&gt;
&lt;p&gt;The prerequisites for the use of version 1.1 of the Aura Analytics Dashboard are listed below:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;Aura Platform version:&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Recommended operating system: Ubuntu 18.04&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Recommended tool for data visualization: &lt;a href=&#34;https://www.elastic.co/es/elastic-stack/&#34;&gt;ELK stack&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;

      </description>
    </item>
    
    <item>
      <title>Docs: </title>
      <link>/docs/developers-workspace/monitoring/active-listening/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/docs/developers-workspace/monitoring/active-listening/</guid>
      <description>
        
        
        &lt;h1 id=&#34;aura-analytics-200&#34;&gt;Aura Analytics 2.0.0&lt;/h1&gt;


&lt;div class=&#34;pageinfo pageinfo-primary&#34;&gt;
&lt;p&gt;Description of &lt;strong&gt;Aura Analytics 2.0.0&lt;/strong&gt;, the monitoring tool designed and managed by &lt;em&gt;&lt;strong&gt;Aura Global Team&lt;/strong&gt;&lt;/em&gt; that allows &lt;strong&gt;active listening&lt;/strong&gt; in Aura&lt;/p&gt;

&lt;/div&gt;

&lt;h2 id=&#34;what-is-aura-analytics-200&#34;&gt;What is Aura Analytics 2.0.0?&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Active listening&lt;/strong&gt; is defined as a key process that involves the &lt;strong&gt;continuous monitoring of Aura performance based on real logs from the users&lt;/strong&gt;. These logs are analyzed to gather insights on the efficiency and effectiveness of Aura as a system and to track the interaction of our users with Aura.&lt;/p&gt;
&lt;p&gt;In this framework, &lt;strong&gt;Aura Analytics 2.0.0&lt;/strong&gt; is a tool used by &lt;strong&gt;Aura Global Team&lt;/strong&gt; that uses &lt;strong&gt;active listening&lt;/strong&gt; with the ultimate goal of &lt;strong&gt;improving Aura quality&lt;/strong&gt;, as it generates accurate information to carry out both &lt;strong&gt;corrective and predictive actions&lt;/strong&gt; and to decide &lt;strong&gt;how Aura should evolve in the future&lt;/strong&gt;.&lt;/p&gt;
&lt;h2 id=&#34;how-does-aura-analytics-200-work&#34;&gt;How does Aura Analytics 2.0.0 work?&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;The process is built upon Aura user logs generated in the production environment&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;From these logs, &lt;strong&gt;Aura Analytics 2.0.0&lt;/strong&gt; creates &lt;em&gt;PPDs (Privacy-Preserving Datasets)&lt;/em&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Datasets are processed, enabling the visualization through dashboards and the extraction of statistical insights&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;The Aura Global Team consumes this data to support decision-making processes&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;target-users&#34;&gt;Target users&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;The &lt;em&gt;&lt;strong&gt;Aura Global Team&lt;/strong&gt;&lt;/em&gt; is the target user of the &lt;strong&gt;Aura Analytics 2.0.0&lt;/strong&gt; tool, responsible for its design and management as well as for the interpretation of results for decision-making.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;OBs&lt;/strong&gt; should allow the generation of datasets from their Aura user logs in their local environment just by installing and executing a single process, as shown in the document &lt;a href=&#34;../../../docs/developers-workspace/monitoring/active-listening/guidelines-obs/&#34;&gt;Guidelines for OBs&lt;/a&gt;.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;index-of-documents&#34;&gt;Index of documents&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Aura Analytics 2.0.0&lt;/strong&gt; includes the following documents:&lt;/p&gt;
&lt;!-- no toc --&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#aura-analytics-versions&#34;&gt;&lt;strong&gt;Version history&lt;/strong&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;../../../docs/developers-workspace/monitoring/active-listening/architecture&#34;&gt;&lt;strong&gt;Architecture overview and main processes and components&lt;/strong&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;../../../docs/developers-workspace/monitoring/active-listening/operation&#34;&gt;&lt;strong&gt;Internal operation&lt;/strong&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;../../../docs/developers-workspace/monitoring/active-listening/guidelines-obs&#34;&gt;&lt;strong&gt;Guidelines for OBs&lt;/strong&gt;&lt;/a&gt; willing to consume data provided by &lt;strong&gt;Aura Analytics 2.0.0&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;../../../docs/developers-workspace/monitoring/active-listening/analytics-dashboard&#34;&gt;&lt;strong&gt;Aura Analytics dashboard&lt;/strong&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;../../../docs/developers-workspace/monitoring/active-listening/annex-dataset-fields-detail/&#34;&gt;&lt;strong&gt;Annex: Dataset fields&lt;/strong&gt;&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;aura-analytics-versions&#34;&gt;Aura Analytics versions&lt;/h2&gt;
&lt;h3 id=&#34;release-100&#34;&gt;Release 1.0.0&lt;/h3&gt;
&lt;p&gt;The first release, 1.0.0, sets up the basic paths, deploys the PPD infrastructure and produces:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Version 1.0.0. of the OB Analytics system, which includes the OB Dashboard.&lt;/li&gt;
&lt;li&gt;The first version of pre-processed datasets (clean PPDs) for training and analytics at Aura Global.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;As mentioned, this version enables going further by:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;Enhancing the analytics dashboard with new visualizations.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Processing the PPD files as desired (they are standard CSV files, which can be ingested in alternative platforms). Restrictions on them are softer than on the original logs because they have been anonymized, although they are still subject to management precautions (a code of conduct is being prepared for that).&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;release-110&#34;&gt;Release 1.1.0&lt;/h3&gt;
&lt;p&gt;Version 1.1.0. introduces the following changes:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The table of data has been enlarged with these new fields: &lt;code&gt;AURA_ID&lt;/code&gt;, &lt;code&gt;STATUS_CD&lt;/code&gt;, &lt;code&gt;sesId&lt;/code&gt;, &lt;code&gt;sesSize&lt;/code&gt;, &lt;code&gt;sesDuration&lt;/code&gt;.&lt;/li&gt;
&lt;li&gt;An expanded list of test users is used, so that the &lt;code&gt;userType&lt;/code&gt; column contains more identifications.&lt;/li&gt;
&lt;li&gt;The code for data ingestion into a local Kibana, which previously consisted of a single Python script, has been turned into a full installable Python package, due to its increasing complexity.&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;release-200&#34;&gt;Release 2.0.0&lt;/h3&gt;
&lt;p&gt;Version 2.0.0 introduces the following changes:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;In version 2.0.0, &lt;strong&gt;Aura Analytics&lt;/strong&gt; has been refactored to improve its structure and make it easier to understand, maintain and extend in the future.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Aura Analytics 2.0.0&lt;/strong&gt; simplifies the deployment and execution process.&lt;/li&gt;
&lt;li&gt;One of the most significant enhancements in &lt;strong&gt;Aura Analytics 2.0.0&lt;/strong&gt; is its capability to manage both processed and to-be-processed files centrally in one place (a database).&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;prerequisites-and-recommended-tools&#34;&gt;Prerequisites and recommended tools&lt;/h2&gt;
&lt;p&gt;The prerequisites for the use of &lt;strong&gt;Aura Analytics 2.0.0&lt;/strong&gt; are listed below:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Recommended tool for data visualization: &lt;a href=&#34;https://www.elastic.co/es/elastic-stack/&#34;&gt;ELK stack&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

      </description>
    </item>
    
    <item>
      <title>Docs: </title>
      <link>/docs/developers-workspace/monitoring/billing-module/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/docs/developers-workspace/monitoring/billing-module/</guid>
      <description>
        
        
        &lt;h1 id=&#34;aura-billing-module&#34;&gt;Aura Billing Module&lt;/h1&gt;


&lt;div class=&#34;pageinfo pageinfo-primary&#34;&gt;
&lt;p&gt;Description of &lt;strong&gt;Aura Billing Module&lt;/strong&gt;, the tool for the generation of &lt;strong&gt;Liceo&lt;/strong&gt; invoices.&lt;/p&gt;

&lt;/div&gt;

&lt;h2 id=&#34;introduction&#34;&gt;Introduction&lt;/h2&gt;
&lt;p&gt;The &lt;strong&gt;Aura Billing Module&lt;/strong&gt; is a tool for the generation of &lt;strong&gt;Liceo&lt;/strong&gt; invoices, which allow charging each customer for the services they have used. This is a &lt;strong&gt;mandatory&lt;/strong&gt; process for OBs.&lt;/p&gt;
&lt;p&gt;It is based on the &lt;strong&gt;storage and processing of specific logs&lt;/strong&gt; in the OB&amp;rsquo;s Aura systems to track the &lt;strong&gt;type and number of interactions&lt;/strong&gt; of a user or service with Aura.&lt;/p&gt;
&lt;p&gt;This information is used to assign costs based on different billing models and criteria chosen by the OB, which ultimately determines the &lt;strong&gt;total amount of the invoice&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;The invoices will be generated in &lt;strong&gt;XLSX (Excel)&lt;/strong&gt; format and stored in an &lt;strong&gt;Azure Storage Explorer&lt;/strong&gt; blob container, along with the historical invoice records.&lt;/p&gt;
&lt;p&gt;These invoices will be available for download by the &lt;strong&gt;Aura Global Team&lt;/strong&gt;, to be sent to the OBs.&lt;/p&gt;
&lt;p&gt;&lt;i class=&#34;fa-regular fa-file-lines fa-xl&#34; style=&#34;color: #0d5de7;&#34;&gt;&lt;/i&gt; Interested in how the &lt;strong&gt;Aura Billing Module&lt;/strong&gt; works and which tasks are required to bring it into use? Access the document &lt;a href=&#34;../../../docs/developers-workspace/monitoring/billing-module/operation/&#34;&gt;Aura Billing Module operation&lt;/a&gt;.&lt;/p&gt;
&lt;h2 id=&#34;generated-liceo-invoices&#34;&gt;Generated Liceo invoices&lt;/h2&gt;
&lt;p&gt;The &lt;strong&gt;Liceo&lt;/strong&gt; invoices generated by &lt;strong&gt;Aura Billing Module&lt;/strong&gt; will contain the following information:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Invoicing model (based on the payment model of the OB)&lt;/li&gt;
&lt;li&gt;Aura components used to provide the service&lt;/li&gt;
&lt;li&gt;Service/app that used this component&lt;/li&gt;
&lt;li&gt;Number of queries per component&lt;/li&gt;
&lt;li&gt;Cost of each query in each specific component&lt;/li&gt;
&lt;li&gt;Total amount generated by each component&lt;/li&gt;
&lt;li&gt;Total number of requests made during the billing period&lt;/li&gt;
&lt;li&gt;Total amount of the invoice&lt;/li&gt;
&lt;/ul&gt;

      </description>
    </item>
    
    <item>
      <title>Docs: </title>
      <link>/docs/developers-workspace/monitoring/aura-logs/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/docs/developers-workspace/monitoring/aura-logs/</guid>
      <description>
        
        
        &lt;h1 id=&#34;manage-aura-logs&#34;&gt;Manage Aura logs&lt;/h1&gt;


&lt;div class=&#34;pageinfo pageinfo-primary&#34;&gt;
&lt;p&gt;Learn what Aura logs are and how they are managed in &lt;strong&gt;Kibana&lt;/strong&gt; and other tools&lt;/p&gt;

&lt;/div&gt;

&lt;h2 id=&#34;introduction&#34;&gt;Introduction&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Logs&lt;/strong&gt; are files that record specific single events, warnings and errors as they occur within a software environment. They can include contextual information, such as the time an event occurred and which user or endpoint was associated with it.&lt;/p&gt;
&lt;p&gt;In Aura, logs are generated by specific components when an event happens and stored in order to monitor or debug the system.&lt;/p&gt;
&lt;p&gt;Logs are stored in an &lt;a href=&#34;https://www.elastic.co/&#34;&gt;&lt;strong&gt;ElasticSearch&lt;/strong&gt; cluster&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Once logs are stored, they can be managed through the logging system that Aura integrates, based on &lt;a href=&#34;https://www.elastic.co/kibana/&#34;&gt;&lt;strong&gt;Kibana&lt;/strong&gt;&lt;/a&gt;, which is the official tool to manage logs in Aura. Moreover, logs can be managed with &lt;a href=&#34;https://grafana.com/&#34;&gt;&lt;strong&gt;Grafana&lt;/strong&gt;&lt;/a&gt; and &lt;a href=&#34;https://www.fluentd.org/&#34;&gt;fluentd&lt;/a&gt; for specific features.&lt;/p&gt;
&lt;p&gt;&amp;#x26a0;&amp;#xfe0f; You should not integrate third-party applications or scripts with &lt;strong&gt;ElasticSearch&lt;/strong&gt;. These kinds of integrations are fragile because the &lt;strong&gt;ElasticSearch&lt;/strong&gt; API is not part of the public interface with the OB. This means that it could change without notice for several reasons, such as an update of the &lt;strong&gt;ElasticSearch&lt;/strong&gt; version or a change in Aura&amp;rsquo;s internal architecture.&lt;/p&gt;
&lt;h2 id=&#34;manage-logs-in-kibana&#34;&gt;Manage logs in Kibana&lt;/h2&gt;
&lt;p&gt;The &lt;a href=&#34;https://www.elastic.co/guide/en/kibana/index.html&#34;&gt;official &lt;strong&gt;Kibana&lt;/strong&gt; User Guide&lt;/a&gt; is the reference guide to use &lt;strong&gt;Kibana&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;Moreover, the current section includes certain useful points for managing Aura logs through this tool.&lt;/p&gt;
&lt;h3 id=&#34;policies-in-kibana&#34;&gt;Policies in Kibana&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Kibana&lt;/strong&gt; includes index lifecycle policies.&lt;/p&gt;
&lt;p&gt;By default, we add one policy for each index created (service and system index), to delete the logs older than seven days.&lt;/p&gt;
&lt;h3 id=&#34;snapshot-in-kibana&#34;&gt;Snapshot in Kibana&lt;/h3&gt;
&lt;p&gt;Index snapshots are configured by default as long-term storage for the logs. These snapshots are taken daily and stored in the cluster&amp;rsquo;s Azure Storage blob container (&lt;code&gt;aura-backups/elk&lt;/code&gt;).&lt;/p&gt;
&lt;h2 id=&#34;manage-logs-in-grafana&#34;&gt;Manage logs in Grafana&lt;/h2&gt;
&lt;h3 id=&#34;discover-section&#34;&gt;Discover section&lt;/h3&gt;
&lt;p&gt;The &amp;ldquo;Discover&amp;rdquo; section in &lt;strong&gt;Grafana&lt;/strong&gt; is very useful to look for logs and troubleshoot issues.&lt;/p&gt;
&lt;p&gt;You can full-text search logs using &lt;a href=&#34;https://lucene.apache.org/core/2_9_4/queryparsersyntax.html&#34;&gt;&lt;strong&gt;Lucene&lt;/strong&gt; query syntax&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Moreover, logs are tagged with many fields that can be useful to narrow down a search, such as:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;kubernetes.labels.app&lt;/strong&gt;: name of the Kubernetes application that generated the log.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;kubernetes.pod_name&lt;/strong&gt;: name of the Kubernetes pod that generated the log.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;corr&lt;/strong&gt;: correlator that tracks E2E requests.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;lvl&lt;/strong&gt;: log level (&lt;code&gt;TRACE&lt;/code&gt;, &lt;code&gt;DEBUG&lt;/code&gt;, &lt;code&gt;INFO&lt;/code&gt;, &lt;code&gt;WARN&lt;/code&gt;, &lt;code&gt;ERROR&lt;/code&gt; or &lt;code&gt;FATAL&lt;/code&gt;).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Queries that rely on a specific text are weak. Aura cannot guarantee that log messages do not change between versions. In fact, they do change. This is why metrics based on logs will not be reliable and it is not recommended to use &lt;strong&gt;Kibana&lt;/strong&gt; to get metrics.&lt;/p&gt;
&lt;h2 id=&#34;manage-logs-in-fluentd&#34;&gt;Manage logs in fluentd&lt;/h2&gt;
&lt;h3 id=&#34;logs-external-forwarding-feature&#34;&gt;Logs external forwarding feature&lt;/h3&gt;
&lt;p&gt;It is possible to send logs to an external system (a &lt;strong&gt;fluentd&lt;/strong&gt; endpoint).&lt;/p&gt;
&lt;p&gt;To enable this feature, add the following configuration to your config file:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;background-color:#f8f8f8;-moz-tab-size:4;-o-tab-size:4;tab-size:4;&#34;&gt;&lt;code class=&#34;language-yaml&#34; data-lang=&#34;yaml&#34;&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;&lt;span style=&#34;color:#204a87;font-weight:bold&#34;&gt;external_forwarding&lt;/span&gt;&lt;span style=&#34;color:#000;font-weight:bold&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt;  &lt;/span&gt;&lt;span style=&#34;color:#204a87;font-weight:bold&#34;&gt;secret_shared_key&lt;/span&gt;&lt;span style=&#34;color:#000;font-weight:bold&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt; &lt;/span&gt;&lt;span style=&#34;color:#4e9a06&#34;&gt;&amp;#34;mysecretkey&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt;  &lt;/span&gt;&lt;span style=&#34;color:#204a87;font-weight:bold&#34;&gt;tls_config&lt;/span&gt;&lt;span style=&#34;color:#000;font-weight:bold&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt;    &lt;/span&gt;&lt;span style=&#34;color:#204a87;font-weight:bold&#34;&gt;tls_enabled&lt;/span&gt;&lt;span style=&#34;color:#000;font-weight:bold&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt; &lt;/span&gt;&lt;span style=&#34;color:#204a87;font-weight:bold&#34;&gt;True&lt;/span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt;  &lt;/span&gt;&lt;span style=&#34;color:#204a87;font-weight:bold&#34;&gt;remote_servers&lt;/span&gt;&lt;span style=&#34;color:#000;font-weight:bold&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt;  &lt;/span&gt;- &lt;span style=&#34;color:#204a87;font-weight:bold&#34;&gt;hostname&lt;/span&gt;&lt;span style=&#34;color:#000;font-weight:bold&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt; &lt;/span&gt;&lt;span style=&#34;color:#000&#34;&gt;xxx&lt;/span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt;    &lt;/span&gt;&lt;span style=&#34;color:#204a87;font-weight:bold&#34;&gt;port&lt;/span&gt;&lt;span style=&#34;color:#000;font-weight:bold&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt; &lt;/span&gt;&lt;span style=&#34;color:#000&#34;&gt;yyy&lt;/span&gt;&lt;span style=&#34;color:#f8f8f8;text-decoration:underline&#34;&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;ul&gt;
&lt;li&gt;Set the &lt;code&gt;hostname&lt;/code&gt; and &lt;code&gt;port&lt;/code&gt; fields to those of the remote endpoint. If you configure more than one remote server, fluentd balances the traffic across them in round-robin order.
&lt;!-- PRoduct Team requirement: explain how to configure more than one remote server --&gt;
&lt;/li&gt;
&lt;li&gt;The &lt;code&gt;hostname&lt;/code&gt; value can be an IP address, but it is not recommended if TLS is enabled. Turning off TLS is possible but discouraged for security reasons.&lt;/li&gt;
&lt;li&gt;&lt;code&gt;secret_shared_key&lt;/code&gt; is used to verify the client&amp;rsquo;s identity and must be configured properly in all the remote servers.&lt;/li&gt;
&lt;/ul&gt;
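&lt;p&gt;As a minimal sketch of the multi-server case, and assuming that &lt;code&gt;remote_servers&lt;/code&gt; accepts several entries with the same &lt;code&gt;hostname&lt;/code&gt; and &lt;code&gt;port&lt;/code&gt; keys as the single-server example above, a configuration with two receivers could look as follows. The hostnames are hypothetical placeholders, and &lt;code&gt;24224&lt;/code&gt; is only the default port of the fluentd &lt;code&gt;forward&lt;/code&gt; input, so replace them with the values of your own endpoints.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34;&gt;&lt;code class=&#34;language-yaml&#34; data-lang=&#34;yaml&#34;&gt;external_forwarding:
  secret_shared_key: &amp;#34;mysecretkey&amp;#34;  # must match the key configured on every receiver
  tls_config:
    tls_enabled: True
  remote_servers:
  # Hypothetical receivers (assumption): replace with your own endpoints
  - hostname: fluentd-1.example.com
    port: 24224
  - hostname: fluentd-2.example.com
    port: 24224
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;With such a configuration, traffic would be expected to alternate between the two receivers in round-robin order, as described in the list above.&lt;/p&gt;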
&lt;p&gt;You can find additional information regarding receivers&amp;rsquo; configuration (including TLS configuration and password authentication procedure) &lt;a href=&#34;https://docs.fluentd.org/v1.0/articles/in_forward&#34;&gt;here&lt;/a&gt;.&lt;/p&gt;

      </description>
    </item>
    
    <item>
      <title>Docs: </title>
      <link>/docs/developers-workspace/monitoring/metrics/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/docs/developers-workspace/monitoring/metrics/</guid>
      <description>
        
        
        &lt;h1 id=&#34;manage-aura-metrics&#34;&gt;Manage Aura metrics&lt;/h1&gt;


&lt;div class=&#34;pageinfo pageinfo-primary&#34;&gt;
&lt;p&gt;Learn what Aura metrics are, how they are generated and stored in &lt;strong&gt;Prometheus&lt;/strong&gt;, and how they are analyzed through &lt;strong&gt;Grafana&lt;/strong&gt;&lt;/p&gt;

&lt;/div&gt;

&lt;h2 id=&#34;introduction&#34;&gt;Introduction&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Metrics&lt;/strong&gt; provide a measurement of certain data that represent a specific aspect of the monitored system at a point in time and offer an aggregated view over the system. They are useful to visualize long-term trends and alerts on log data.&lt;/p&gt;
&lt;p&gt;Each Aura component is in charge of publishing its own metrics, which are typically generated at fixed-time intervals from aggregated logs.&lt;/p&gt;
&lt;p&gt;Once generated, Aura metrics are polled by &lt;strong&gt;Prometheus&lt;/strong&gt;, which is in charge of gathering and exposing them.&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://grafana.com/&#34;&gt;&lt;strong&gt;Grafana&lt;/strong&gt;&lt;/a&gt; is the most suitable tool to represent metrics through different dashboards. Each component has a &lt;a href=&#34;../../../docs/developers-workspace/monitoring/dashboards/&#34;&gt;&lt;strong&gt;Grafana&lt;/strong&gt; dashboard&lt;/a&gt; to show its current behavior and there is a single dashboard for an Aura overview.&lt;/p&gt;
&lt;p&gt;If you think a new metric could be useful, please contact the &lt;strong&gt;Aura Platform Team&lt;/strong&gt;, so it can be officially included as part of the platform.&lt;/p&gt;
&lt;p&gt;The aim of this section is to explain both how Aura metrics work and all the metrics stored by each component.&lt;/p&gt;
&lt;p&gt;&amp;#x26a0;&amp;#xfe0f; Saved dashboards, visualizations and queries are not guaranteed to be kept between upgrades because the whole stack, including &lt;strong&gt;ElasticSearch&lt;/strong&gt; and &lt;strong&gt;Grafana&lt;/strong&gt;, can be upgraded to newer versions.&lt;/p&gt;
&lt;h2 id=&#34;prometheus&#34;&gt;Prometheus&lt;/h2&gt;
&lt;p&gt;The Aura metrics system is based on &lt;a href=&#34;https://prometheus.io/&#34;&gt;&lt;strong&gt;Prometheus&lt;/strong&gt;&lt;/a&gt;, a Cloud Native Computing Foundation project that works as a systems and services monitoring system. &lt;strong&gt;Prometheus&lt;/strong&gt; collects metrics from configured targets at given intervals, evaluates rule expressions, displays the results, and can trigger alerts when specified conditions are observed.&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://github.com/siimon/prom-client&#34;&gt;prom-client&lt;/a&gt; is used to implement Prometheus functionality in &lt;code&gt;Node.js&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;The &lt;strong&gt;Prometheus&lt;/strong&gt; service polls every component to get the metrics generated during the last time period. Every component exposes a private endpoint (not accessible from the Internet) called &lt;code&gt;/metrics&lt;/code&gt;, where Prometheus requests the metrics.&lt;/p&gt;
&lt;p&gt;Currently, the metric types used in this component are:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;a href=&#34;https://prometheus.io/docs/concepts/metric_types/#summary&#34;&gt;&lt;strong&gt;Summary&lt;/strong&gt;&lt;/a&gt;: similar to &lt;a href=&#34;https://prometheus.io/docs/concepts/metric_types/#histogram&#34;&gt;&lt;strong&gt;histogram&lt;/strong&gt;&lt;/a&gt; metrics, it samples observations (such as request durations and response sizes). While it also provides a total count of observations and a sum of all observed values, it calculates configurable quantiles over a sliding time window.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;a href=&#34;https://prometheus.io/docs/concepts/metric_types/#counter&#34;&gt;&lt;strong&gt;Counter&lt;/strong&gt;&lt;/a&gt;: cumulative metric that represents a single monotonically increasing counter whose value can only increase or be reset to zero on restart. For example, you can use a counter to represent the number of requests served, tasks completed, or errors.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;a href=&#34;https://prometheus.io/docs/concepts/metric_types/#gauge&#34;&gt;&lt;strong&gt;Gauge&lt;/strong&gt;&lt;/a&gt;: similar to &lt;strong&gt;Counter&lt;/strong&gt;, but it represents a single numerical value that can arbitrarily go up and down.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;prometheus-es-exporter&#34;&gt;Prometheus-es-exporter&lt;/h2&gt;
&lt;p&gt;Working with Prometheus, we can create metrics from queries to ElasticSearch indexes (as well as create alarms, dashboards, etc.) using &lt;a href=&#34;https://github.com/braedon/prometheus-es-exporter&#34;&gt;&lt;strong&gt;prometheus-es-exporter&lt;/strong&gt;&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;This component is not deployed by default, but it can be enabled by changing the variable &lt;strong&gt;prometheus_es_exporter_enabled&lt;/strong&gt; to &lt;code&gt;true&lt;/code&gt; in your &lt;code&gt;config.yml&lt;/code&gt; file. (In Brazil, it is set to &lt;code&gt;true&lt;/code&gt; by default). Access here the guidelines to &lt;a href=&#34;../../../docs/deployment/installer/#enable-prometheus-es-exporter-component&#34;&gt;enable prometheus-es-exporter component&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;To configure your own metrics from queries, add a new section to your &lt;code&gt;config.yml&lt;/code&gt;, as in the following example.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;background-color:#f8f8f8;-moz-tab-size:4;-o-tab-size:4;tab-size:4;&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;&lt;span style=&#34;color:#000&#34;&gt;prometheus_es_exporter&lt;/span&gt;:
&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;  &lt;span style=&#34;color:#204a87;font-weight:bold&#34;&gt;query_blocks&lt;/span&gt;&lt;span style=&#34;color:#ce5c00;font-weight:bold&#34;&gt;:&lt;/span&gt;
&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;    &lt;span style=&#34;color:#000&#34;&gt;ob&lt;/span&gt;&lt;span style=&#34;color:#ce5c00;font-weight:bold&#34;&gt;:&lt;/span&gt;
&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;      &lt;span style=&#34;color:#ce5c00;font-weight:bold&#34;&gt;-&lt;/span&gt; &lt;span style=&#34;color:#000&#34;&gt;name&lt;/span&gt;&lt;span style=&#34;color:#ce5c00;font-weight:bold&#34;&gt;:&lt;/span&gt; &lt;span style=&#34;color:#4e9a06&#34;&gt;&amp;#34;query_ob_br&amp;#34;&lt;/span&gt;
&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;        &lt;span style=&#34;color:#000&#34;&gt;QueryIntervalSecs&lt;/span&gt;&lt;span style=&#34;color:#ce5c00;font-weight:bold&#34;&gt;:&lt;/span&gt; &lt;span style=&#34;color:#4e9a06&#34;&gt;&amp;#34;60&amp;#34;&lt;/span&gt;
&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;        &lt;span style=&#34;color:#000&#34;&gt;QueryJson&lt;/span&gt;&lt;span style=&#34;color:#ce5c00;font-weight:bold&#34;&gt;:&lt;/span&gt; &lt;span style=&#34;color:#4e9a06&#34;&gt;&amp;#39;{&amp;#34;size&amp;#34;:0,&amp;#34;query&amp;#34;:{&amp;#34;bool&amp;#34;:{&amp;#34;must&amp;#34;:[],&amp;#34;filter&amp;#34;:[{&amp;#34;bool&amp;#34;:{&amp;#34;filter&amp;#34;:[{&amp;#34;bool&amp;#34;:{&amp;#34;should&amp;#34;:[{&amp;#34;match_phrase&amp;#34;:{&amp;#34;msg&amp;#34;:&amp;#34;[AzureEventHub] emit&amp;#34;}}],&amp;#34;minimum_should_match&amp;#34;:1}},{&amp;#34;bool&amp;#34;:{&amp;#34;should&amp;#34;:[{&amp;#34;match_phrase&amp;#34;:{&amp;#34;kubernetes.labels.app&amp;#34;:&amp;#34;aura-bot&amp;#34;}}],&amp;#34;minimum_should_match&amp;#34;:1}}]}},{&amp;#34;range&amp;#34;:{&amp;#34;@timestamp&amp;#34;:{&amp;#34;gte&amp;#34;:&amp;#34;now-1m&amp;#34;,&amp;#34;lte&amp;#34;:&amp;#34;now&amp;#34;}}}]}}}&amp;#39;&lt;/span&gt;
&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;        &lt;span style=&#34;color:#000&#34;&gt;QueryIndices&lt;/span&gt;&lt;span style=&#34;color:#ce5c00;font-weight:bold&#34;&gt;:&lt;/span&gt; &lt;span style=&#34;color:#4e9a06&#34;&gt;&amp;#34;aurak8s-service-*&amp;#34;&lt;/span&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Where:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;name&lt;/code&gt;: Mandatory. Name of the query. It must start with &lt;code&gt;query_&lt;/code&gt;.&lt;/li&gt;
&lt;li&gt;&lt;code&gt;QueryIntervalSecs&lt;/code&gt;: Optional. How often to run the query, in seconds. By default, &lt;code&gt;60&lt;/code&gt;.&lt;/li&gt;
&lt;li&gt;&lt;code&gt;QueryJson&lt;/code&gt;: Mandatory. The search query to run.&lt;/li&gt;
&lt;li&gt;&lt;code&gt;QueryIndices&lt;/code&gt;: Optional. Indices to run the query on. Any way of specifying indices supported by your ElasticSearch version can be used. By default, &lt;code&gt;_all&lt;/code&gt;. Although this field is optional, setting it is highly recommended in order to narrow the scope of the search query.&lt;/li&gt;
&lt;/ul&gt;
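&lt;p&gt;For illustration, a block that sets only the mandatory fields plus &lt;code&gt;QueryIndices&lt;/code&gt; might look like the sketch below. The query name &lt;code&gt;query_example_bot_logs&lt;/code&gt; and the query body are hypothetical and only show the shape of the section. Since &lt;code&gt;QueryIntervalSecs&lt;/code&gt; is omitted, the default of &lt;code&gt;60&lt;/code&gt; seconds would apply.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34;&gt;&lt;code class=&#34;language-yaml&#34; data-lang=&#34;yaml&#34;&gt;prometheus_es_exporter:
  query_blocks:
    ob:
      # Hypothetical query name; it keeps the mandatory query_ prefix
      - name: &amp;#34;query_example_bot_logs&amp;#34;
        # Illustrative query: counts aura-bot log entries from the last minute
        QueryJson: &amp;#39;{&amp;#34;size&amp;#34;:0,&amp;#34;query&amp;#34;:{&amp;#34;bool&amp;#34;:{&amp;#34;filter&amp;#34;:[{&amp;#34;match_phrase&amp;#34;:{&amp;#34;kubernetes.labels.app&amp;#34;:&amp;#34;aura-bot&amp;#34;}},{&amp;#34;range&amp;#34;:{&amp;#34;@timestamp&amp;#34;:{&amp;#34;gte&amp;#34;:&amp;#34;now-1m&amp;#34;,&amp;#34;lte&amp;#34;:&amp;#34;now&amp;#34;}}}]}}}&amp;#39;
        # Restricts the search to the service indices instead of the default _all
        QueryIndices: &amp;#34;aurak8s-service-*&amp;#34;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;Keeping the time range in the query aligned with the query interval (one minute in both cases here) avoids counting the same log entries in consecutive runs.&lt;/p&gt;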
&lt;h2 id=&#34;aura-components-metrics&#34;&gt;Aura components metrics&lt;/h2&gt;
&lt;p&gt;The main Aura components can generate their own metrics.&lt;/p&gt;
&lt;p&gt;Select the component you are interested in from the left menu to access its details.&lt;/p&gt;

      </description>
    </item>
    
    <item>
      <title>Docs: </title>
      <link>/docs/developers-workspace/monitoring/dashboards/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/docs/developers-workspace/monitoring/dashboards/</guid>
      <description>
        
        
        &lt;h1 id=&#34;aura-dashboards&#34;&gt;Aura dashboards&lt;/h1&gt;


&lt;div class=&#34;pageinfo pageinfo-primary&#34;&gt;
&lt;p&gt;Discover the dashboards that can be generated through the different tools used for Aura monitoring in order to track and analyze data&lt;/p&gt;

&lt;/div&gt;

&lt;h2 id=&#34;introduction&#34;&gt;Introduction&lt;/h2&gt;
&lt;p&gt;Dashboards are reporting tools that aggregate and display metrics and key indicators, so they can be examined at a glance by all possible audiences.&lt;/p&gt;
&lt;p&gt;These dashboards allow data interpretation and provide an overall view for the evaluation of Aura&amp;rsquo;s performance, thus improving decision-making. Each component has its own dashboard showing its current behavior, and there is a single dashboard for an Aura overview.&lt;/p&gt;
&lt;p&gt;There are two types of dashboards for Aura metrics (Prometheus) that are generated in &lt;strong&gt;Grafana&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;a href=&#34;../../../docs/developers-workspace/monitoring/dashboards/components-dashboards/&#34;&gt;Aura components dashboards&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;a href=&#34;../../../docs/developers-workspace/monitoring/dashboards/system-dashboards/&#34;&gt;Aura system dashboards&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;

      </description>
    </item>
    
    <item>
      <title>Docs: </title>
      <link>/docs/developers-workspace/monitoring/alarms/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/docs/developers-workspace/monitoring/alarms/</guid>
      <description>
        
        
        &lt;h1 id=&#34;management-of-alerts-in-aura&#34;&gt;Management of alerts in Aura&lt;/h1&gt;


&lt;div class=&#34;pageinfo pageinfo-primary&#34;&gt;
&lt;p&gt;Learn how to manage alerts through the Prometheus system&lt;/p&gt;

&lt;/div&gt;

&lt;h2 id=&#34;introduction-to-alerts-in-aura&#34;&gt;Introduction to alerts in Aura&lt;/h2&gt;
&lt;p&gt;As previously stated, &lt;strong&gt;Prometheus&lt;/strong&gt; has a list of alert rules that are part of the platform configuration. These alerting rules allow you to define alert conditions based on the &lt;strong&gt;Prometheus&lt;/strong&gt; expression language.&lt;/p&gt;
&lt;p&gt;&amp;#x26a0;&amp;#xfe0f; It is possible to edit the Aura alert rules but, for now, changes are lost in a re-deployment.&lt;br&gt;
If you think an alert is important and should be part of the platform, let us know, so we can officially include it.&lt;/p&gt;
&lt;p&gt;Alerts are sent via email, using a global SMTP server managed by the Aura Team. Other notification channels (such as &lt;strong&gt;Slack&lt;/strong&gt;) are also available, but they are not used by default in production.&lt;/p&gt;
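&lt;p&gt;As a rough sketch of what email delivery involves on the AlertManager side, the fragment below uses the standard AlertManager configuration keys. All host names, addresses and the receiver name are hypothetical placeholders; the actual values are managed by the Aura Team and are not documented here.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34;&gt;&lt;code class=&#34;language-yaml&#34; data-lang=&#34;yaml&#34;&gt;global:
  # Hypothetical SMTP server and sender address
  smtp_smarthost: &amp;#34;smtp.example.com:587&amp;#34;
  smtp_from: &amp;#34;alerts@example.com&amp;#34;
route:
  # Every alert goes to the email receiver unless a more specific route matches
  receiver: &amp;#34;email-default&amp;#34;
receivers:
  - name: &amp;#34;email-default&amp;#34;
    email_configs:
      # Hypothetical recipient
      - to: &amp;#34;operations@example.com&amp;#34;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;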
&lt;p&gt;Alerts are disabled (silenced) during Aura deployments to avoid false positives due to services that need to be restarted, etc.&lt;/p&gt;
&lt;p&gt;In order to manage alerts, Aura Platform includes the &lt;strong&gt;AlertManager system&lt;/strong&gt;, which is part of the &lt;strong&gt;Prometheus&lt;/strong&gt; Stack.
The URL to access AlertManager is:&lt;br&gt;
&lt;code&gt;alerts-{{ environment_name }}.auracognitive.com&lt;/code&gt;&lt;/p&gt;
&lt;p&gt;When you open this URL, you can see all the alerts, as shown in the image below.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;../../../images/deployment/operation/alertmanager-home.png&#34; alt=&#34;Alert manager home&#34; title=&#34;Alert manager home&#34;&gt;&lt;/p&gt;
&lt;p&gt;In this panel, the most important action is to &amp;ldquo;silence&amp;rdquo; an alarm, either by clicking &amp;ldquo;silence alarm&amp;rdquo; on the alarm itself or by pressing the &amp;ldquo;new silence&amp;rdquo; button.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;../../../images/deployment/operation/alertmanager-newsilence.png&#34; alt=&#34;Alert manager new silence&#34; title=&#34;Alert manager new silence&#34;&gt;&lt;/p&gt;
&lt;p&gt;To check whether the cluster is ready, or to see the status of the system, click the &amp;ldquo;status&amp;rdquo; section.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;../../../images/deployment/operation/alertmanager-home.png&#34; alt=&#34;Alert manager status&#34; title=&#34;Alert manager status&#34;&gt;&lt;/p&gt;
&lt;h2 id=&#34;alerts-set-in-aura&#34;&gt;Alerts set in Aura&lt;/h2&gt;
&lt;p&gt;This section lists the alerts currently set in Aura, organized by scope.&lt;/p&gt;
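&lt;p&gt;Each entry below gives the fields of one alerting rule: description, expression, duration and summary. As a sketch only, assuming the rules are stored in the standard Prometheus rule-file format and that the &amp;laquo; &amp;raquo; markers in the descriptions stand for template delimiters, the &lt;code&gt;high_memory_usage_on_hosts&lt;/code&gt; entry would map to something like the following. The group name is hypothetical.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34;&gt;&lt;code class=&#34;language-yaml&#34; data-lang=&#34;yaml&#34;&gt;groups:
  # Hypothetical group name, taken from the scope of the alert
  - name: infrastructure
    rules:
      - alert: high_memory_usage_on_hosts
        # Condition that must hold: memory usage per host above 90%
        expr: &amp;#39;sum by(kubernetes_io_hostname) (container_memory_working_set_bytes{id=&amp;#34;/&amp;#34;}) / sum by(kubernetes_io_hostname) (machine_memory_bytes) * 100 &amp;gt; 90&amp;#39;
        # The condition must stay true for this long before the alert fires
        for: 10m
        annotations:
          summary: &amp;#34;HIGH MEMORY USAGE WARNING ON &amp;#39;{{ $labels.kubernetes_io_hostname }}&amp;#39;&amp;#34;
          description: &amp;#34;{{ $labels.kubernetes_io_hostname }} is using a LOT of Memory. Memory usage is {{ humanize $value }}%.&amp;#34;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;The &lt;code&gt;For&lt;/code&gt; value matters when reading the lists: an alert is only sent once its expression has been continuously true for that duration.&lt;/p&gt;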
&lt;h3 id=&#34;scope-infrastructure&#34;&gt;Scope: &lt;strong&gt;infrastructure&lt;/strong&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;h4 id=&#34;high_cpu_usage_on_hosts&#34;&gt;&lt;strong&gt;high_cpu_usage_on_hosts&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: &amp;laquo; $labels.kubernetes_io_hostname &amp;raquo; is using a LOT of CPU. CPU usage is &amp;laquo; humanize $value &amp;raquo;%.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;sum by(kubernetes_io_hostname) (rate(container_cpu_usage_seconds_total{id=&amp;quot;/&amp;quot;}[5m])) / sum by(kubernetes_io_hostname) (machine_cpu_cores) * 100 &amp;gt; 90&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 10m&lt;/li&gt;
&lt;li&gt;summary: HIGH CPU USAGE WARNING ON &amp;lsquo;{{ $labels.kubernetes_io_hostname }}&amp;rsquo;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;high_memory_usage_on_hosts&#34;&gt;&lt;strong&gt;high_memory_usage_on_hosts&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: &amp;laquo; $labels.kubernetes_io_hostname &amp;raquo; is using a LOT of Memory. Memory usage is &amp;laquo; humanize $value &amp;raquo;%.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;sum by(kubernetes_io_hostname) (container_memory_working_set_bytes{id=&amp;quot;/&amp;quot;}) / sum by(kubernetes_io_hostname) (machine_memory_bytes) * 100 &amp;gt; 90&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 10m&lt;/li&gt;
&lt;li&gt;summary: HIGH MEMORY USAGE WARNING ON &amp;lsquo;{{ $labels.kubernetes_io_hostname }}&amp;rsquo;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;high_fs_usage_on_hosts&#34;&gt;&lt;strong&gt;high_fs_usage_on_hosts&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: &amp;laquo; $labels.kubernetes_io_hostname &amp;raquo; is using a LOT of FileSystem space. FileSystem usage is &amp;laquo; humanize $value &amp;raquo;%.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;sum by(kubernetes_io_hostname) (container_fs_usage_bytes{device=~&amp;quot;^/dev/.*$&amp;quot;,id=&amp;quot;/&amp;quot;}) / sum by(kubernetes_io_hostname) (container_fs_limit_bytes{device=~&amp;quot;^/dev/.*$&amp;quot;,id=&amp;quot;/&amp;quot;}) * 100 &amp;gt; 70&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 10m&lt;/li&gt;
&lt;li&gt;summary: HIGH FILESYSTEM USAGE WARNING ON &amp;lsquo;{{ $labels.kubernetes_io_hostname }}&amp;rsquo;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;scope-kubernetes&#34;&gt;Scope: &lt;strong&gt;kubernetes&lt;/strong&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;h4 id=&#34;high_persistent_volume_usage&#34;&gt;&lt;strong&gt;high_persistent_volume_usage&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: &amp;laquo; $labels.persistentvolumeclaim &amp;raquo; on &amp;laquo; $labels.kubernetes_io_hostname &amp;raquo; is using a LOT of persistent volume space. Persistent volume usage is &amp;laquo; humanize $value &amp;raquo;%.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;kubelet_volume_stats_used_bytes * 100 / kubelet_volume_stats_capacity_bytes &amp;gt; 70&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 10m&lt;/li&gt;
&lt;li&gt;summary: HIGH PERSISTENT VOLUME USAGE WARNING ON &amp;lsquo;{{ $labels.kubernetes_io_hostname }}&amp;rsquo; by &amp;lsquo;{{ $labels.persistentvolumeclaim }}&amp;rsquo;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;high_persistent_volume_inode_usage&#34;&gt;&lt;strong&gt;high_persistent_volume_inode_usage&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: &amp;laquo; $labels.persistentvolumeclaim &amp;raquo; on &amp;laquo; $labels.kubernetes_io_hostname &amp;raquo; is using a LOT of persistent volume inodes. Persistent volume inode usage is &amp;laquo; humanize $value &amp;raquo;%.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;kubelet_volume_stats_inodes_used * 100 / kubelet_volume_stats_inodes &amp;gt; 70&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 10m&lt;/li&gt;
&lt;li&gt;summary: HIGH PERSISTENT VOLUME INODE USAGE WARNING ON &amp;lsquo;{{ $labels.kubernetes_io_hostname  }}&amp;rsquo; by &amp;lsquo;{{ $labels.persistentvolumeclaim }}&amp;rsquo;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;docker_deleted_container_rate_on_hosts&#34;&gt;&lt;strong&gt;docker_deleted_container_rate_on_hosts&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: &amp;laquo; $labels.kubernetes_io_hostname &amp;raquo; has a HIGH rate of deleted/stopped containers.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;sum by(kubernetes_io_hostname) (rate(kubelet_docker_operations{operation_type=~&amp;quot;remove_container|stop_container&amp;quot;}[5m])) &amp;gt; 0.1&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 1m&lt;/li&gt;
&lt;li&gt;summary: DOCKER DELETED/STOPPED CONTAINER RATE WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;runtime_deleted_container_rate_on_hosts&#34;&gt;&lt;strong&gt;runtime_deleted_container_rate_on_hosts&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: &amp;laquo; $labels.kubernetes_io_hostname &amp;raquo; has a HIGH rate of deleted/stopped containers.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;sum by(kubernetes_io_hostname) (rate(kubelet_runtime_operations{operation_type=~&amp;quot;stop_podsandbox|remove_container|stop_container&amp;quot;}[5m])) &amp;gt; 0.1&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 1m&lt;/li&gt;
&lt;li&gt;summary: RUNTIME DELETED/STOPPED CONTAINER RATE WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;frequent_container_restarts&#34;&gt;&lt;strong&gt;frequent_container_restarts&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: Container &amp;laquo; $labels.container &amp;raquo; on pod &amp;laquo; $labels.pod &amp;raquo; has been restarted &amp;laquo; $value &amp;raquo; times within the last hour.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;increase(kube_pod_container_status_restarts_total[1h]) &amp;gt; 5&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 5m&lt;/li&gt;
&lt;li&gt;summary: KUBERNETES FREQUENT CONTAINER RESTARTS WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;node_not_ready&#34;&gt;&lt;strong&gt;node_not_ready&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: Node &amp;laquo; $labels.node &amp;raquo; has status &amp;laquo; $labels.condition &amp;raquo; as &amp;laquo; $labels.status &amp;raquo;.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;kube_node_status_condition{condition!=&amp;quot;Ready&amp;quot;,status!=&amp;quot;false&amp;quot;} &amp;gt; 0 or on(node) kube_node_status_condition{condition=&amp;quot;Ready&amp;quot;,status=&amp;quot;false&amp;quot;} &amp;gt; 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 5m&lt;/li&gt;
&lt;li&gt;summary: KUBERNETES NODE NOT READY WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;job_error&#34;&gt;&lt;strong&gt;job_error&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: JOB ERROR&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;kube_job_status_failed==1&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 5m&lt;/li&gt;
&lt;li&gt;summary: KUBERNETES JOB NOT READY WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;scope-prometheus&#34;&gt;Scope: &lt;strong&gt;prometheus&lt;/strong&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;h4 id=&#34;prometheus_rule_evaluation_slow&#34;&gt;&lt;strong&gt;prometheus_rule_evaluation_slow&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: Prometheus has a 90th percentile latency of &amp;laquo; $value &amp;raquo;s completing rule evaluation cycles.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;prometheus_evaluator_duration_seconds{quantile=&amp;quot;0.9&amp;quot;} &amp;gt; 60&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 10m&lt;/li&gt;
&lt;li&gt;summary: PROMETHEUS RULE EVALUATION SLOW WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;prometheus_indexing_backlog&#34;&gt;&lt;strong&gt;prometheus_indexing_backlog&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: Prometheus is backlogging on the indexing queue. Queue is currently &amp;laquo; $value | printf &lt;code&gt;%.0f&lt;/code&gt; &amp;raquo;% full.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;prometheus_local_storage_indexing_queue_length / prometheus_local_storage_indexing_queue_capacity * 100 &amp;gt; 10&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 10m&lt;/li&gt;
&lt;li&gt;summary: PROMETHEUS INDEXING BACKLOG WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;prometheus_not_ingesting_samples&#34;&gt;&lt;strong&gt;prometheus_not_ingesting_samples&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: Prometheus has not ingested any sample in the last 10 minutes.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;rate(prometheus_local_storage_ingested_samples_total[5m]) == 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 5m&lt;/li&gt;
&lt;li&gt;summary: PROMETHEUS NOT INGESTING SAMPLES WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;prometheus_persist_errors&#34;&gt;&lt;strong&gt;prometheus_persist_errors&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: Prometheus has encountered &amp;laquo; $value &amp;raquo; persistent errors per second in the last 10 minutes.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;rate(prometheus_local_storage_persist_errors_total[10m]) &amp;gt; 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 5m&lt;/li&gt;
&lt;li&gt;summary: PROMETHEUS PERSIST ERRORS WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;prometheus_notifications_backlog&#34;&gt;&lt;strong&gt;prometheus_notifications_backlog&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: Prometheus is backlogging on the notifications queue. The queue has not been empty for 10 minutes. Current queue length: &amp;laquo; $value &amp;raquo;.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;prometheus_notifications_queue_length &amp;gt; 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 10m&lt;/li&gt;
&lt;li&gt;summary: PROMETHEUS NOTIFICATIONS BACKLOG WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;prometheus_storage_inconsistent&#34;&gt;&lt;strong&gt;prometheus_storage_inconsistent&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: Prometheus has detected a storage inconsistency. A server restart is needed to initiate recovery.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;prometheus_local_storage_inconsistencies_total &amp;gt; 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 5m&lt;/li&gt;
&lt;li&gt;summary: PROMETHEUS STORAGE INCONSISTENCY WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;prometheus_persistence_pressure_too_high_24h&#34;&gt;&lt;strong&gt;prometheus_persistence_pressure_too_high_24h&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: Prometheus is approaching critical persistence pressure. Throttled ingestion expected within the next 24h.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;prometheus_local_storage_persistence_urgency_score &amp;gt; 0.8 and predict_linear(prometheus_local_storage_persistence_urgency_score[30m], 3600 * 24) &amp;gt; 1&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 30m&lt;/li&gt;
&lt;li&gt;summary: PROMETHEUS PERSISTENCE PRESSURE 24H WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;prometheus_persistence_pressure_too_high_2h&#34;&gt;&lt;strong&gt;prometheus_persistence_pressure_too_high_2h&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: Prometheus is approaching critical persistence pressure. Throttled ingestion expected within the next 2h.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;prometheus_local_storage_persistence_urgency_score &amp;gt; 0.85 and predict_linear(prometheus_local_storage_persistence_urgency_score[30m], 3600 * 2) &amp;gt; 1&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 30m&lt;/li&gt;
&lt;li&gt;summary: PROMETHEUS PERSISTENCE PRESSURE 24H WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;prometheus_series_maintenance_stalled&#34;&gt;&lt;strong&gt;prometheus_series_maintenance_stalled&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: Prometheus is maintaining memory time series so slowly that it will take &amp;laquo; $value | printf &lt;code&gt;%.0f&lt;/code&gt; &amp;raquo;h to complete a full cycle. This will lead to persistence falling behind.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;prometheus_local_storage_memory_series / on(job, instance) rate(prometheus_local_storage_series_ops_total{type=&amp;quot;maintenance_in_memory&amp;quot;}[5m]) / 3600 &amp;gt; 24 and prometheus_local_storage_rushed_mode == 1&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 1h&lt;/li&gt;
&lt;li&gt;summary: PROMETHEUS SERIES MAINTENANCE WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;prometheus_target_scrape_sync_too_low&#34;&gt;&lt;strong&gt;prometheus_target_scrape_sync_too_low&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: Prometheus target scrape sync rate is too low.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;rate(prometheus_target_scrape_pool_sync_total{app=&amp;quot;prometheus&amp;quot;}[10m]) == 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 5m&lt;/li&gt;
&lt;li&gt;summary: PROMETHEUS TARGET SCRAPE SYNC WARNING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;scope-logs&#34;&gt;Scope: &lt;strong&gt;logs&lt;/strong&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;h4 id=&#34;elasticsearch_too_few_nodes_running&#34;&gt;&lt;strong&gt;elasticsearch_too_few_nodes_running&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: There are only &amp;laquo; $value &amp;raquo; &amp;lt; 3 ElasticSearch nodes running.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;elasticsearch_cluster_health_number_of_nodes &amp;lt; 3&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 10m&lt;/li&gt;
&lt;li&gt;summary: TOO FEW ELASTICSEARCH NODES&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;elasticsearch_high_memory_usage&#34;&gt;&lt;strong&gt;elasticsearch_high_memory_usage&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: The memory (heap) usage is over 90% for 15m on node &amp;laquo; $labels.node &amp;raquo;&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;elasticsearch_jvm_memory_used_bytes{area=&amp;quot;heap&amp;quot;} / elasticsearch_jvm_memory_max_bytes{area=&amp;quot;heap&amp;quot;} &amp;gt; 0.9&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 15m&lt;/li&gt;
&lt;li&gt;summary: ELASTICSEARCH HIGH MEMORY USAGE&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;elasticsearch_not_indexing&#34;&gt;&lt;strong&gt;elasticsearch_not_indexing&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: ElasticSearch data node is not indexing new documents&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;increase(elasticsearch_indices_docs{es_data_node=&amp;quot;true&amp;quot;}[1m]) == 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 5m&lt;/li&gt;
&lt;li&gt;summary: ELASTICSEARCH NOT INDEXING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;scope-aura&#34;&gt;Scope: &lt;strong&gt;Aura&lt;/strong&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;h4 id=&#34;aura-bot_unauthorized_aura-bridge&#34;&gt;&lt;strong&gt;aura-bot_unauthorized_aura-bridge&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: &lt;em&gt;&lt;strong&gt;aura-bridge&lt;/strong&gt;&lt;/em&gt; has not authorized the connection with &lt;em&gt;&lt;strong&gt;aura-bot&lt;/strong&gt;&lt;/em&gt; for 3 minutes.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;sum by (status_code) (rate(http_request_duration_seconds_count{app=&amp;quot;aura-bridge&amp;quot;,status_code=~&amp;quot;401&amp;quot;}[3m])) &amp;gt; 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 3m&lt;/li&gt;
&lt;li&gt;summary: AURA-BOT RETURN UNAUTHORIZED TO AURA-BRIDGE&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;aura-bot_bad-request_aura-bridge&#34;&gt;&lt;strong&gt;aura-bot_bad-request_aura-bridge&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: &lt;em&gt;&lt;strong&gt;aura-bridge&lt;/strong&gt;&lt;/em&gt; has not been able to correctly handle the connection with &lt;em&gt;&lt;strong&gt;aura-bot&lt;/strong&gt;&lt;/em&gt; for 3 minutes.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;sum by (status_code) (rate(http_request_duration_seconds_count{app=&amp;quot;aura-bridge&amp;quot;,status_code=~&amp;quot;400&amp;quot;}[3m])) &amp;gt; 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 3m&lt;/li&gt;
&lt;li&gt;summary: AURA-BOT RETURN BAD REQUEST TO AURA-BRIDGE&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;aura-bot_internal-error_aura-bridge&#34;&gt;&lt;strong&gt;aura-bot_internal-error_aura-bridge&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: &lt;em&gt;&lt;strong&gt;aura-bridge&lt;/strong&gt;&lt;/em&gt; failed to connect to &lt;em&gt;&lt;strong&gt;aura-bot&lt;/strong&gt;&lt;/em&gt; for 3 minutes.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;sum by (host,status) (rate(outgoing_request_duration_seconds_count{app=&amp;quot;aura-bridge&amp;quot;,status=~&amp;quot;5..&amp;quot;,host=~&amp;quot;aura-bot.*&amp;quot;}[3m])) &amp;gt; 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 3m&lt;/li&gt;
&lt;li&gt;summary: COMMUNICATION ERROR BETWEEN AURA-BOT AND AURA-BRIDGE&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;aura-bridge-error_callback&#34;&gt;&lt;strong&gt;aura-bridge-error_callback&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: &lt;em&gt;&lt;strong&gt;aura-bridge&lt;/strong&gt;&lt;/em&gt; failed to handle the connection with callback for 3 minutes.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;sum by (host,status) (rate(outgoing_request_duration_seconds_count{app=&amp;quot;aura-bridge&amp;quot;,status=~&amp;quot;5..&amp;quot;,host!~&amp;quot;aura-bot.*&amp;quot;}[3m])) &amp;gt; 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 3m&lt;/li&gt;
&lt;li&gt;summary: COMMUNICATION ERROR BETWEEN AURA-BRIDGE AND CALLBACK&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;aura-bridge_error_whatsapp&#34;&gt;&lt;strong&gt;aura-bridge_error_whatsapp&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: errors in &lt;em&gt;&lt;strong&gt;aura-bridge&lt;/strong&gt;&lt;/em&gt; with WhatsApp functionality for 5 minutes.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;sum by (origin,originStatus)(rate(outgoing_message_duration_seconds_count{app=&amp;quot;aura-bridge&amp;quot;,origin=~&amp;quot;aura-bot|whatsapp|4p&amp;quot;,originStatus!=&amp;quot;200&amp;quot;,httpStatus!~&amp;quot;403|408|400&amp;quot;}[5m])) &amp;gt; 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 5m&lt;/li&gt;
&lt;li&gt;summary: Error happened in WhatsApp functionality.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;aura-bridge_error_4p&#34;&gt;&lt;strong&gt;aura-bridge_error_4p&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: errors in &lt;em&gt;&lt;strong&gt;aura-bridge&lt;/strong&gt;&lt;/em&gt; with &lt;strong&gt;Kernel&lt;/strong&gt; in WhatsApp functionality for 5 minutes.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;sum by (origin,originStatus)(rate(outgoing_message_duration_seconds_count{app=&amp;quot;aura-bridge&amp;quot;,origin=~&amp;quot;4p&amp;quot;,httpStatus=~&amp;quot;403|408|400&amp;quot;}[5m])) &amp;gt; 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 5m&lt;/li&gt;
&lt;li&gt;summary: Error happened with &lt;strong&gt;Kernel&lt;/strong&gt; in WhatsApp functionality.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;nlp-provisioning_killed-processes&#34;&gt;&lt;strong&gt;nlp-provisioning_killed-processes&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: killed &lt;em&gt;&lt;strong&gt;nlp-provisioning&lt;/strong&gt;&lt;/em&gt; processes for 15 minutes.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;sum by (exported_job) (rate(nlp_provisioning_killed_processes{exported_job=&amp;quot;nlp_provisioning_job&amp;quot;}[15m])) &amp;gt; 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 15m&lt;/li&gt;
&lt;li&gt;summary: Processes killed in &lt;em&gt;&lt;strong&gt;nlp-provisioning&lt;/strong&gt;&lt;/em&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;h4 id=&#34;alive-processes_nlp-provisioning_expected-alive-processes&#34;&gt;&lt;strong&gt;alive-processes_nlp-provisioning_expected-alive-processes&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: alive &lt;em&gt;&lt;strong&gt;nlp-provisioning&lt;/strong&gt;&lt;/em&gt; processes vs expected alive &lt;em&gt;&lt;strong&gt;nlp-provisioning&lt;/strong&gt;&lt;/em&gt; processes for 15 minutes.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;sum by (exported_job)(nlp_provisioning_alive_processes{exported_job=&amp;quot;nlp_provisioning_job&amp;quot;}) / sum by (exported_job) (nlp_provisioning_expected_alive_processes{exported_job=&amp;quot;nlp_provisioning_job&amp;quot;})!=1&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 15m&lt;/li&gt;
&lt;li&gt;summary: Processes killed in &lt;em&gt;&lt;strong&gt;nlp-provisioning&lt;/strong&gt;&lt;/em&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;scope-misc&#34;&gt;Scope: &lt;strong&gt;misc&lt;/strong&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;h4 id=&#34;probe_down&#34;&gt;&lt;strong&gt;probe_down&lt;/strong&gt;&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Description: The endpoint &amp;laquo; $labels.instance &amp;raquo; is down or not reachable. The blackbox exporter could not validate &amp;laquo; $labels.app &amp;raquo;&amp;rsquo;s health.&lt;/li&gt;
&lt;li&gt;Expr: &lt;code&gt;probe_success == 0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;For: 2m&lt;/li&gt;
&lt;li&gt;summary: PROBE FAILING&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;

      </description>
    </item>
    
    <item>
      <title>Docs: </title>
      <link>/docs/developers-workspace/monitoring/queries/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/docs/developers-workspace/monitoring/queries/</guid>
      <description>
        
        
        &lt;h1 id=&#34;queries&#34;&gt;Queries&lt;/h1&gt;


&lt;div class=&#34;pageinfo pageinfo-primary&#34;&gt;
&lt;p&gt;Description of the different types of queries that can be run in order to retrieve metrics from the system&lt;/p&gt;

&lt;/div&gt;

&lt;p&gt;To retrieve specific information from Aura about the logs and metrics it generates, we can run queries against the system.&lt;/p&gt;
&lt;p&gt;These queries are classified into two categories:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;a href=&#34;../../../docs/developers-workspace/monitoring/queries/basic-queries/&#34;&gt;Basic monitoring queries&lt;/a&gt;&lt;br&gt;
Guidelines for retrieving information from basic queries.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;a href=&#34;../../../docs/developers-workspace/monitoring/queries/db-queries/&#34;&gt;Basic database queries&lt;/a&gt;&lt;br&gt;
Guidelines for retrieving information from queries to the database.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;

      </description>
    </item>
    
  </channel>
</rss>
