Categories:
Atria RAG server metrics
List of metrics available in atria-rag-server
http_request_duration_seconds
This metric is intended to store the information related to all the incoming HTTP requests received by atria-rag-server.
It is stored as a Summary in Prometheus, so every sample, besides the defined labels, also includes its duration.
This metric allows measuring the behavior of the requests from any given endpoint. Specifically, the duration since the request lands in atria-rag-server until its HTTP response is returned:
- The number of requests during a time
- The average/min/max duration of these requests
Labels:
method: HTTP method used by the request being stored (GET,POST,PUT,DELETE, etc.)path: specific endpoint of the requeststatus_code: HTTP status code returned in the responseapplication: application name that is using the model
outgoing_request_duration_seconds
This metric is intended to store the information related to all the outgoing HTTP requests made by atria-rag-server. It is stored as a Summary in Prometheus, so every sample, besides the defined labels, also includes its duration.
The metric allows measuring the behavior of the requests to any given endpoint:
- The number of requests during a time
- The average/min/max duration of these requests
Labels:
method: HTTP method used by the request being stored (GET,POST,PUT,DELETE, etc.)host: host and domain where the request is being sentpath: specific endpoint of the requeststatus: HTTP status code returned in the response