Categories:
ATRIA RAG Generate DB architecture and components
Development architecture and technical components of the atria-rag-generate-db
Architecture overview
The following diagram schematically shows the main technical components integrated into atria-rag-generate-db.

A brief description of the technical components is included below:
Data sources
A project contains information required for the execution of the generation of the databases: specific path of documents to feed the databases, allowed file extensions, etc. It can read from different sources, this source type is defined in the extensions field.
Before the information from the documents is stored in the corresponding database, the documents are processed, e.g., they are cut up and cleaned.
Retrievers
The retrievers are in charge of reading the information from the documents and feeding the databases.
The retrievers are defined in the retrievers field of the project. Each retriever is associated with a database in order to feed or retrieve information from it.