Check Hugging Face embedding models downloading
Guidelines to check if the Hugging Face models used in ATRIA are downloaded during the generate-db process
Introduction
The free embedding models currently used in ATRIA are paraphrase-multilingual-MiniLM-L12-v2 and multi-qa-distilbert-cos-v1, both from Hugging Face. These models back the two embeddings available by default in ATRIA: Local Sentence Transformer and Distilbert-based Local Sentence Transformer.
During the generate-db process, these models are loaded into memory, and the process may fail if there is a connection problem with Hugging Face. In that case, the only solution is to wait until the service is up and running again.
This document explains how to check whether the embedding models can be downloaded, so that you can tell whether a generate-db failure is caused by Hugging Face being unavailable.
Prerequisites
- Install huggingface-cli
pkgx install huggingface-cli
Check if the Hugging Face models are downloaded properly
To check whether the service is up, run the following command:
huggingface-cli download sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
If the download starts, the service is up, and you can restart the generate-db process.
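The command above only checks one of the two models. As a minimal sketch, the same check could be wrapped in a small bash function that tries both default models and reports each result. The function name `check_hf_models` and the `HF_CLI` override variable are hypothetical names introduced here for illustration, and the repository ID `sentence-transformers/multi-qa-distilbert-cos-v1` is assumed to follow the same `sentence-transformers/` prefix as the first model.

```shell
#!/usr/bin/env bash
# Sketch: check that both default ATRIA embedding models can be downloaded.
# check_hf_models and HF_CLI are hypothetical names used for illustration only.
# HF_CLI defaults to the real huggingface-cli binary.

check_hf_models() {
  local cli="${HF_CLI:-huggingface-cli}"
  local failed=0
  local model

  # The second repository ID is an assumption (same organization prefix as the first).
  for model in \
    sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 \
    sentence-transformers/multi-qa-distilbert-cos-v1; do
    # Attempt the download; a non-zero exit code is treated as a failed check.
    if "$cli" download "$model" >/dev/null 2>&1; then
      echo "OK: $model"
    else
      echo "FAILED: $model"
      failed=1
    fi
  done

  # Return 0 only if every model could be downloaded.
  return "$failed"
}
```

After sourcing the function, it might be used as `check_hf_models && echo "Hugging Face is reachable, generate-db can be restarted"`. Note that each check performs a real download, so the first successful run may take a while.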