diff --git a/docs/getting_started/installation.md b/docs/getting_started/installation.md
index 94fbde5f9d8c8ac4bf892226794050f55abd8011..1d736a10e0ea94765d7bd439bde9daef9c0f242d 100644
--- a/docs/getting_started/installation.md
+++ b/docs/getting_started/installation.md
@@ -26,7 +26,7 @@ need additional environment keys + tokens setup depending on the LLM provider.
 
 If you don't wish to use OpenAI, the environment will automatically fallback to using `LlamaCPP` and `llama2-chat-13B` for text generation and `BAAI/bge-small-en` for retrieval and embeddings. These models will all run locally.
 
-In order to use `LlamaCPP`, follow the installation guide [here](/examples/llm/llama_2_llama_cpp.ipynb). You'll need to install the `llama-cpp-python` package, preferably compiled to support your GPU. This will use aronund 11.5GB of memory across the CPU and GPU.
+In order to use `LlamaCPP`, follow the installation guide [here](/examples/llm/llama_2_llama_cpp.ipynb). You'll need to install the `llama-cpp-python` package, preferably compiled to support your GPU. This will use around 11.5GB of memory across the CPU and GPU.
 
 In order to use the local embeddings, simply run `pip install sentence-transformers`. The local embedding model uses about 500MB of memory.
 
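
For context on the passage this patch touches: a minimal sketch of wiring up the local fallback it describes, assuming the llama-index 0.8-era API (`LlamaCPP` under `llama_index.llms`, string-prefixed local embeddings). The model path and data directory below are hypothetical placeholders; the notebook linked in the doc remains the authoritative guide.

```python
from llama_index import ServiceContext, SimpleDirectoryReader, VectorStoreIndex
from llama_index.llms import LlamaCPP

# Load a local llama-2-chat model via llama-cpp-python; the path is a
# placeholder for wherever you saved the weights.
llm = LlamaCPP(
    model_path="./models/llama-2-13b-chat.gguf",  # hypothetical local path
    temperature=0.1,
    max_new_tokens=256,
    context_window=3900,
    # Offloads layers to the GPU only if llama-cpp-python was compiled
    # with GPU support; otherwise everything runs on the CPU.
    model_kwargs={"n_gpu_layers": 1},
)

# "local:BAAI/bge-small-en" pulls the ~500MB sentence-transformers
# embedding model mentioned in the doc and runs it locally.
service_context = ServiceContext.from_defaults(
    llm=llm,
    embed_model="local:BAAI/bge-small-en",
)

# Build an index over a local folder of documents using the fully
# local LLM + embedding stack.
documents = SimpleDirectoryReader("./data").load_data()
index = VectorStoreIndex.from_documents(documents, service_context=service_context)
```

With this setup, `n_gpu_layers` is the knob that determines how much of the roughly 11.5GB footprint lands on the GPU versus the CPU.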