2.3.39 Satellite RAGLite
av edited this page Apr 26, 2025 · 2 revisions
Handle: raglite
URL: http://localhost:34391
RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite.
[!WARN] Unfortunately, due to bugs in the litellm and raglite implementations, it's not possible to use raglite with a local Ollama instance. See this issue for details.
# ⚠️ Because of the issue above, you'll need to supply
# an alternative model for RAGLite to use before starting.
# The OpenRouter model below is one example that works
harbor config set openrouter.api_key <API_KEY>
harbor config set raglite.model openrouter/openai/gpt-4o
# [Optional] Pre-build the image
harbor build raglite
# Run the service
harbor up raglite
- Harbor runs the chainlit integration as a raglite service
- Harbor shares your HuggingFace hub cache and token with the service, as it can download and run embedding models
The following options can be set via harbor config:
# RAGLite workspace location on the host
# Should be relative to $(harbor home) or an absolute path
RAGLITE_WORKSPACE ./raglite/data
# LiteLLM-compatible notation for the model to use
# for chat completions. Must support:
# - tool calls with streaming
# - model info resolution
RAGLITE_MODEL ollama/llama3.1:8b
# LiteLLM-compatible notation for the model to use
# for embeddings
RAGLITE_EMBEDDER ollama/mxbai-embed-large:latest
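As a rough sketch, the options above could be changed the same way the setup snippet sets raglite.model. The key names raglite.workspace and raglite.embedder are assumptions, derived by mirroring how RAGLITE_MODEL corresponds to raglite.model; they are not confirmed by this page. The values shown are the defaults listed above.

```shell
# Assumed key names, following the raglite.model <-> RAGLITE_MODEL pattern
# used earlier on this page.

# Keep the RAGLite workspace at the listed default location
harbor config set raglite.workspace ./raglite/data

# Use the listed default embedder, in LiteLLM-compatible notation
harbor config set raglite.embedder ollama/mxbai-embed-large:latest
```

Restarting the service with harbor up raglite afterwards is probably needed for the new values to take effect.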