Distributed Retrieval-Augmented Generation

Prerequisites

Ubuntu 22.04
Python 3.10
Ollama 0.5.7

Install Python requirements:

$ pip install -r requirements.txt

Install and start Ollama local LLM service:

# start ollama service:
$ sudo systemctl start ollama
# start serving model "Llama 3.2 - 3B"
# for other Ollama models: https://ollama.com/library
$ ollama run llama3.2:3b
# preload a model into Ollama to get faster response times:
$ curl http://localhost:11434/api/generate -d '{"model": "llama3.2:3b"}'
# check ollama log
$ journalctl -u ollama

If you do not have root permission, install and start Ollama manually:

# Download and extract the package:
$ curl -L https://ollama.com/download/ollama-linux-amd64.tgz -o ollama-linux-amd64.tgz
$ sudo tar -C <your-path>/ -xzf ollama-linux-amd64.tgz
# Create user-mode systemd.service file `~/.config/systemd/user/ollama.service` with the following content:
[Unit]
Description=Ollama Service
After=network-online.target

[Service]
ExecStart=<your-path>/bin/ollama serve
Restart=always
RestartSec=3
Environment="PATH=$PATH"

[Install]
WantedBy=default.target
# Start systemd ollama service in user mode:
$ systemctl --user enable ollama
$ systemctl --user start ollama
# Check ollama service log:
$ journalctl --user -f -u ollama

Install NLTK Data

$ python -m nltk.downloader all

Models

The following Ollama models are utilized in our experiments.

Datasets

The following Hugging Face datasets are utilized in our experiments.

Use HF_HUB_CACHE to configure where repositories from the Hub will be cached locally (models, datasets and spaces).

Run

$ python simulator.py

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
auto_scripts		auto_scripts
config		config
figures		figures
modules		modules
templates		templates
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
simulator.py		simulator.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Distributed Retrieval-Augmented Generation

Prerequisites

Models

Datasets

Run

DEBUG

About

Uh oh!

Languages

xuchenhao001/DRAG

Folders and files

Latest commit

History

Repository files navigation

Distributed Retrieval-Augmented Generation

Prerequisites

Models

Datasets

Run

DEBUG

About

Resources

Uh oh!

Stars

Watchers

Forks

Languages