Retrieval-augmented generation with open-source large language models

Prerequisites

The tested environment is

$ uname -a
Linux richardfeynman 5.15.133.1-microsoft-standard-WSL2 #1 SMP Thu Oct 5 21:02:42 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux

$ cat /etc/lsb-release
DISTRIB_ID=Ubuntu
DISTRIB_RELEASE=22.04
DISTRIB_CODENAME=jammy
DISTRIB_DESCRIPTION="Ubuntu 22.04.3 LTS"

$ poetry --version
Poetry (version 1.5.1)

$ /usr/local/cuda/bin/nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2022 NVIDIA Corporation
Built on Wed_Sep_21_10:33:58_PDT_2022
Cuda compilation tools, release 11.8, V11.8.89
Build cuda_11.8.r11.8/compiler.31833905_0

$ nvidia-smi --query-gpu=gpu_name,memory.total --format=csv
name, memory.total [MiB]
NVIDIA GeForce RTX 3060, 12288 MiB

Setup

Using poetry

poetry install

or using the provided requirements.txt with pip

pip install -r requirements.txt

Presentation

Each of the notebooks are representable as slides in a browser, i.e. for notebooks/00_introduction.ipynb, run

jupyter nbconvert notebooks/00_introduction.ipynb --to slides --post serve

This will open a browser window with the notebook rendered as slides.

References

Papers

RAG: https://arxiv.org/abs/2005.11401v4
MistralAI paper: https://arxiv.org/pdf/2310.06825.pdf
OpenAI Text embeddings paper: https://arxiv.org/abs/2201.10005
Similarity search: https://arxiv.org/abs/1702.08734

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
notebooks		notebooks
resource		resource
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
TODO.md		TODO.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Retrieval-augmented generation with open-source large language models

Prerequisites

Setup

Presentation

References

Papers

About

Uh oh!

Releases

Packages

Uh oh!

Languages

mircomarahrens/rag-demo

Folders and files

Latest commit

History

Repository files navigation

Retrieval-augmented generation with open-source large language models

Prerequisites

Setup

Presentation

References

Papers

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages