EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations

Overview

This paper presents EasyRAG, a simple, lightweight, and efficient retrieval-augmented generation framework for automated network operations. Our framework has three advantages. The first is accurate question answering. We designed a straightforward RAG scheme based on (1) a specific data processing workflow (2) dual-route sparse retrieval for coarse ranking (3) LLM Reranker for reranking (4) LLM answer generation and optimization. This approach achieved first place in the GLM4 track in the preliminary round and second place in the GLM4 track in the semifinals. The second is simple deployment. Our method primarily consists of BM25 retrieval and BGE-reranker reranking, requiring no fine-tuning of any models, occupying minimal VRAM, easy to deploy, and highly scalable; we provide a flexible code library with various search and generation strategies, facilitating custom process implementation. The last one is efficient inference. We designed an efficient inference acceleration scheme for the entire coarse ranking, reranking, and generation process that significantly reduces the inference latency of RAG while maintaining a good level of accuracy; each acceleration scheme can be plug-and-play into any component of the RAG process, consistently enhancing the efficiency of the RAG system.

Requirements

EasyRAG needs Python3.10.14 and at least 1 GPU with 16GB.

You need to change llm_keys in src/easyrag.yaml to your GLM keys.

pip install -r requirements.txt
git lfs install
bash scripts/download.sh # download models
bash scripts/process.sh # process zedx data

Reproduce

1. Run Directly

< 8000 div class="highlight highlight-source-shell notranslate position-relative overflow-auto" dir="auto" data-snippet-clipboard-copy-content="cd src # run challenge questions python3 main.py # copy answer file cp submit_result.jsonl ../answer.jsonl">

cd src
# run challenge questions
python3 main.py 
# copy answer file
cp submit_result.jsonl ../answer.jsonl

Name		Name	Last commit message	Last commit date
Latest commit History 109 Commits
assets		assets
scripts		scripts
src		src
.gitignore		.gitignore
CITATION.cff		CITATION.cff
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations

Table of Contents

Overview

Requirements

Reproduce

1. Run Directly

2. Run with Docker

Usage

1. API

2.WebUI

Project Structure

Citation

Acknowledgement

Star History

About

Releases

Packages

Contributors 4

Languages

License

BUAADreamer/EasyRAG

Folders and files

Latest commit

History

Repository files navigation

EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations

Table of Contents

Overview

Requirements

Reproduce

1. Run Directly

2. Run with Docker

Usage

1. API

2.WebUI

Project Structure

Citation

Acknowledgement

Star History

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages