This project implements a Retrieval-Augmented Generation (RAG) system without using LangChain. It uses the following core dependencies, which the sketch after this list shows working together:

- OpenAI: LLM completions and embeddings, for answering questions and for semantic search
- Pinecone: Vector database for storing and retrieving embeddings
- tiktoken: OpenAI's tokenizer, used to size text chunks
- python-dotenv: Loads configuration from a `.env` file
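As a quick, illustrative sketch (assuming the v1 OpenAI and v3 Pinecone Python clients; the encoding name is an assumption, not taken from this repo), here is how the four dependencies typically slot together:

```python
# Illustrative wiring of the core dependencies; assumes the environment
# variables from the .env step below are already set.
import os

import tiktoken
from openai import OpenAI
from pinecone import Pinecone

client = OpenAI()  # picks up OPENAI_API_KEY from the environment
pc = Pinecone(api_key=os.environ["PINECONE_API_KEY"])
index = pc.Index(os.environ["PINECONE_INDEX_NAME"])

# tiktoken counts tokens so chunks and prompts stay within model limits.
enc = tiktoken.get_encoding("cl100k_base")  # assumed encoding
print(len(enc.encode("How many tokens is this sentence?")))
```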
- Create a virtual environment (recommended):

  ```bash
  python -m venv venv
  source venv/bin/activate  # On Unix/macOS
  ```

- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```
- Create a `.env` file and add your API keys (python-dotenv loads them at startup, as shown below):

  ```
  OPENAI_API_KEY=your_openai_api_key
  PINECONE_API_KEY=your_pinecone_api_key
  PINECONE_INDEX_NAME=your_pinecone_index_name
  ```
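For reference, python-dotenv exposes these keys to the application like so (the assert and print are just sanity checks, not part of this project's code):

```python
import os

from dotenv import load_dotenv

load_dotenv()  # reads key=value pairs from .env into os.environ
assert os.getenv("OPENAI_API_KEY"), "OPENAI_API_KEY missing from .env"
print("Pinecone index:", os.getenv("PINECONE_INDEX_NAME"))
```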
- Index documents:

  ```bash
  python main.py -l /path/to/documents
  ```

- Ask a question:

  ```bash
  python main.py "What is the best way to do great work?"
  ```

- Both index and ask (argument handling sketched after this list):

  ```bash
  python main.py -l /path/to/documents "What is the best way to do great work?"
  ```
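main.py's real argument parsing isn't reproduced here, but a CLI accepting this combination of a flag and an optional question could be built with argparse roughly as follows (a hypothetical sketch):

```python
import argparse

parser = argparse.ArgumentParser(description="RAG CLI (illustrative sketch)")
parser.add_argument("-l", "--load", metavar="DIR",
                    help="directory of documents to index")
parser.add_argument("question", nargs="?",
                    help="question to answer against the index")
args = parser.parse_args()

if args.load:
    print(f"indexing documents under {args.load}")  # placeholder for real logic
if args.question:
    print(f"answering: {args.question}")            # placeholder for real logic
```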
The project consists of several key components:
- `main.py`: Core application logic and CLI interface
- `utils.py`: File reading utilities
- `tokenization.py`: Token-related functions for text processing
- `requirements.txt`: Project dependencies
- `.env.example`: Template for environment variables
The project provides the following features:

- Document processing and chunking
- Semantic search using OpenAI embeddings
- Vector storage with Pinecone
- RAG-based question answering (end-to-end flow sketched after this list)
- Source citations in responses
- Progress tracking for long operations
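As an illustration of how these features typically compose (not this project's exact code: the model names, chunk size, file name, and metadata fields are all assumptions), a token-based chunk/embed/retrieve/answer loop looks roughly like this:

```python
# Hypothetical end-to-end sketch: chunking, embedding, Pinecone storage,
# retrieval, and a cited answer. Values marked "assumed" are not from main.py.
import os

import tiktoken
from openai import OpenAI
from pinecone import Pinecone

client = OpenAI()
index = Pinecone(api_key=os.environ["PINECONE_API_KEY"]).Index(
    os.environ["PINECONE_INDEX_NAME"]
)
enc = tiktoken.get_encoding("cl100k_base")  # assumed encoding

def chunk(text: str, max_tokens: int = 300) -> list[str]:
    """Split text into pieces of at most max_tokens tokens."""
    tokens = enc.encode(text)
    return [enc.decode(tokens[i:i + max_tokens])
            for i in range(0, len(tokens), max_tokens)]

def embed(texts: list[str]) -> list[list[float]]:
    resp = client.embeddings.create(model="text-embedding-3-small",  # assumed model
                                    input=texts)
    return [d.embedding for d in resp.data]

# Indexing: store each chunk's embedding with its text and source as metadata.
source = "essay.txt"  # assumed file name
chunks = chunk(open(source, encoding="utf-8").read())
index.upsert(vectors=[
    (f"{source}-{i}", vec, {"text": c, "source": source})
    for i, (c, vec) in enumerate(zip(chunks, embed(chunks)))
])

# Answering: retrieve the top chunks, then ask the LLM to answer with citations.
question = "What is the best way to do great work?"
hits = index.query(vector=embed([question])[0], top_k=3, include_metadata=True)
context = "\n\n".join(
    f"[{m.metadata['source']}] {m.metadata['text']}" for m in hits.matches
)
reply = client.chat.completions.create(
    model="gpt-4o-mini",  # assumed model
    messages=[
        {"role": "system",
         "content": "Answer using only the context; cite the bracketed sources."},
        {"role": "user", "content": f"Context:\n{context}\n\nQuestion: {question}"},
    ],
)
print(reply.choices[0].message.content)
```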
This project is licensed under the BSD 3-Clause License - see the LICENSE file for details.