A full-stack Retrieval-Augmented Generation (RAG) agent combining a FastAPI backend (Python) with a modern React + TypeScript frontend. The backend uses FAISS (and optionally Pinecone) for vector-based retrieval, integrates with LLMs (e.g., OpenAI GPT), and exposes REST and WebSocket endpoints for conversational and retrieval tasks. The frontend provides a sleek, real-time chat interface.
The RAG Agent enables intelligent, context-aware conversations by augmenting large language models with efficient retrieval from a vector store. It is designed for applications such as recipe generation, knowledge assistants, or any scenario where grounding LLMs in domain data is valuable.
Backend (FastAPI, Python):
- Retrieval-augmented generation using FAISS or Pinecone vector stores
- Integration with OpenAI LLMs for response generation and embeddings
- Modular service and repository layers (e.g., for drinks, ingredients)
- WebSocket and REST API endpoints (`/retrieve`, `/generate`; see the sketch after this list)
- Environment-based configuration, scalable and extensible design
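
The endpoints map naturally onto FastAPI routing. Below is a minimal sketch of how `/retrieve` and a `/generate` WebSocket might be wired; the payload shape and the `search_vector_store` helper are illustrative assumptions, not the repo's actual code:

```python
from typing import List

from fastapi import FastAPI, WebSocket, WebSocketDisconnect
from pydantic import BaseModel

app = FastAPI()

class RetrieveRequest(BaseModel):
    query: str      # hypothetical payload shape
    top_k: int = 4

async def search_vector_store(query: str, k: int) -> List[str]:
    """Stand-in for the FAISS/Pinecone-backed service layer."""
    return [f"document matching {query!r}"] * k

@app.post("/retrieve")
async def retrieve(req: RetrieveRequest):
    docs = await search_vector_store(req.query, req.top_k)
    return {"documents": docs}

@app.websocket("/generate")
async def generate(ws: WebSocket):
    await ws.accept()
    try:
        while True:
            question = await ws.receive_text()
            context = await search_vector_store(question, k=4)
            # The real app would pass context + question to the LLM here.
            await ws.send_text(f"Answer grounded in {len(context)} documents")
    except WebSocketDisconnect:
        pass
```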
Frontend (React, TypeScript, Vite):
- Real-time chat UI with WebSocket support
- Modern, responsive design using Ant Design components
- Type-safe, modular codebase
```bash
git clone <repo-url>
cd rag-agent
```
- Python 3.8+
- uv (for Python package management)
- (Optional) Docker for containerized deployment
```bash
cd backend
# Create a virtual environment
uv venv .venv
```
- Copy `.env.example` to `.env` and fill in the required values (API keys, etc.); an illustrative example follows this list
- By default, uses SQLite and FAISS. Pinecone can be enabled via environment variables.
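
For reference, a hypothetical `.env` might look like the snippet below; the variable names are assumptions for illustration (check `.env.example` for the actual keys):

```env
OPENAI_API_KEY=sk-...
# Optional: enable Pinecone instead of the default FAISS store
USE_PINECONE=true
PINECONE_API_KEY=...
PINECONE_INDEX=rag-agent
```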
```bash
uv run uvicorn --app-dir app main:app --reload
```
- API: http://localhost:8000
- WebSocket: ws://localhost:8000
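
With the server running, both transports can be exercised from a small Python script (a sketch: the JSON payload shape and the `/generate` WebSocket path are assumptions, and it uses the third-party `httpx` and `websockets` packages):

```python
import asyncio

import httpx
import websockets

async def main() -> None:
    # REST: query the /retrieve endpoint (payload shape assumed).
    async with httpx.AsyncClient() as client:
        resp = await client.post(
            "http://localhost:8000/retrieve",
            json={"query": "negroni variations", "top_k": 4},
        )
        print(resp.json())

    # WebSocket: a single conversational turn (path assumed).
    async with websockets.connect("ws://localhost:8000/generate") as ws:
        await ws.send("Suggest a drink with gin and Campari")
        print(await ws.recv())

asyncio.run(main())
```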
- Node.js 16+
- pnpm (recommended), npm, or yarn
```bash
cd ../frontend
pnpm install  # or npm install or yarn install
pnpm dev      # or npm run dev or yarn dev
```
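
The flow through the agent is sketched below: user input first passes through a follow-up check (its own prompt, LLM, and parser), the FAISS vector store supplies retrieved context, and the final prompt combines that context with the message history and format instructions derived from the output model before the answering LLM call.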
```mermaid
flowchart TB
    UserInput[User Input] --> Prompt1[Prompt]
    UserInput --> CheckIsFollowUp
    subgraph CheckIsFollowUp[Check is follow up]
        Prompt2[Prompt] --> LLM1((LLM)) --> Parser1[Parser]
    end
    CheckIsFollowUp --> FAISS["FAISS (Vector Store)"] -.-> Prompt1
    OutputModel[Output Model] --> FormatInstruction[Format Instruction] --> Prompt1
    MessageHistory[Message History] --> Prompt1
    MessageHistory --> CheckIsFollowUp
    Prompt1 --> LLM2((LLM)) --> Parser2[Parser] --> Result
```
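
In LangChain terms (see the tech stack below), the same flow might look roughly like this sketch; the prompt wording, model name, seed documents, and follow-up heuristic are assumptions, and the structured-output step (Output Model / Format Instruction) is omitted for brevity:

```python
from langchain_community.vectorstores import FAISS
from langchain_core.output_parsers import StrOutputParser
from langchain_core.prompts import ChatPromptTemplate
from langchain_openai import ChatOpenAI, OpenAIEmbeddings

llm = ChatOpenAI(model="gpt-4o-mini")  # model name is an assumption

# Vector store seeded with a couple of toy domain documents.
store = FAISS.from_texts(
    ["Negroni: gin, Campari, sweet vermouth", "Martini: gin, dry vermouth"],
    OpenAIEmbeddings(),
)

# "Check is follow up": a small prompt -> LLM -> parser chain.
followup_chain = (
    ChatPromptTemplate.from_template(
        "History:\n{history}\nIs '{question}' a follow-up question? Answer yes or no."
    )
    | llm
    | StrOutputParser()
)

# Main chain: retrieved context + history + question -> answer.
answer_chain = (
    ChatPromptTemplate.from_template(
        "Context:\n{context}\nHistory:\n{history}\nQuestion: {question}"
    )
    | llm
    | StrOutputParser()
)

def ask(question: str, history: str = "") -> str:
    verdict = followup_chain.invoke({"history": history, "question": question})
    # A follow-up leans on the conversation history when querying the store.
    query = f"{history} {question}" if verdict.strip().lower().startswith("yes") else question
    docs = store.similarity_search(query, k=2)
    context = "\n".join(d.page_content for d in docs)
    return answer_chain.invoke(
        {"context": context, "history": history, "question": question}
    )
```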
- Backend: FastAPI, FAISS, Pinecone, SQLAlchemy, LangChain, OpenAI, uv
- Frontend: React, TypeScript, Vite, Ant Design, Socket.IO
MIT (or specify your license)