Eros

Eros is a private, local AI memory to help you remember the little things about the people you care about.

It combines a Retrieval-Augmented Generation (RAG) system with an LLM, powered by Ollama, LangChain, and Chroma.

Shoutout to Greg Kamradt for the semantic chunking approach, adapted from his work: https://github.com/FullStackRetrieval-com/RetrievalTutorials/blob/main/tutorials/LevelsOfTextSplitting/5_Levels_Of_Text_Splitting.ipynb

Backstory

I can be a forgetful person — especially when it comes to the small but important details about people I'm dating. So I built Eros: a personal memory system to help me log what they say, what they like, and what I should probably remember.

The idea had been floating around for a while, but it all came together after I read Geoffrey Litt’s post. That gave me the push to actually build it.

How It Works

First, the logged text is converted into an embedding database using Chroma.
Whenever a query is made, the system retrieves the top N chunks that are closest in meaning to the query.
These N chunks are supplied to the LLM, along with the query, to generate a contextual answer.

At first, I tried normal chunking with a max token threshold, then single-sentence chunking. But neither gave accurate enough results, mostly because they didn’t take context into account.

With fixed-length chunking, the problem is that each chunk might contain unrelated content. Since logs are often short, multiple different ideas can end up in the same chunk.
Sentence chunking wasn’t great either — sometimes the relevant context spans two or three sentences, and splitting them apart weakened the retrieval.

While exploring alternatives, I came across Greg Kamradt’s notebook on chunking strategies. The semantic chunking section stood out:

Sentences are first split from the logged text.
A buffer window (e.g., BUFFER_SIZE sentences before and after) is applied to each sentence to provide local context.
Each buffered group is embedded into a vector using the embedding function.
Cosine distances are computed between each pair of adjacent embeddings.
A threshold (e.g., 95th percentile of distances) is used to detect significant shifts in meaning.
Breakpoints are inserted where the distance exceeds this threshold.
Final chunks are formed by splitting the original text at these breakpoints, resulting in semantically coherent segments.

Installation

Install Ollama
pip install -r requirements.txt
Run with `python eros.py

Usage

python eros.py add "<text>" — log something quickly
python eros.py add --continuous — multiline journaling (e.g. while you're in a call)
python eros.py update — embed new logs into the vector database
python eros.py update --init — reset and re-embed everything from scratch
python eros.py query "<question>" — ask something you've logged
python eros.py query --continuous — interactive chat mode
python eros.py profile — generate a one-paragraph summary
python eros.py profile --export filename.pdf — export the summary as a PDF

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
src		src
.gitignore		.gitignore
README.md		README.md
eros.py		eros.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Eros

Backstory

How It Works

Installation

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Languages

niranjanorkat/eros

Folders and files

Latest commit

History

Repository files navigation

Eros

Backstory

How It Works

Installation

Usage

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages