- Suwon, Republic of Korea
Highlights
- Pro
Stars
Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences"
Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.
Language Models as Semantic Indexers (ICML 2024)
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
[Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token
[EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA
Memory for AI Agents; SOTA in AI Agent Memory; Announcing OpenMemory MCP - local and secure memory management.
"FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)
[Preprint] Learning to Filter Context for Retrieval-Augmented Generaton
Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.
Codebase of ACL2024 paper "Spiral of Silence: How is Large Language Model Killing Information Retrieval?—A Case Study on Open Domain Question Answering"
Awesome LLM compression research papers and tools.
The official repository for MGFiD (NAACL 2024 Findings)
This is official code for the NAACL 2021 paper: "MelBERT: Metaphor Detection via Contextualized Late Interaction usingMetaphorical Identification Theories".
This is the official implementation of ConQueR. (SIGIR'23)
This is the official code for the EMNLP 2023 paper "GLEN: Generative Retrieval via Lexical Index Learning".
[CIKM'23] "Toward a Better Understanding of Loss Functions for Collaborative Filtering"
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"
LlamaIndex is the leading framework for building LLM-powered agents over your data.
The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).