Stars
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
Pocket Flow: Codebase to Tutorial
Generative Representational Instruction Tuning
LangChain, LangGraph Open Tutorial for everyone!
Tool for generating high-quality synthetic datasets
A website startup template using the Chirpy theme gem.
🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks".
This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search".
Together Open Deep Research
Code for explaining and evaluating late chunking (chunked pooling)
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning
Comprehensive guide to learning RAG, from basics to advanced.
Fast lexical search implementing BM25 in Python using NumPy, Numba and SciPy
Search-R1: An efficient, scalable RL training framework, based on veRL, for LLMs that interleave reasoning with search engine calls
Provides a common interface to many IR ranking datasets.
Biomedical Question Answering Datasets.
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.org/abs/2012.12624
A curated list of 120+ LLM libraries, organized by category.