Stars
This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retrieval-Augmented Generation (RAG) systems.
A collection of 1000+ survey papers on Natural Language Processing (NLP) and Machine Learning (ML).
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
MateCat is an AI-driven translation tool for language industry professionals. Matecat makes machine translation post-editing and project outsourcing easy.
A continuously updated collection of summarization datasets in 100+ languages, based on our paper "The State and Fate of Summarization Datasets: A Survey".
Guideline-following Large Language Model for Information Extraction
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
A bibliography and survey of the papers surrounding o1
Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness
Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)
A customizable Python platform that lets researchers plug and play various LLMs as participants in a goal-oriented discussion.
Dataset and Evaluation Code for the K-QA Benchmark.
Friends don't let friends make certain types of data visualization: what they are and why they are bad.
DSPy: The framework for programming—not prompting—language models
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
Build, evaluate, understand, and fix LLM-based apps
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]