Highlights
- Pro
Lists (5)
Sort Name ascending (A-Z)
Stars
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
Python port of CausalImpact R library
Python Causal Impact Implementation Based on Google's R Package. Built using TensorFlow Probability.
AI Logging for Interpretability and Explainability🔬
This repository collects all relevant resources about interpretability in LLMs
[COLING 2025] Automated Molecular Concept Generation and Labeling with Large Language Models
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature
🧑🚀 全世界最好的LLM资料总结(视频生成、Agent、辅助编程、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.
8000 allenai / OLMo
Modeling, training, eval, and inference code for OLMo
ICML 2024 Predicting and Interpreting Energy Barriers of Metallic Glasses with Graph Neural Networks
💱 A curated list of data valuation (DV) to design your next data marketplace
`dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.
A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...
The nnsight package enables interpreting and manipulating the internals of deep learned models.
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..
Must-read Papers on Knowledge Editing for Large Language Models.
[NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind
Train transformer language models with reinforcement learning.
【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。