- SPML Lab at National Taiwan University
- Taipei, Taiwan
- https://d223302.github.io/
Stars
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Unified automatic quality assessment for speech, music, and sound.
Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"
A library for mechanistic interpretability of GPT-style language models
Official repository for ALT (ALignment with Textual feedback).
800,000 step-level correctness labels on LLM solutions to MATH problems
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".
The model, data and code for the visual GUI Agent SeeClick
Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)
GPT-4-based personalized arXiv paper assistant bot
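《代码随想录》 LeetCode practice guide: a recommended order for working through 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, more than 50 mind maps, and solutions in C++, Java, Python, Go, JavaScript, and other languages, so studying algorithms is no longer confusing! 🔥🔥 Take a look, and you will wish you had found it sooner! 🚀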
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
[ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM
The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper: https://arxiv.org/abs/2212.01349)
Large Language Models Are Reasoning Teachers (ACL 2023)
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback