Stars
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
A generative speech model for daily dialogue.
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
Recipes to train reward model for RLHF.
Set of tools to assess and improve LLM security.
[NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey
Z-Bench 1.0 by 真格基金:一个麻瓜的大语言模型中文测试集。Z-Bench is a LLM prompt dataset for non-technical users, developed by an enthusiastic AI-focused team in Zhenfund.
Question and Answer based on Anything.
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
pke_zh, python keyphrase extraction for chinese(zh). 中文关键词或关键句提取工具,实现了KeyBert、PositionRank、TopicRank、TextRank等算法,开箱即用。
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
ZYN: Zero-Shot Reward Models with Yes-No Questions
Train transformer language models with reinforcement learning.
Retrieval and Retrieval-augmented LLMs
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Universal and Transferable Attacks on Aligned Language Models
Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。