Stars
Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.
A Lighting PyTorch Framework for Recommendation Models, Easy-to-use and Easy-to-extend.
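A beginner's guide to recommender systems. It comprehensively covers the theory of industrial-grade recommender systems (Wang Shusen's open course on recommender systems, which explains real industrial recommender systems using Xiaohongshu scenarios), how to train models with TensorFlow2, and how to implement high-performance, high-concurrency, high-availability Golang inference microservices. Comprehensively introduces the theory of industrial recommender systems, how to train …
Generate high-definition short videos with one click using AI large language models.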
Plug-and-Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
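This project shares the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications).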
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding
Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
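A collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.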
Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.
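Chinese named entity recognition (NER). Includes the latest Chinese NER papers, Chinese entity recognition tools, and datasets, as well as Chinese pre-trained models, word embeddings, and NER surveys.
Currently covers 232 large models, including commercial models such as chatgpt, gpt-4o, o3-mini, Google gemini, Claude3.5, Zhipu GLM-Zero, ERNIE Bot (文心一言), qwen-max, Baichuan, iFlytek Spark, SenseTime senseChat, and minimax, as well as DeepSeek-R1, qwq-32b, deepseek-v3, qwen2.5, llama3.3, phi-4, glm4, gemma3, mistral, 书生in…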
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
ChatGLM2-6B: An Open Bilingual Chat LLM
Code for Label Semantics for Few Shot Named Entity Recognition
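A collection of NLP competition strategy implementations, baselines for various tasks, competition experience posts (current, past, and practice competitions), NLP conference dates, popular media accounts, GPU recommendations, and more; continuously updated.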