-
Institute of Computing Technology, CAS
- Beijing
Highlights
- Pro
More
Starred repositories
Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.
RLHF中文手册 - 详细解析RLHF全流程优化阶段,涵盖指令调优、奖励模型训练,以及拒绝采样、强化学习和直接对齐算法等关键技术。
Quarto template for Chinese academic writing
Train transformer language models with reinforcement learning.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
LangChain 的中文入门教程
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Official implementation for LaCo (EMNLP 2024 Findings)
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"
800,000 step-level correctness labels on LLM solutions to MATH problems
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
A paper list of some recent works about Token Compress for Vit and VLM
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
Resources of deep learning for mathematical reasoning (DL4MATH).