8000 hellomaxwell (deep learning/NLP/RL) / Starred · GitHub

More Web Proxy on the site http://driver.im/

hellomaxwell

Follow

deep learning/NLP/RL hellomaxwell

Follow

3 followers · 9 following

Lists (11)

Sort

ckpt2pb

dstc9

out-of-domain

torch学习

句向量表示

大型数据集

大模型面

大模型面试

对话项目

数据扩增

聚类项目

Starred repositories

fufankeji / LLMs-Technology-Community-Beyondata

《赋范大模型技术社区》是针对各阶大模型学习者量身打造的基于各类大模型，包括环境设置、本地部署、高效微调、开发实战等技能在内的全流程指导！

388 67 Updated Feb 21, 2025

deepseek-ai / DeepSeek-V3

Python 97,032 15,780 Updated Apr 9, 2025

wdndev / llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

HTML 7,591 839 Updated Apr 30, 2025

explodinggradients / ragas

Supercharge Your LLM Application Evaluations 🚀

Python 9,226 917 Updated May 17, 2025

THUDM / GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 6,577 553 Updated Apr 19, 2025

fufankeji / fufan-chat-api

基于RAG的私有知识库问答系统

Python 251 61 Updated Nov 28, 2024

Tongji-KGLLM / RAG-Survey

2,015 133 Updated May 8, 2024

langchain-ai / rag-from-scratch

Jupyter Notebook 4,123 1,189 Updated Jul 9, 2024

HqWu-HITCS / Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

20,096 1,938 Updated May 19, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,502 1,421 Updated May 22, 2025

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 18,345 1,503 Updated Apr 29, 2025

Facico / Chinese-Vicuna

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案，结构参考alpaca

C 4,155 417 Updated Apr 18, 2025

open-compass / BotChat

Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.

Jupyter Notebook 149 6 Updated May 22, 2025

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 18,486 1,874 Updated May 21, 2025

abditag2 / bert

Forked from google-research/bert

TensorFlow code and pre-trained models for BERT

Python 24 6 Updated Apr 19, 2019

FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Python 9,706 706 Updated May 22, 2025

PaddlePaddle / TrustAI

飞桨可信AI

Python 187 36 Updated Jan 18, 2023

OFA-Sys / Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 5,208 497 Updated Aug 6, 2024

jina-ai / finetuner

🎯 Task-oriented embedding tuning for BERT, CLIP, etc.

Python 1,497 68 Updated Mar 11, 2024

hansonrobotics / bert-as-service

Python 34 16 Updated Nov 19, 2020

liuhuanyong / WordMultiSenseDisambiguation

WordMultiSenseDisambiguation, chinese multi-wordsense disambiguation based on online bake knowledge base and semantic embedding similarity compute,基于百科知识库的中文词语多词义/义项获取与特定句子词语语义消歧.

Python 128 53 Updated Dec 15, 2018

facebookresearch / atlas

Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03299)

Python 536 71 Updated Nov 28, 2023

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 9,250 924 Updated May 16, 2025

QiangLong2017 / Deep-Reiforcement-Learning

Python 62 15 Updated Oct 11, 2022

wangshusen / DRL

Deep Reinforcement Learning

3,894 626 Updated Dec 10, 2022

openai / lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,332 170 Updated Jul 25, 2023

huggingface / trl

Train transformer language models with reinforcement learning.

Python 13,856 1,900 Updated May 22, 2025

CarperAI / trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,647 477 Updated Jan 8, 2024

oobabooga / text-generation-webui

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 43,675 5,622 Updated May 21, 2025

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,503 4,383 Updated May 22, 2025

Starred topics

knowledge-aware-recommendation

0