hellomaxwell (deep learning/NLP/RL) / Starred · GitHub
Starred repositories

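The Fufan LLM Tech Community (《赋范大模型技术社区》) offers end-to-end guidance tailored to large-model learners at every level, covering skills such as environment setup, local deployment, efficient fine-tuning, and hands-on development across a range of large models.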

388 67 Updated Feb 21, 2025

Notes on the knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers.

HTML 7,591 839 Updated Apr 30, 2025

Supercharge Your LLM Application Evaluations 🚀

Python 9,226 917 Updated May 17, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 6,577 553 Updated Apr 19, 2025

A RAG-based question-answering system for private knowledge bases

Python 251 61 Updated Nov 28, 2024
Jupyter Notebook 4,123 1,189 Updated Jul 9, 2024

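A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials.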

20,096 1,938 Updated May 19, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,502 1,421 Updated May 22, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 18,345 1,503 Updated Apr 29, 2025

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model. A low-resource Chinese LLaMA + LoRA approach whose structure follows Alpaca.

C 4,155 417 Updated Apr 18, 2025

Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.

Jupyter Notebook 149 6 Updated May 22, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 18,486 1,874 Updated May 21, 2025

TensorFlow code and pre-trained models for BERT

Python 24 6 Updated Apr 19, 2019

Retrieval and Retrieval-augmented LLMs

Python 9,706 706 Updated May 22, 2025

PaddlePaddle (飞桨) Trustworthy AI

Python 187 36 Updated Jan 18, 2023

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 5,208 497 Updated Aug 6, 2024

🎯 Task-oriented embedding tuning for BERT, CLIP, etc.

Python 1,497 68 Updated Mar 11, 2024

WordMultiSenseDisambiguation: Chinese word-sense disambiguation based on an online Baike (encyclopedia) knowledge base and semantic embedding similarity. It retrieves the multiple senses of a Chinese word and disambiguates the word's meaning in a given sentence.

Python 128 53 Updated Dec 15, 2018

Code repository supporting the paper "Atlas: Few-shot Learning with Retrieval Augmented Language Models" (https://arxiv.org/abs/2208.03299)

Python 536 71 Updated Nov 28, 2023

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 9,250 924 Updated May 16, 2025

Deep Reinforcement Learning

3,894 626 Updated Dec 10, 2022

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,332 170 Updated Jul 25, 2023

Train transformer language models with reinforcement learning.

Python 13,856 1,900 Updated May 22, 2025

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,647 477 Updated Jan 8, 2024

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 43,675 5,622 Updated May 21, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,503 4,383 Updated May 22, 2025