8000 minmie (arvinChen) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View minmie's full-sized avatar

Block or report minmie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A powerful tool for creating fine-tuning datasets for LLM

JavaScript 7,390 756 Updated May 22, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,535 1,424 Updated May 22, 2025

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 5,213 497 Updated Aug 6, 2024

FireAct: Toward Language Agent Fine-tuning

Python 278 20 Updated Oct 22, 2023

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,159 767 Updated Oct 16, 2024

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 39,229 3,074 Updated May 22, 2025

A course on aligning smol models.

Jupyter Notebook 5,851 2,069 Updated Jan 24, 2025

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 98,747 14,837 Updated May 23, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 44,391 4,357 Updated May 22, 2025

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,867 274 Updated Apr 13, 2025

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 10,905 1,615 Updated May 23, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 6,579 551 Updated Apr 19, 2025

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

Python 1,921 166 Updated Sep 23, 2024

This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face discord: hf.co/join/discord

Jupyter Notebook 635 194 Updated May 15, 2025

Ultralytics YOLO11 🚀

Python 41,164 7,961 Updated May 23, 2025

deep learning for image processing including classification and object-detection etc.

Python 24,848 8,182 Updated Jan 12, 2025

A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io

Python 1,164 191 Updated Mar 25, 2025

An open-source tool-augmented conversational language model from Fudan University

Python 12,049 1,147 Updated Jul 13, 2024

State-of-the-Art Text Embeddings

Python 16,753 2,597 Updated May 23, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡ 8974 ️🍸 🍹 🍷

Python 4,448 240 Updated May 23, 2025

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python 13,695 1,600 Updated Jan 13, 2025

Inference code for Llama models

Python 58,265 9,774 Updated Jan 26, 2025

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Python 7,159 570 Updated Sep 23, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

20,113 1,938 Updated May 19, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 10,679 1,837 Updated May 15, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 37,163 6,304 Updated May 23, 2025

Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.

Python 20,558 2,276 Updated Mar 2, 2025

DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation

Python 339 24 Updated Jun 11, 2024

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 9,197 1,019 Updated May 22, 2025

A toolkit for developing and comparing reinforcement learning algorithms.

Python 35,997 8,674 Updated Oct 11, 2024
Next
0