8000 chi2liu (633WHU) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View chi2liu's full-sized avatar
  • PayPal
  • Shanghai, China

Block or report chi2liu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Code for data-aware compression of DeepSeek models

Python 33 5 Updated Jun 10, 2025

🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …

Python 13,475 1,004 Updated Jun 11, 2025

Fully open reproduction of DeepSeek-R1

Python 24,755 2,291 Updated Jun 2, 2025

A simple CLI chatbot that demonstrates the integration of the Model Context Protocol (MCP).

Python 196 22 Updated Dec 5, 2024
C# 54 9 Updated Oct 29, 2024

Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.

Jupyter Notebook 631 52 Updated Mar 22, 2025

A pedagogical implementation of Autograd

Jupyter Notebook 984 101 Updated May 26, 2020

Benchmarking the serving capabilities of vLLM

Python 46 11 Updated Aug 20, 2024

FlashMLA: Efficient MLA decoding kernels

Cuda 11,594 839 Updated Apr 29, 2025

Serving CrewAI Agent as REST API with BentoML, optionally with self-host open-source LLMs

Python 17 1 Updated Dec 23, 2024

[ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models

Python 49 5 Updated Sep 4, 2024
Python 275 38 Updated Jun 11, 2025
Python 54 8 Updated Nov 18, 2024
Jupyter Notebook 33 13 Updated Jul 31, 2024

A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current issues and future directions.

221 13 Updated Dec 30, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 25,772 2,641 Updated Jun 10, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 45,193 4,482 Updated Jun 11, 2025

LLM101n: Let's build a Storyteller

33,599 1,829 Updated Aug 1, 2024

LLM Inference benchmark

Python 419 39 Updated Jul 23, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,495 556 Updated Jun 11, 2025

Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper

TypeScript 4,902 764 Updated Sep 28, 2024

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…

Go 33,161 2,537 Updated Jun 11, 2025

Convert Pydantic from V1 to V2 ♻

Python 338 27 Updated Jul 25, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,985 556 Updated Apr 11, 2025

A framework for prompt tuning using Intent-based Prompt Calibration

Python 2,560 223 Updated Apr 10, 2025

Material for gpu-mode lectures

Jupyter Notebook 4,580 460 Updated Feb 9, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 22,765 2,506 Updated Aug 12, 2024

📚A curated list of Awesome LLM Inference Papers with Codes.

Python 4,107 283 Updated Jun 9, 2025

Python library & examples for Masked Language Model Scoring (ACL 2020)

Python 342 60 Updated Dec 20, 2022
Next
0