zhiyuan8

Zack Li zhiyuan8

Building @NexaAI , Previously at @google and Amazon Lab126. Passionate AI developer, committed to lifelong learning.

57 followers · 21 following

Nexa AI Inc

Achievements

x4 x3

Achievements

x4 x3

Lists (1)

Sort

🚀 My stack

1 repository

Stars

Liuziyu77 / Visual-RFT

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 1,925 81 Updated May 21, 2025

onyx-dot-app / onyx

Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.

Python 12,918 1,690 Updated May 29, 2025

huggingface / smol-course

A course on aligning smol models.

Jupyter Notebook 5,864 2,081 Updated Jan 24, 2025

nomic-ai / gpt4all

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 73,435 8,026 Updated May 27, 2025

Unstructured-IO / unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …

HTML 11,332 943 Updated May 26, 2025

huggingface / agents-course

This repository contains the Hugging Face Agents Course.

MDX 19,056 1,267 Updated May 28, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,278 61 Updated Feb 8, 2025

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,498 1,407 Updated May 27, 2025

Chenglin-Yang / 1.58bit.flux

282 2 Updated Dec 31, 2024

mit-han-lab / smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,418 174 Updated Jul 12, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 17,567 1,701 Updated May 22, 2025

mit-han-lab / deepcompressor

Model Compression Toolbox for Large Language Models and Diffusion Models

Python 481 36 Updated Mar 27, 2025

huggingface / smollm

Everything about the SmolLM2 and SmolVLM family of models

Python 2,448 148 Updated Mar 31, 2025

NexaAI / nexa-sdk

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…

Python 4,556 635 Updated Mar 6, 2025

shenzheyu / llama.cpp

Forked from ggml-org/llama.cpp

LLM inference in C/C++

C++ 1 Updated Mar 27, 2025

meta-llama / llama-stack

Composable building blocks to build Llama Apps

Python 7,818 1,044 Updated May 28, 2025

state-spaces / mamba

Mamba SSM architecture

Python 14,966 1,311 Updated May 25, 2025

UbiquitousLearning / mllm

Fast Multimodal LLM on Mobile Devices

C++ 887 105 Updated May 27, 2025

Davidqian123 / AI-Soulmate

An interactive AI character with voice input, voice output, and profile image generation—all running locally with Nexa SDK and powered by Llama3 Uncensored Model. Enjoy a private and immersive expe…

Python 10 1 Updated Oct 7, 2024