8000 chenhuiyu (Huiyu Chen) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View chenhuiyu's full-sized avatar
🍊
Coding
🍊
Coding

Block or report chenhuiyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Scalable data pre processing and curation toolkit for LLMs

Jupyter Notebook 933 134 Updated Jun 4, 2025

SECOM: On Memory Construction and Retrieval for Personalized Conversational Agents, ICLR 2025

Jupyter Notebook 25 1 Updated Mar 1, 2025

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 6,368 1,142 Updated May 22, 2025

Tips and resources to prepare for Behavioral interviews.

6,456 1,225 Updated Apr 20, 2025

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 33,615 3,828 Updated Jun 6, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 17,441 2,506 Updated Jun 6, 2025

A-MEM: Agentic Memory for LLM Agents

Python 389 48 Updated May 18, 2025

[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)

Jupyter Notebook 4,483 555 Updated Feb 12, 2025

MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingual text perception and comprehension capabilities across nine…

Python 59 3 Updated May 15, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 2,489 383 Updated Jun 8, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,130 1,182 Updated Jun 8, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,873 1,750 Updated Feb 26, 2025

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 374 28 Updated Jun 3, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 5,475 593 Updated Jun 6, 2025

🧭 SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages

Python 13 1 Updated Feb 12, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 50,800 7,393 Updated Apr 20, 2025

Neural Networks: Zero to Hero

Jupyter Notebook 13,963 1,940 Updated Aug 18, 2024

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,850 779 Updated May 15, 2025

✨✨Latest Advances on Multimodal Large Language Models

15,475 1,002 Updated Jun 6, 2025

🚢 Data Toolkit for Sailor Language Models

Python 91 10 Updated Feb 24, 2025

左程云的算法和数据结构通关课

Java 2,141 555 Updated Jun 8, 2025

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 40,209 3,187 Updated Jun 6, 2025

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 950 54 Updated Apr 25, 2025

LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/l…

Python 821 63 Updated Dec 3, 2024

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 1,410 85 Updated Jan 7, 2025

The LLM Evaluation Framework

Python 4B6B 7,144 654 Updated Jun 8, 2025

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models

Python 531 63 Updated Jun 26, 2024

A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API

TypeScript 9,030 1,568 Updated Jun 4, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,913 1,461 Updated May 29, 2025
Next
0