- Singapore
- UTC +08:00
- https://chenhuiyu.github.io/
- https://orcid.org/0009-0005-8768-0460
Stars
Scalable data preprocessing and curation toolkit for LLMs
SECOM: On Memory Construction and Retrieval for Personalized Conversational Agents, ICLR 2025
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
Tips and resources to prepare for Behavioral interviews.
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingual text perception and comprehension capabilities across nine…
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
verl: Volcano Engine Reinforcement Learning for LLMs
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
🧭 SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Neural Networks: Zero to Hero
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
✨✨Latest Advances on Multimodal Large Language Models
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
Evaluate your LLM's response with Prometheus and GPT4 💯
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing the internal workings of Transformer-based language models. Check out the demo at https://huggingface.co/spaces/facebook/l…
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.