Zhenyu001225 (Zhenyu Liu) / Starred · GitHub
  • Rensselaer Polytechnic Institute
  • Troy, NY


[ICLR 2025] Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron

Python 14 Updated Apr 30, 2025

(ICLR 2023 Spotlight) MPCFormer: fast, performant, and private transformer inference with MPC

Python 96 16 Updated Jun 12, 2023
C++ 18 1 Updated Dec 22, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 22,669 2,500 Updated Aug 12, 2024

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

1,455 94 Updated May 30, 2025

Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'

Python 14 1 Updated Jul 21, 2024

A curated list of resources dedicated to the safety of Large Vision-Language Models. This repository aligns with our survey titled A Survey of Safety on Large Vision-Language Models: Attacks, Defen…

98 6 Updated May 3, 2025
Python 15 2 Updated Jun 13, 2024

[CVPR 2025] Official implementation for "Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks"

Python 19 1 Updated May 25, 2025

Efficient Multimodal Large Language Models: A Survey

350 21 Updated Apr 29, 2025

Accepted by ECCV 2024

Python 130 1 Updated Oct 15, 2024

Accepted by IJCAI-24 Survey Track

Python 205 5 Updated Aug 25, 2024

Research and Materials on Hardware implementation of Transformer Model

Jupyter Notebook 263 35 Updated Feb 28, 2025

The official implementation of the paper "RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction"

Python 3 Updated May 18, 2025

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)

Python 136 24 Updated Jul 10, 2024

Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models

Python 167 21 Updated May 21, 2025

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 686 30 Updated Mar 19, 2025

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Cuda 291 31 Updated Nov 22, 2024

Official implementation of "When Adversarial Training Meets Vision Transformers: Recipes from Training to Architecture" published at NeurIPS 2022.

Python 33 4 Updated Sep 19, 2024

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,781 107 Updated Apr 3, 2025

GMoE could be the next backbone model for many kinds of generalization tasks.

Python 271 35 Updated Mar 21, 2023

Machine Learning Engineering Open Book

Python 13,868 835 Updated May 30, 2025

Awesome-Low-Rank-Adaptation

101 11 Updated Oct 13, 2024

A family of efficient edge language models in 100M~1B sizes.

13 Updated Feb 14, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 771 68 Updated Mar 14, 2025

[ICLR 2025] Mixture Compressor for Mixture-of-Experts LLMs Gains More

Python 44 1 Updated Feb 12, 2025

Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?

Python 107 4 Updated Oct 21, 2024