dongdong1203 / Starred · GitHub

Starred repositories

s1: Simple test-time scaling

Python 6,389 745 Updated May 19, 2025

NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing

Jupyter Notebook 81 23 Updated Jun 19, 2024

Development repository for the Triton language and compiler

MLIR 15,650 1,991 Updated May 23, 2025

Simple code for SpikedAttention (NeurIPS 2024)

Python 20 Updated Mar 30, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

Jupyter Notebook 7,987 512 Updated Apr 29, 2025

Open-source Windows and Office activator featuring HWID, Ohook, TSforge, KMS38, and Online KMS activation methods, along with advanced troubleshooting.

Batchfile 135,809 13,200 Updated May 23, 2025

An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).

C++ 79 13 Updated Jul 26, 2024

An open-source reimplementation of Google's Tensor Processing Unit (TPU).

Python 431 73 Updated Dec 6, 2017

Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts

C++ 120 14 Updated May 10, 2024

ABC: System for Sequential Logic Synthesis and Formal Verification

C 991 628 Updated May 23, 2025

A curated list of awesome hardware/chip design resources for deep learning

49 6 Updated May 1, 2018

A curated list of Computer Architecture and Systems resources

510 56 Updated Mar 5, 2025

Python 232 31 Updated Nov 9, 2022

Implementations of Buffets, which are efficient, composable idioms for implementing Explicit Decoupled Data Orchestration.

Verilog 71 17 Updated Apr 30, 2019

A general framework for optimizing DNN dataflow on systolic arrays

Python 35 10 Updated Jan 2, 2021

A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support

Python 15,686 536 Updated May 23, 2025

A co-design architecture on sparse attention

Python 52 4 Updated Aug 23, 2021

ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference

C++ 115 25 Updated Feb 10, 2025

DRAMsim3: a Cycle-accurate, Thermal-Capable DRAM Simulator

C++ 370 155 Updated Aug 3, 2024

SystemC training aimed at TLM.

C++ 29 9 Updated Jul 31, 2020

Deep Learning Accelerator Based on Eyeriss V2 Architecture with custom RISC-V extended instructions

Scala 191 32 Updated Jun 25, 2020

SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.

Python 1,611 270 Updated May 10, 2025

A massively parallel, high-level programming language

Rust 18,736 462 Updated Feb 23, 2025

A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold N…

2,934 273 Updated Feb 24, 2025

A Collection Of The State-of-the-art Metaheuristic Algorithms In Python (Metaheuristic/Optimizer/Nature-inspired/Biology)

Python 1,019 206 Updated Sep 3, 2024

A flexible cross-platform IIR and FIR engine for crossovers, room correction etc.

Rust 667 57 Updated Apr 27, 2025

Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…

C++ 335 83 Updated May 7, 2025

[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning

Scala 88 10 Updated Aug 27, 2024

Deep Learning library for Lava

Jupyter Notebook 159 75 Updated May 20, 2025

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,405 639 Updated May 21, 2025