
row-major matmul optimization

C++ 596 80 Updated Sep 9, 2023

NanoGPT (124M) in 5 minutes

Python 1,645 151 Updated Dec 5, 2024

MegEngine is a fast, scalable, easy-to-use deep learning framework with support for automatic differentiation

C++ 4,770 543 Updated Oct 24, 2024

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 5,949 669 Updated Dec 2, 2024

A tutorial for CUDA & PyTorch

C++ 118 25 Updated Oct 28, 2024

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 3,789 260 Updated Oct 5, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 5,770 991 Updated Dec 3, 2024

Helpful tools and examples for working with flex-attention

Python 495 25 Updated Dec 3, 2024

calflops is designed to calculate FLOPs, MACs, and parameters in a wide range of neural networks, such as Linear, CNN, RNN, GCN, and Transformer models (BERT, LLaMA, and other large language models)

Python 594 21 Updated Jun 27, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 31,316 4,754 Updated Dec 5, 2024

The official Meta Llama 3 GitHub site

Python 27,367 3,106 Updated Aug 12, 2024
Jupyter Notebook 23 1 Updated Sep 9, 2024

Triton implementation of FlashAttention2 that adds Custom Masks.

Python 81 7 Updated Aug 14, 2024

Ring attention implementation with flash attention

Python 596 49 Updated Nov 8, 2024

NumPy aware dynamic Python compiler using LLVM

Python 10,023 1,131 Updated Dec 4, 2024

LLM101n: Let's build a Storyteller

30,390 1,658 Updated Aug 1, 2024

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Jupyter Notebook 1,692 108 Updated Sep 18, 2024

LLM training in simple, raw C/CUDA

Cuda 24,633 2,792 Updated Oct 2, 2024

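The main purpose of this experiment is to compute the gray-level co-occurrence matrix (GLCM) from remote sensing images and to derive multiple texture features from that matrix. All computed results have been compared against ENVI's output and are consistent with it.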

Python 130 29 Updated Jun 8, 2020

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Python 512 14 Updated Dec 1, 2024

How to optimize some algorithms in CUDA.

Cuda 1,688 137 Updated Dec 4, 2024

A latent text-to-image diffusion model

Jupyter Notebook 68,618 10,186 Updated Jun 18, 2024

APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)

Python 895 61 Updated Jun 28, 2024

[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"

864 34 Updated Apr 28, 2024

PyQt5 implementation of YOLOv5 GUI

Python 1,339 224 Updated Sep 12, 2024

Add gui for YoloV5 using PyQt5

Python 147 23 Updated Aug 17, 2021

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,762 459 Updated Nov 5, 2024

Image quality assessment

Python 14 4 Updated Nov 14, 2019

Code release for CVPR 2024 paper LEOD: Label-Efficient Object Detection for Event Cameras

Python 32 4 Updated Mar 11, 2024

A Pytorch implementation of StyleGAN2

Python 165 36 Updated Nov 30, 2020