Stars
Lightning-fast C++/CUDA neural network framework
AivisSpeech: AI Voice Imitation System - Text to Speech Software
This repo contains CUDA-Q Academic materials, including self-paced Jupyter notebook modules for building and optimizing hybrid quantum-classical algorithms using CUDA-Q.
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
Parlant is the open-source conversation modeling engine for building better, deliberate Agentic UX. It gives you the power of LLMs without the unpredictability.
This project integrates a custom CUDA-based matrix multiplication kernel into a PyTorch deep learning model, leveraging GPU acceleration for matrix operations. The goal is to compare the performanc…
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Code for the paper "Language Models are Unsupervised Multitask Learners"
An extension library of WMMA API (Tensor Core API)
cudnn_frontend provides a C++ wrapper for the cuDNN backend API and samples showing how to use it
Samples for CUDA developers demonstrating features in the CUDA Toolkit
[ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training
The homepage of the OneBit model quantization framework.
Witness the aha moment of VLM with less than $3.
Library for fast text representation and classification.
Fully open reproduction of DeepSeek-R1
EvaByte: Efficient Byte-level Language Models at Scale
Unofficial implementation of Titans, SOTA memory for transformers, in PyTorch
Scalable RL solution for advanced reasoning of language models
A collection of advanced CSS styles to create realistic-looking effects for the faces of Pokemon cards.
Explore training for quantized models