ce-lery (ce-lery) / Starred · GitHub

Lightning fast C++/CUDA neural network framework

C++ 4,068 502 Updated Apr 29, 2025

Code release for DynamicTanh (DyT)

Python 951 78 Updated Mar 30, 2025

AivisSpeech: AI Voice Imitation System - Text to Speech Software

TypeScript 369 13 Updated Jun 3, 2025

This repo contains CUDA-Q Academic materials, including self-paced Jupyter notebook modules for building and optimizing hybrid quantum-classical algorithms using CUDA-Q.

Jupyter Notebook 130 33 Updated Jun 12, 2025

C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

C++ 724 255 Updated Jun 15, 2025
Python 713 72 Updated Jun 15, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,444 617 Updated Jun 11, 2025

Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.

Jupyter Notebook 632 53 Updated Mar 22, 2025

Parlant is the open-source conversation modeling engine for building better, deliberate Agentic UX. It gives you the power of LLMs without the unpredictability.

Python 3,130 317 Updated Jun 15, 2025

This project integrates a custom CUDA-based matrix multiplication kernel into a PyTorch deep learning model, leveraging GPU acceleration for matrix operations. The goal is to compare the performanc…

Python 5 1 Updated Aug 26, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 90,813 24,438 Updated Jun 15, 2025

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 23,674 5,679 Updated Aug 14, 2024

An extension library of WMMA API (Tensor Core API)

Cuda 99 15 Updated Jul 12, 2024

cudnn_frontend provides a C++ wrapper for the cuDNN backend API and samples showing how to use it

C++ 577 121 Updated Jun 12, 2025

CUDA Library Samples

Cuda 1,977 394 Updated Jun 10, 2025

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

C 7,606 2,055 Updated May 22, 2025

[ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training

Python 208 14 Updated Jun 15, 2025

The homepage of OneBit model quantization framework.

Python 181 4 Updated Feb 5, 2025

Witness the aha moment of VLM with less than $3.

Python 3,758 288 Updated May 19, 2025

Library for fast text representation and classification.

HTML 26,254 4,773 Updated Mar 22, 2024

Fully open reproduction of DeepSeek-R1

Python 24,796 2,294 Updated Jun 2, 2025

EvaByte: Efficient Byte-level Language Models at Scale

Python 101 6 Updated Apr 22, 2025

Unofficial implementation of Titans, SOTA memory for transformers, in PyTorch

Python 1,379 124 Updated Jun 3, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,608 94 Updated Mar 18, 2025

Low-bit LLM inference on CPU with lookup table

C++ 802 65 Updated Jun 5, 2025

A collection of advanced CSS styles to create realistic-looking effects for the faces of Pokemon cards.

CSS 6,243 580 Updated Apr 25, 2025

Explore training for quantized models

Python 18 2 Updated Jun 15, 2025