8000 atyenoria (Akinori Nakajima) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View atyenoria's full-sized avatar
🌴
On vacation
🌴
On vacation
  • VoicePing
  • Tokyo, Japan

Highlights

  • Pro

Block or report atyenoria

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The python library for real-time communication

JavaScript 3,943 345 Updated May 23, 2025

Interface for OuteTTS models.

Python 1,278 106 Updated May 23, 2025

Towards Human-Sounding Speech

Python 4,828 389 Updated May 6, 2025

unsloth-5090-multiple

Python 5 1 Updated May 21, 2025

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python 714 36 Updated Dec 19, 2024

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

Python 245 36 Updated Feb 28, 2025

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Python 410 20 Updated Oct 16, 2024

A flexible distributed key-value database that is optimized for caching and other realtime workloads.

C 21,535 819 Updated May 23, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 880 46 Updated Mar 20, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,662 231 Updated May 19, 2025

Official inference framework for 1-bit LLMs

Python 19,809 1,477 Updated May 23, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 2,221 208 Updated May 24, 2025

A SAML library not dependent on any frameworks that runs in Node.

TypeScript 111 68 Updated Apr 21, 2025

Node.js library for SAML SSO

TypeScript 628 230 Updated May 6, 2025

The Desktop AgentOS.

Python 7,285 895 Updated May 13, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 25,303 1,619 Updated May 22, 2025
Rust 465 35 Updated Apr 11, 2025

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,487 121 Updated Apr 17, 2024

Update ASR paper everyday

Python 220 12 Updated May 24, 2025

A hyperparameter optimization framework

Python 12,004 1,101 Updated May 24, 2025

Open-source and reproducible benchmarks for Speaker Diarization

Jupyter Notebook 24 Updated Apr 17, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 16,309 1,298 Updated May 21, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,362 52 Updated Apr 18, 2025

Scalable and memory-optimized training of diffusion models

Python 1,150 127 Updated May 13, 2025

Lets make video diffusion practical!

Python 13,602 1,170 Updated May 4, 2025

Fast implementation of the edit distance(Levenshtein distance)

C++ 685 64 Updated Feb 16, 2024

Tools for handling multimodal data in machine learning projects.

Python 1,024 233 Updated May 22, 2025

Density Based Clustering of Applications with Noise (DBSCAN) and Related Algorithms - R package

C++ 328 64 Updated Apr 3, 2025

A post-modern modal text editor.

Rust 37,690 2,830 Updated May 24, 2025
Next
0