atyenoria

🌴

On vacation

Akinori Nakajima atyenoria

AI Translation Platform for Global Team

283 followers · 922 following

VoicePing
Tokyo, Japan

Achievements

x3 x3

Lists (1)

🚀 My stack

1 repository

Stars

gradio-app / fastrtc

The Python library for real-time communication

JavaScript 3,943 345 Updated May 23, 2025

edwko / OuteTTS

Interface for OuteTTS models.

Python 1,278 106 Updated May 23, 2025

canopyai / Orpheus-TTS

Towards Human-Sounding Speech

Python 4,828 389 Updated May 6, 2025

thad0ctor / unsloth-5090-multiple

unsloth-5090-multiple

Python 5 1 Updated May 21, 2025

nyrahealth / CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python 714 36 Updated Dec 19, 2024

TIGER-AI-Lab / MMLU-Pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

Python 245 36 Updated Feb 28, 2025

HKUNLP / ChunkLlama

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Python 410 20 Updated Oct 16, 2024

valkey-io / valkey

A flexible distributed key-value database that is optimized for caching and other realtime workloads.

C 21,535 819 Updated May 23, 2025

bytedance / 1d-tokenizer

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 880 46 Updated Mar 20, 2025

MoonshotAI / Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,662 231 Updated May 19, 2025

microsoft / BitNet

Official inference framework for 1-bit LLMs

Python 19,809 1,477 Updated May 23, 2025

ace-step / ACE-Step

ACE-Step: A Step Towards Music Generation Foundation Model

Python 2,221 208 Updated May 24, 2025

node-saml / node-saml

A SAML library not dependent on any frameworks that runs in Node.

TypeScript 111 68 Updated Apr 21, 2025

tngan / samlify

Node.js library for SAML SSO

TypeScript 628 230 Updated May 6, 2025

microsoft / UFO

The Desktop AgentOS.

Python 7,285 895 Updated May 13, 2025

deepseek-ai / DeepSeek-Prover-V2

1,103 76 Updated Apr 30, 2025

VikParuchuri / marker

Convert PDF to markdown + JSON quickly with high accuracy

Python 25,303 1,619 Updated May 22, 2025

huggingface / hf_transfer

Rust 465 35 Updated Apr 11, 2025

jquesnelle / yarn

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,487 121 Updated Apr 17, 2024

halsay / ASR-TTS-paper-daily

Updates ASR papers every day

Python 220 12 Updated May 24, 2025

optuna / optuna

A hyperparameter optimization framework

Python 12,004 1,101 Updated May 24, 2025

argmaxinc / SDBench

Open-source and reproducible benchmarks for Speaker Diarization

Jupyter Notebook 24 Updated Apr 17, 2025

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 16,309 1,298 Updated May 21, 2025

policy-gradient / GRPO-Zero

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,362 52 Updated Apr 18, 2025

a-r-r-o-w / finetrainers

Scalable and memory-optimized training of diffusion models

Python 1,150 127 Updated May 13, 2025

lllyasviel / FramePack

Let's make video diffusion practical!

Python 13,602 1,170 Updated May 4, 2025

roy-ht / editdistance

Fast implementation of the edit distance (Levenshtein distance)

C++ 685 64 Updated Feb 16, 2024

lhotse-speech / lhotse

Tools for handling multimodal data in machine learning projects.

Python 1,024 233 Updated May 22, 2025

mhahsler / dbscan

Density Based Clustering of Applications with Noise (DBSCAN) and Related Algorithms - R package

C++ 328 64 Updated Apr 3, 2025

helix-editor / helix

A post-modern modal text editor.

Rust 37,690 2,830 Updated May 24, 2025