kyutai · GitHub
@kyutai-labs

kyutai

Kyutai - Open Science AI Lab

Popular repositories

  1. moshi Public

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

    Python 8.6k 748

  2. delayed-streams-modeling Public

    Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

    Python 1.9k 163

  3. hibiki Public

    Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation, where one waits for the end of the source utterance to start translating, H…

    Rust 1.2k 89

  4. unmute Public

    Make text LLMs listen and speak

    Python 562 84

  5. moshi-finetune Public

    Python 260 20

  6. moshivis Public

    Kyutai with an "eye"

    Python 207 26

Repositories

Showing 10 of 15 repositories
  • moshi Public

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

    Python 8,638 Apache-2.0 748 49 11 Updated Jul 11, 2025
  • unmute Public

    Make text LLMs listen and speak

    Python 562 MIT 84 26 (3 issues need help) 3 Updated Jul 8, 2025
  • delayed-streams-modeling Public

    Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

    Python 1,878 Apache-2.0 163 15 1 Updated Jul 8, 2025
  • moshi-finetune Public
    Python 260 Apache-2.0 20 5 0 Updated Jul 7, 2025
  • moshi-swift Public
    Swift 104 MIT 8 1 0 Updated Jun 26, 2025
  • yomikomi Public

    A small Rust-based data loader

    Rust 30 Apache-2.0 0 0 0 Updated Jun 9, 2025
  • dactory Public
    Python 41 Apache-2.0 3 0 0 Updated Apr 30, 2025
  • sphn Public

    Python bindings for symphonia/opus: read various audio formats from Python and write Opus files

    Rust 64 Apache-2.0 6 1 0 Updated Apr 28, 2025
  • hibiki Public

    Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation, where one waits for the end of the source utterance to start translating, Hibiki adapts its flow to accumulate just enough context to produce a correct translation in real time, chunk by chunk.

    Rust 1,222 Apache-2.0 89 8 1 Updated Apr 15, 2025
  • moshivis Public

    Kyutai with an "eye"

    Python 207 Apache-2.0 26 0 0 Updated Mar 26, 2025
