shyyhs (Song) / Starred · GitHub

Organizations

@NLPforCOVID-19


Dataset of birdsong: 10-second samples stored as data vectors in CSV files.

Jupyter Notebook 5 2 Updated Sep 27, 2020

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,101 7,356 Updated May 12, 2025

A machine translation evaluation metric that calculates the F-score for byte-level n-gram overlap.

1 Updated Apr 8, 2025

POT : Python Optimal Transport

Python 2,542 518 Updated Apr 26, 2025

Gromov-Wasserstein Alignment of Embeddings

Python 66 14 Updated Sep 23, 2021

Releases from OpenAI Preparedness

Python 732 63 Updated Apr 11, 2025

Code for training a speech recognition model in which Whisper's decoder is replaced with llm-jp-1.3b-v1.0

Python 8 Updated Sep 7, 2024

Train transformer language models with reinforcement learning.

Python 13,677 1,871 Updated May 9, 2025

CycleResearcher: Improving Automated Research via Automated Review

Jupyter Notebook 160 10 Updated May 9, 2025

🙌 OpenHands: Code Less, Make More

Python 54,075 6,105 Updated May 12, 2025

Repository for the EMNLP 2025 conference

HTML 2 3 Updated May 10, 2025

Code repository for the tutorial "Connecting Ideas in Lower-Resource Scenarios: NLP for National Varieties, Creoles, and Other Low-resource Languages" @ COLING 2025

Jupyter Notebook 6 Updated Jan 20, 2025

EmoTa is an open-access Tamil Speech Emotion Recognition dataset with 936 utterances from 22 native speakers, covering five emotions (anger, happiness, sadness, fear, and neutrality). It supports e…

8 Updated Apr 11, 2025

A High-Quality Multilingual Dataset for Structured Documentation Translation

Python 36 7 Updated May 1, 2025

Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric Evaluation of Machine Translation with a Densely Annotated P…

Python 78 11 Updated Sep 21, 2023

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,475 529 Updated May 3, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 24 4 Updated Mar 31, 2025

A library for minimum Bayes risk (MBR) decoding

Python 37 7 Updated Apr 10, 2025

Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ

Python 1,216 60 Updated Apr 10, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 14,449 1,043 Updated Mar 17, 2025
C++ 830 119 Updated May 24, 2023

Compact Language Detector 2

C++ 862 131 Updated May 22, 2021
Python 15 2 Updated Mar 19, 2025

Streamlit — A faster way to build and share data apps.

Python 39,305 3,443 Updated May 12, 2025

String-to-String Algorithms for Natural Language Processing

Jupyter Notebook 545 29 Updated Jul 26, 2024

Go ahead and axolotl questions

Python 9,318 1,011 Updated May 12, 2025

Translation models for 22 scheduled languages of India

Python 313 84 Updated May 2, 2025
Python 11 Updated Apr 2, 2024
Jupyter Notebook 9,524 671 Updated Apr 23, 2025