8000 GitHub - wodaoaicc/open-thoughts: Open Thoughts: Fully Open Data Curation for Thinking Models
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Open Thoughts: Fully Open Data Curation for Thinking Models

License

Notifications You must be signed in to change notification settings

wodaoaicc/open-thoughts

 
 

Repository files navigation

Open Thoughts GitHub Repository

Static Badge Hugging Face
Curating the best open reasoning datasets
A collaboration led by Bespoke Labs and the DataComp community


Our first goal is to curate a reasoning dataset to train state-of-the-art small reasoning models that surpass DeepSeek-R1-Distill-Qwen-32B and DeepSeek-R1-Distill-Qwen-7B on math and code reasoning benchmarks.

News

About

Open Thoughts: Fully Open Data Curation for Thinking Models

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.3%
  • Makefile 0.7%
0