8000 NOrangeeroli / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View NOrangeeroli's full-sized avatar

Block or report NOrangeeroli

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Understanding R1-Zero-Like Training: A Critical Perspective

Python 912 42 Updated Apr 15, 2025

Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ

Python 1,196 60 Updated Apr 10, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,329 154 Updated Mar 20, 2025

minimal-cost for training 0.5B R1-Zero

Python 716 89 Updated Apr 25, 2025

Simple RL training for reasoning

Python 3,535 265 Updated Apr 10, 2025

Fully open reproduction of DeepSeek-R1

Python 24,326 2,235 Updated May 8, 2025

Official Repo for Open-Reasoner-Zero

Python 1,911 98 Updated Apr 8, 2025

Learning Formal Mathematics from Intrinsic Motivation

Rust 27 15 Updated Mar 12, 2025

An environment for learning formal mathematical reasoning from scratch

Python 66 7 Updated Aug 18, 2024
Python 65 14 Updated Feb 17, 2022

A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.

Rust 1 Updated Feb 4, 2025

A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.

Rust 5,095 400 Updated Feb 4, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 16,040 1,653 Updated Apr 12, 2025

Fully local web research and report writing assistant

Python 7,312 718 Updated Mar 24, 2025

Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research

Python 531 69 Updated Aug 30, 2024
0