saisurbehera / Starred · GitHub

prime-rl is a codebase for decentralized async RL training at scale

Python 336 47 Updated Jun 16, 2025

A non-saturating, open-ended environment for evaluating LLMs in Factorio

HTML 731 42 Updated Jun 14, 2025

[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion

Python 2,662 483 Updated Dec 24, 2024

LuxMineRL game

Jupyter Notebook 1 Updated Jan 19, 2025

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Python 1,714 318 Updated Jul 30, 2024

Autonomous agents for everyone

TypeScript 16,081 5,238 Updated Jun 16, 2025

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive lea…

C++ 8,571 1,933 Updated Oct 17, 2024

code for training & evaluating Contextual Document Embedding models

Python 194 11 Updated May 14, 2025

Training Sparse Autoencoders on Language Models

Python 827 176 Updated Jun 15, 2025

The Open Source AI-Powered Code Editor. A fork of VSCode and Continue and PearAI

TypeScript 427 44 Updated Sep 29, 2024

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 637 48 Updated Jan 20, 2025

A massively parallel, optimal functional runtime in Rust

Cuda 11,036 423 Updated Nov 21, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,162 101 Updated May 8, 2024

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 315 32 Updated Aug 6, 2024

Reverse Engineering the Abstraction and Reasoning Corpus

Jupyter Notebook 279 45 Updated Feb 24, 2025

Gibbs sampler for the Hierarchical Latent Dirichlet Allocation topic model

Jupyter Notebook 1 Updated Aug 23, 2022

Code for 1st place solution to Kaggle's Abstraction and Reasoning Challenge

C++ 157 27 Updated Jun 8, 2020

Convert any text to a graph of knowledge. This can be used for Graph Augmented Generation or Knowledge Graph based QnA

Jupyter Notebook 1,879 356 Updated May 15, 2025

[ACL 2024] Long-Context Language Modeling with Parallel Encodings

Python 154 10 Updated Jun 13, 2024

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 23,674 5,679 Updated Aug 14, 2024

Compiler for LightGBM gradient-boosted trees, based on LLVM. Speeds up prediction by ≥10x.

Python 423 32 Updated May 5, 2025

FinOps and cloud cost optimization tool. Supports AWS, Azure, GCP, Alibaba Cloud and Kubernetes.

Python 1,503 224 Updated Jun 13, 2025

✱ Understanding the underlying learning dynamics of simple tasks in Transformer networks

Jupyter Notebook 16 Updated Aug 16, 2024

Simple-to-use scoring function for arbitrarily tokenized texts.

Python 41 5 Updated Feb 19, 2025

An all-in-one Blender assistant powered by GPT3/4 + Whisper integration

Python 126 19 Updated Sep 9, 2024

Jupyter Notebook 883 160 Updated Feb 5, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 176,162 45,809 Updated Jun 15, 2025