pringwong (Pring Wong) / Starred · GitHub
  • Beijing

A platform that lets you build agents to learn to play StarCraft: Brood War.

C++ 652 124 Updated Aug 31, 2021

Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and…

Python 49,096 8,198 Updated May 13, 2025

Optimizing inference proxy for LLMs

Python 2,226 174 Updated May 13, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,726 374 Updated May 13, 2025

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,555 322 Updated Jul 18, 2024

Memory for AI Agents; SOTA in AI Agent Memory; Announcing OpenMemory MCP - local and secure memory management.

Python 29,737 2,868 Updated May 14, 2025

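Provides a practical interactive interface for large language models such as GPT and GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-interpretation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…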

Python 68,477 8,344 Updated May 6, 2025

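A RAG (retrieval-augmented generation) system suited to learning, everyday use, and independent extension. It can also go online to perform AI search.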

Python 484 48 Updated Sep 4, 2024

[CVPR 2024] Code release for TransNeXt model

Python 523 23 Updated Jun 13, 2024

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Python 386 10 Updated Jul 9, 2024

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 140,483 11,754 Updated May 14, 2025

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,753 834 Updated May 29, 2022

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 2,764 348 Updated Mar 23, 2025

A PyTorch re-implementation of the ECCV 2022 paper, based on Detectron2: k-means mask Transformer.

Python 75 10 Updated Jul 28, 2023

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Python 26,844 3,387 Updated Apr 3, 2025

[NeurIPS 2021] You Only Look at One Sequence

Jupyter Notebook 865 121 Updated May 4, 2022

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,251 245 Updated May 14, 2025

LLM Analytics

TypeScript 659 28 Updated Oct 19, 2024

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 12,446 1,315 Updated May 14, 2025

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

MDX 55,942 5,514 Updated Apr 25, 2025

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curatio…

Python 2,090 185 Updated Nov 7, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,314 514 Updated Jan 16, 2025

Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"

Python 175 10 Updated May 14, 2025

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,464 119 Updated Jun 13, 2024

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,094 340 Updated Jan 13, 2025

High throughput synchronous and asynchronous reinforcement learning

Python 898 128 Updated Apr 24, 2025

CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.

Python 110 11 Updated Sep 11, 2024

WarAgent: LLM-based Multi-Agent Simulation of World Wars

Python 266 35 Updated Mar 5, 2024

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Python 415 17 Updated Oct 11, 2023

Freeciv-web is an Open Source strategy game implemented in HTML5 and WebGL, which can be played online against other players, or in single player mode against AI opponents.

JavaScript 2,054 344 Updated May 2, 2025