Stars
Open-source Multi-agent Poster Generation from Papers
Full system prompts, tools & AI models for v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser, Trae AI, and other open-sourced agents.
Everything about the SmolLM2 and SmolVLM family of models
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
Lightweight coding agent that runs in your terminal
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
🚀 One-stop solution for creating your digital avatar from chat logs. 💡 Fine-tune LLMs on your chat logs to capture your unique style, then bind them to a chatbot to bring your digital self to life.
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
Suna - Open Source Generalist AI Agent
A curated collection of resources, tools, and frameworks for developing GUI Agents.
MAGI-1: Autoregressive Video Generation at Scale
A quick-start programming guide to the Model Context Protocol (MCP)
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI-generated visuals created with ChatGPT and Sora, showcasing OpenAI's advanced image generation capabilities.
An open protocol enabling communication and interoperability between opaque agentic applications.
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
[CVPR 2025 Oral] Official repository for the paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
11 Lessons to Get Started Building AI Agents
Qwen2.5-Omni is an end-to-end multimodal model from the Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, and video, and performing real-time speech generation.
Use PEFT or full-parameter training to run CPT/SFT/DPO/GRPO on 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, …) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…)
A GUI agent application based on UI-TARS (a vision-language model) that lets you control your computer using natural language.