Stars
SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process
[Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
code for `Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation`
A security scanner for your LLM agentic workflows
The AI-native proxy server for agents. Arch handles the pesky low-level work in building agentic apps like calling specific tools, routing prompts to the right agents, clarifying vague inputs, unif…
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
An Extension for Forge Webui that implements Attention Couple
stackblitz-labs / bolt.diy
Forked from stackblitz/bolt.newPrompt, run, edit, and deploy full-stack web applications using any LLM you want!
A multi-page application to visualize and predict Covid numbers
A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
This repo collects the research resources based on SAM(Segment Anything Model) proposed by Meta AI. If you would like to contribute, please open an issue.
A curated list of awesome resources for dichotomous image segmentation (DIS).
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
This is the official pytorch implementation of DIS-SAM.
streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
Get your documents ready for gen AI
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
✨✨Latest Advances on Multimodal Large Language Models
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
FlashInfer: Kernel Library for LLM Serving
SGLang is a fast serving framework for large language models and vision language models.
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.