drfarasat (Farasat Munir) / Starred · GitHub

Farasat Munir drfarasat

0 followers · 6 following

Dr

Stars

song-wensong / insert-anything

Python 429 15 Updated Jun 3, 2025

MengyuWang826 / SegRefiner

SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process

Python 189 13 Updated Jan 21, 2024

Deci-AI / data-gradients

Computer Vision dataset analysis

Python 301 36 Updated Aug 6, 2024

TencentARC / BlobCtrl

[Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

Python 90 2 Updated Mar 20, 2025

chenhang98 / BPR

code for `Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation`

Python 185 21 Updated Sep 11, 2023

splx-ai / agentic-radar

A security scanner for your LLM agentic workflows

Python 588 62 Updated Jun 9, 2025

katanemo / archgw

The AI-native proxy server for agents. Arch handles the pesky low-level work in building agentic apps like calling specific tools, routing prompts to the right agents, clarifying vague inputs, unif…

Rust 2,722 152 Updated Jun 11, 2025

patchy631 / ai-engineering-hub

In-depth tutorials on LLMs, RAGs and real-world AI agent applications.

Jupyter Notebook 9,751 1,696 Updated Jun 13, 2025

camel-ai / camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 12,903 1,376 Updated Jun 13, 2025

rbrown101010 / yapsearch

TypeScript 335 190 Updated Jan 23, 2025

Haoming02 / sd-forge-couple

An Extension for Forge Webui that implements Attention Couple

Python 360 18 Updated Apr 11, 2025

stackblitz-labs / bolt.diy

Forked from stackblitz/bolt.new

Prompt, run, edit, and deploy full-stack web applications using any LLM you want!

TypeScript 16,260 8,901 Updated Jun 12, 2025

Avaiga / demo-covid-dashboard

A multi-page application to visualize and predict Covid numbers

Python 22 10 Updated May 19, 2025

finegrain-ai / refiners

A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation

Python 828 62 Updated May 15, 2025

ZrrSkywalker / Personalize-SAM

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Python 1,585 107 Updated Jul 22, 2024

baibizhe / Awesome-SAM

This repo collects the research resources based on SAM(Segment Anything Model) proposed by Meta AI. If you would like to contribute, please open an issue.

45 3 Updated Aug 22, 2023

Tennine2077 / Awesome-Dichotomous-Image-Segmentation

A curated list of awesome resources for dichotomous image segmentation (DIS).

28 Updated May 25, 2025

ZhengPeng7 / BiRefNet

[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation

Python 2,345 170 Updated Jun 11, 2025

Tennine2077 / DIS-SAM

This is the official pytorch implementation of DIS-SAM.

Jupyter Notebook 13 4 Updated Apr 16, 2025

roboflow / maestro

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,571 208 Updated Jun 9, 2025

DS4SD / quackling

Build document-native LLM applications

Python 53 2 Updated Sep 11, 2024

docling-project / docling

Get your documents ready for gen AI

Python 31,802 2,034 Updated Jun 13, 2025

UX-Decoder / Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,617 429 Updated Aug 19, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

15,528 1,008 Updated Jun 13, 2025

Computer-Vision-in-the-Wild / CVinW_Readings

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

1,305 58 Updated Mar 14, 2024

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 3,171 327 Updated Jun 13, 2025

punica-ai / punica

Serving multiple LoRA finetuned LLM as one

Python 1,065 52 Updated May 8, 2024

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 15,104 1,987 Updated Jun 13, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,960 784 Updated May 15, 2025

gligen / GLIGEN

Open-Set Grounded Text-to-Image Generation

Python 2,124 161 Updated Mar 6, 2024
