drfarasat (Farasat Munir) / Starred · GitHub

SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process

Python 189 13 Updated Jan 21, 2024

Computer Vision dataset analysis

Python 301 36 Updated Aug 6, 2024

[Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

Python 90 2 Updated Mar 20, 2025

code for `Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation`

Python 185 21 Updated Sep 11, 2023

A security scanner for your LLM agentic workflows

Python 588 62 Updated Jun 9, 2025

The AI-native proxy server for agents. Arch handles the pesky low-level work in building agentic apps like calling specific tools, routing prompts to the right agents, clarifying vague inputs, unif…

Rust 2,722 152 Updated Jun 11, 2025

In-depth tutorials on LLMs, RAGs and real-world AI agent applications.

Jupyter Notebook 9,751 1,696 Updated Jun 13, 2025

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 12,903 1,376 Updated Jun 13, 2025
TypeScript 335 190 Updated Jan 23, 2025

An Extension for Forge Webui that implements Attention Couple

Python 360 18 Updated Apr 11, 2025

Prompt, run, edit, and deploy full-stack web applications using any LLM you want!

TypeScript 16,260 8,901 Updated Jun 12, 2025

A multi-page application to visualize and predict Covid numbers

Python 22 10 Updated May 19, 2025

A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation

Python 828 62 Updated May 15, 2025

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Python 1,585 107 Updated Jul 22, 2024

This repo collects the research resources based on SAM(Segment Anything Model) proposed by Meta AI. If you would like to contribute, please open an issue.

45 3 Updated Aug 22, 2023

A curated list of awesome resources for dichotomous image segmentation (DIS).

28 Updated May 25, 2025

[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation

Python 2,345 170 Updated Jun 11, 2025

This is the official pytorch implementation of DIS-SAM.

Jupyter Notebook 13 4 Updated Apr 16, 2025

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,571 208 Updated Jun 9, 2025

Build document-native LLM applications

Python 53 2 Updated Sep 11, 2024

Get your documents ready for gen AI

Python 31,802 2,034 Updated Jun 13, 2025

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,617 429 Updated Aug 19, 2024

✨✨Latest Advances on Multimodal Large Language Models

15,528 1,008 Updated Jun 13, 2025

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

1,305 58 Updated Mar 14, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 3,171 327 Updated Jun 13, 2025

Serving multiple LoRA finetuned LLM as one

Python 1,065 52 Updated May 8, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 15,104 1,987 Updated Jun 13, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,960 784 Updated May 15, 2025

Open-Set Grounded Text-to-Image Generation

Python 2,124 161 Updated Mar 6, 2024