8000 1633232731 (zhangxichen) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View 1633232731's full-sized avatar
🍉
Focusing
🍉
Focusing

Block or report 1633232731

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 44 8 Updated Apr 12, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 4,560 241 Updated Jun 11, 2025

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…

Jupyter Notebook 452 23 Updated Mar 10, 2025

The official implementation of RAR

Python 88 1 Updated Mar 27, 2024

Universal Monocular Metric Depth Estimation

Python 912 82 Updated May 18, 2025

Automatically find issues in image datasets and practice data-centric computer vision.

Python 1,091 75 Updated Apr 3, 2025

Official Documentation for the Binance Spot APIs and Streams

4,370 1,396 Updated Jun 11, 2025

Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)

Python 336 17 Updated Feb 23, 2024

免费开源的网易BUFF、悠悠有品、ECOsteam、C5Game、Steam的全自动收发货解决方案

Python 993 134 Updated Jun 6, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,595 1,418 Updated May 27, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 52,064 6,293 Updated Jun 10, 2025

C# application with primary purpose of farming Steam cards from multiple accounts simultaneously.

C# 12,084 1,081 Updated Jun 11, 2025

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 1,804 134 Updated Mar 13, 2025

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Python 2,797 177 Updated May 15, 2025

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 7,581 579 Updated Jul 17, 2024

Official implementation of Adabins: Depth Estimation using adaptive bins

Python 768 160 Updated May 29, 2022

object detection based on owl-vit

Python 59 8 Updated Aug 18, 2023

measure relative lenghts in an image

Python 6 Updated Apr 7, 2025

OpenMMLab Detection Toolbox and Benchmark

Python 31,149 9,680 Updated Aug 21, 2024

Authenticator on Windows for Battle.net / Steam / Guild Wars 2 / Glyph / Runescape / SWTOR / Bitcoin and digital currency exchanges

C# 1,817 409 Updated Sep 26, 2021

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Python 621 117 Updated Jun 25, 2024

Scenic: A Jax Library for Computer Vision Research and Beyond

Python 3,559 453 Updated Jun 5, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 8,220 828 Updated Aug 12, 2024

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 13,874 3,254 Updated Jun 11, 2025

[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"

Python 771 57 Updated Mar 20, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 16,435 1,505 Updated Sep 5, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 50,434 5,923 Updated Sep 18, 2024

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 2,963 347 Updated Apr 25, 2024

EVA Series: Visual Representation Fantasies from BAAI

Python 2,503 185 Updated Aug 1, 2024

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 1,978 226 Updated May 20, 2024
Next
0