8000 zytx121 (Yue Zhou) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View zytx121's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@Justice-Eternal

Block or report zytx121

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[TGRS'25] AirSpatialBot: A Spatially-Aware Aerial Agent for Fine-Grained Vehicle Attribute Recognization and Retrieval

8 Updated May 17, 2025

[IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.

Jupyter Notebook 71 5 Updated May 20, 2025

【Numbered musical notation tools】je 简谱 处理工具,包括转调、播放、制谱、midi提取(转换)与制作等

JavaScript 62 9 Updated Mar 29, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,985 308 Updated May 11, 2025

GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks

46 4 Updated Apr 3, 2025

GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding

49 1 Updated May 10, 2025

[ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation

Python 95 2 Updated Mar 28, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 7,731 658 Updated May 24, 2025

汽车之家车型品牌车系车型等的数据

3 1 Updated Sep 16, 2023

(不定时月更)汽车之家数据,各车型,参数配置。

Python 80 22 Updated May 5, 2025

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,507 279 Updated May 24, 2025
Python 159 12 Updated May 6, 2024

Summary of Geoguesser Models / Agents

4 Updated Jun 27, 2024

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Python 2,573 215 Updated Apr 1, 2025

[NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)

Python 90 2 Updated May 16, 2025

[ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Python 351 6 Updated May 5, 2025

[TPAMI] Oriented object detection on STAR dataset.

Python 77 4 Updated Feb 3, 2025

A Survey on Vision-Language Geo-Foundation Models (VLGFMs)

166 8 Updated May 24, 2025

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,836 173 Updated May 20, 2025

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 609 39 Updated Jan 7, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,555 344 Updated May 20, 2025

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,909 485 Updated Feb 7, 2025
16 1 Updated Dec 25, 2024

Official implementation of the CVPR23 paper: Dynamic Coarse-to-Fine Learning for Oriented Tiny Object Detection

Python 116 3 Updated Nov 23, 2023

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 68,569 8,346 Updated May 6, 2025

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 16,355 1,495 Updated Sep 5, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 50,212 5,899 Updated Sep 18, 2024

JDet is an object detection benchmark based on Jittor. Mainly focus on aerial image object detection (oriented object detection).

Python 201 35 Updated Mar 2, 2025
Next
0