Yue Zhou zytx121

🎯

Focusing

Research Fellow at S-Lab@NTU

135 followers · 37 following

BUPT -> SJTU -> NTU
Singapore
zytx121.github.io

Lists (1)

🚀 My stack

Stars

VisionXLab / AirSpatialBot

[TGRS'25] AirSpatialBot: A Spatially-Aware Aerial Agent for Fine-Grained Vehicle Attribute Recognization and Retrieval

8 Updated May 17, 2025

VisionXLab / mllm-mmrotate

[IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.

Jupyter Notebook 71 5 Updated May 20, 2025

madderscientist / je_score_operator

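[Numbered musical notation tools] je: a toolkit for numbered musical notation (jianpu), including transposition, playback, score creation, and MIDI extraction (conversion) and production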

JavaScript 62 9 Updated Mar 29, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 4,985 308 Updated May 11, 2025

The-AI-Alliance / GEO-Bench-VLM

GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks

46 4 Updated Apr 3, 2025

VisionXLab / GeoGround

GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding

49 1 Updated May 10, 2025

mc-lan / Text4Seg

[ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation

Python 95 2 Updated Mar 28, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 7,731 658 Updated May 24, 2025

szx2015 / spider_autohome_data

Autohome data on vehicle brands, series, models, and more

3 1 Updated Sep 16, 2023

swoiow / autohome

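(Updated roughly monthly, at irregular intervals) Autohome data: all vehicle models with their specifications and configurations.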

Python 80 22 Updated May 5, 2025

EvolvingLMMs-Lab / lmms-eval

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,507 279 Updated May 24, 2025

gastruc / osv5m

Python 159 12 Updated May 6, 2024

zilunzhang / StreetCLIP-Repoduce

Python 11 3 Updated Jul 1, 2024

zilunzhang / Awesome-Geoguesser

Summary of Geoguesser Models / Agents

4 Updated Jun 27, 2024

zou-group / textgrad

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Python 2,573 215 Updated Apr 1, 2025

OpenGVLab / PIIP

[NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)

Python 90 2 Updated May 16, 2025

OpenGVLab / OmniCorpus

[ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Python 351 6 Updated May 5, 2025

VisionXLab / STAR-MMRotate

[TPAMI] Oriented object detection on STAR dataset.

Python 77 4 Updated Feb 3, 2025

Zhuzi24 / STAR-MMDetection

3 Updated Jul 2, 2024

zytx121 / Awesome-VLGFM

A Survey on Vision-Language Geo-Foundation Models (VLGFMs)

166 8 Updated May 24, 2025

InternLM / InternLM-XComposer

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,836 173 Updated May 20, 2025

penghao-wu / vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 609 39 Updated Jan 7, 2024

InternLM / xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,555 344 Updated May 20, 2025

InternLM / InternLM

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,909 485 Updated Feb 7, 2025

CAESAR-Radi / SIVED

16 1 Updated Dec 25, 2024

Chasel-Tsui / mmrotate-dcfl

Official implementation of the CVPR23 paper: Dynamic Coarse-to-Fine Learning for Oriented Tiny Object Detection

Python 116 3 Updated Nov 23, 2023

binary-husky / gpt_academic

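Provides a practical interactive interface for GPT, GLM, and other large language models, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; support for local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…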

Python 68,569 8,346 Updated May 6, 2025

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 16,355 1,495 Updated Sep 5, 2024

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 50,212 5,899 Updated Sep 18, 2024

Jittor / JDet

JDet is an object detection benchmark based on Jittor. It mainly focuses on aerial image object detection (oriented object detection).

Python 201 35 Updated Mar 2, 2025