Stars
Language-Driven Semantic Segmentation
High-Resolution Image Synthesis with Latent Diffusion Models
Train the HRNet model on ImageNet
Recent weakly supervised semantic segmentation paper
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
the implementation of: A Multi-Strategy Contrastive Learning Framework for Weakly Supervised Semantic Segmentation (MuSCLe)
Photovoltaic Panel (PVP) Dataset: a public dataset for extracting high-quality photovoltaic panels in large-scale systems
[CVPR 2022] Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers
Awesome Knowledge-Distillation for CV
Code release for ConvNeXt V2 model
这是一个segformer-pytorch的源码,可以用于训练自己的模型。
Python wrapper to Philipp Krähenbühl's dense (fully connected) CRFs with gaussian edge potentials.
Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation, CVPR 2018
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Code for "Efficient and Controllable Remote Sensing Fake Sample Generation Based on Diffusion Model"