-
Tongji University
- Shanghai
-
13:49
(UTC -12:00)
Stars
[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).
[CVPR 2025] Towards Satellite Image Road Graph Extraction: A Global-Scale Dataset and A Novel Method
[CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…
Provide with pre-build flash-attention package wheels using GitHub Actions
[IEEE TGRS 2024] ChangeMamba: Remote Sensing Change Detection Based on Spatio-Temporal State Space Model
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
This project provides modules to crawl and process conference call transcripts and retrieve call recordings from SeekingAlpha.com.
A review of change detection methods, including codes and open data sets for deep learning. From paper: change detection based on artificial intelligence: state-of-the-art and challenges.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
[ECCV-2020-oral]-Semantic Flow for Fast and Accurate Scene Parsing
⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end…
PaddleSlim is an open-source library for deep model compression and architecture search.
TPAMI:Frequency-aware Feature Fusion for Dense Image Prediction
(IJCV2024 & ICCV2023) LSKNet: A Foundation Lightweight Backbone for Remote Sensing
OpenMMLab Detection Toolbox and Benchmark
moving object detection for satellite videos.
A Library for Advanced Deep Time Series Models.
Official Repo of NeurIPS '21: "Trust, but Verify: Cross-Modality Fusion for HD Map Change Detection"
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Speech Recognition using DeepSpeech2.
Code release for CVPR'24 submission 'OmniGlue'
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Reference implementations of MLPerf™ inference benchmarks