Author: Yu, Fisher : Search

Applied Filters

Publication Date

People

32 Results for: Author: Yu, FisherEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,855,380 records)|Limit your search to The ACM Full-Text Collection (777,925 records)

Showing 1 - 20of32 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

Article
November 2024
HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution
Computer Vision – ECCV 2024Pages 483–500https://doi.org/10.1007/978-3-031-73661-2_27
Abstract
Transformers have exhibited promising performance in computer vision tasks including image super-resolution (SR). However, popular transformer-based SR methods often employ window self-attention with quadratic computational complexity to window ...
0
Metrics
Total Citations0
Article
November 2024
Gaussian Grouping: Segment and Edit Anything in 3D Scenes
Computer Vision – ECCV 2024Pages 162–179https://doi.org/10.1007/978-3-031-73397-0_10
Abstract
The recent Gaussian Splatting achieves high-quality and real-time novel-view synthesis of the 3D scenes. However, it is solely concentrated on the appearance and geometry modeling, while lacking in fine-grained object-level scene understanding. To ...
0
Metrics
Total Citations0
Article
October 2024
Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Appearance Graphs
Computer Vision – ECCV 2024Pages 1–18https://doi.org/10.1007/978-3-031-73242-3_1
Abstract
The supervision of state-of-the-art multiple object tracking (MOT) methods requires enormous annotation efforts to provide bounding boxes for all frames of all videos, and instance IDs to associate them through time. To this end, we introduce ...
0
Metrics
Total Citations0
research-article
July 2024
Lightweight image super-resolution via flexible meta pruning
ICML'24: Proceedings of the 41st International Conference on Machine LearningArticle No.: 2495, Pages 60305–60314

Lightweight image super-resolution (SR) methods have obtained promising results with moderate model complexity. These approaches primarily focus on a lightweight architecture design, but neglect to further reduce network redundancy. While some model ...
0
Metrics
Total Citations0
research-article
July 2024
Flexible residual binarization for image super-resolution
ICML'24: Proceedings of the 41st International Conference on Machine LearningArticle No.: 2468, Pages 59731–59740

Binarized image super-resolution (SR) has attracted much research attention due to its potential to drastically reduce parameters and operations. However, most binary SR works binarize network weights directly, which hinders high-frequency information ...
0
Metrics
Total Citations0
research-article
December 2023
Real-time motion prediction via heterogeneous polyline transformer with relative pose encoding
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 2507, Pages 57481–57499

The real-world deployment of an autonomous driving system requires its components to run on-board and in real-time, including the motion prediction module that predicts the future trajectories of surrounding traffic participants. Existing agent-centric ...
0
Metrics
Total Citations0
1
Supplementary Material
Additional material
research-article
December 2023
QuantSR: accurate low-bit quantization for efficient image super-resolution
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 2483, Pages 56838–56848

Low-bit quantization in image super-resolution (SR) has attracted copious attention in recent research due to its ability to reduce parameters and operations significantly. However, many quantized SR models suffer from accuracy degradation compared to ...
0
Metrics
Total Citations0
research-article
December 2023
BiMatting: efficient video matting via binarization
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 1876, Pages 43307–43321

Real-time video matting on edge devices faces significant computational resource constraints, limiting the widespread use of video matting in applications such as online conferences and short-form video production. Binarization is a powerful compression ...
0
Metrics
Total Citations0
research-article
December 2023
Segment anything in high quality
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 1303, Pages 29914–29934

The recent Segment Anything Model (SAM) represents a big leap in scaling up segmentation models, allowing for powerful zero-shot capabilities and flexible prompting. Despite being trained with 1.1 billion masks, SAM's mask prediction quality falls short ...
0
Metrics
Total Citations0
1
Supplementary Material
Additional material
research-article
December 2023
QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 45, Issue 12Pages 15380–15393https://doi.org/10.1109/TPAMI.2023.3301975
Similarity learning has been recognized as a crucial step for object tracking. However, existing multiple object tracking methods only use sparse ground truth matching as the training objective, while ignoring the majority of the informative regions in ...
10
Metrics
Total Citations10
research-article
November 2023
Unifying Flow, Stereo and Depth Estimation
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 45, Issue 11Pages 13941–13958https://doi.org/10.1109/TPAMI.2023.3298645
We present a unified formulation and model for three motion and 3D perception tasks: optical flow, rectified stereo matching and unrectified stereo depth estimation from posed images. Unlike previous specialized architectures for each specific task, we ...
15
Metrics
Total Citations15
Article
September 2023
COOLer: Class-Incremental Learning for Appearance-Based Multiple Object Tracking
Pattern RecognitionPages 443–458https://doi.org/10.1007/978-3-031-54605-1_29
Abstract
Continual learning allows a model to learn multiple tasks sequentially while retaining the old knowledge without the training data of the preceding tasks. This paper extends the scope of continual learning research to class-incremental learning ...
1
Metrics
Total Citations1
research-article
July 2023
BiBench: benchmarking and analyzing network binarization
ICML'23: Proceedings of the 40th International Conference on Machine LearningArticle No.: 1177, Pages 28351–28388

Network binarization emerges as one of the most promising compression approaches offering extraordinary computation and memory savings by minimizing the bit-width. However, recent research has shown that applying existing binarization algorithms to ...
0
Metrics
Total Citations0
research-article
July 2023
Scaling vision transformers to 22 billion parameters
ICML'23: Proceedings of the 40th International Conference on Machine LearningArticle No.: 296, Pages 7480–7512

The scaling of Transformers has driven breakthrough capabilities for language models. At present, the largest large language models (LLMs) contain upwards of 100B parameters. Vision Transformers (ViT) have introduced the same architecture to image and ...
1
Metrics
Total Citations1
Article
October 2022
The Tenth Visual Object Tracking VOT2022 Challenge Results
Computer Vision – ECCV 2022 WorkshopsPages 431–460https://doi.org/10.1007/978-3-031-25085-9_25
Abstract
The Visual Object Tracking challenge VOT2022 is the tenth annual tracker benchmarking activity organized by the VOT initiative. Results of 93 entries are presented; many are state-of-the-art trackers published at major computer vision conferences ...
2
Metrics
Total Citations2
Article
October 2022
SAGA: Stochastic Whole-Body Grasping with Contact
Computer Vision – ECCV 2022Pages 257–274https://doi.org/10.1007/978-3-031-20068-7_15
Abstract
The synthesis of human grasping has numerous applications including AR/VR, video games and robotics. While methods have been proposed to generate realistic hand–object interaction for object grasping and manipulation, these typically only consider ...
4
Metrics
Total Citations4
Article
October 2022
Tracking Every Thing in the Wild
Computer Vision – ECCV 2022Pages 498–515https://doi.org/10.1007/978-3-031-20047-2_29
Abstract
Current multi-category Multiple Object Tracking (MOT) metrics use class labels to group tracking results for per-class evaluation. Similarly, MOT methods typically only associate objects with the same class predictions. These two prevalent ...
5
Metrics
Total Citations5
Article
October 2022
TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation
Computer Vision – ECCV 2022Pages 19–35https://doi.org/10.1007/978-3-031-19830-4_2
Abstract
Traditional domain adaptive semantic segmentation addresses the task of adapting a model to a novel target domain under limited or no additional supervision. While tackling the input domain gap, the standard domain adaptation settings assume no ...
3
Metrics
Total Citations3
Article
October 2022
Learning Online Multi-sensor Depth Fusion
Computer Vision – ECCV 2022Pages 87–105https://doi.org/10.1007/978-3-031-19824-3_6
Abstract
Many hand-held or mixed reality devices are used with a single sensor for 3D reconstruction, although they often comprise multiple sensors. Multi-sensor depth fusion is able to substantially improve the robustness and accuracy of 3D reconstruction ...
0
Metrics
Total Citations0
Article
October 2022
Video Mask Transfiner for High-Quality Video Instance Segmentation
Computer Vision – ECCV 2022Pages 731–747https://doi.org/10.1007/978-3-031-19815-1_42
Abstract
While Video Instance Segmentation (VIS) has seen rapid progress, current approaches struggle to predict high-quality masks with accurate boundary details. Moreover, the predicted segmentations often fluctuate over time, suggesting that temporal ...
3
Metrics
Total Citations3

Search Results

Applied Filters

Publication Date

People

Authors

Institutions

Publications

Journal/Magazine Names

All Publications

Content Type

Supplemental Material Type

Publisher

Proceedings Series

ACM SIG Sponsors

Results

Caption

HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution

Gaussian Grouping: Segment and Edit Anything in 3D Scenes

Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Appearance Graphs

Lightweight image super-resolution via flexible meta pruning

Flexible residual binarization for image super-resolution

Real-time motion prediction via heterogeneous polyline transformer with relative pose encoding

QuantSR: accurate low-bit quantization for efficient image super-resolution

BiMatting: efficient video matting via binarization

Segment anything in high quality

QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking

Unifying Flow, Stereo and Depth Estimation

COOLer: Class-Incremental Learning for Appearance-Based Multiple Object Tracking

BiBench: benchmarking and analyzing network binarization

Scaling vision transformers to 22 billion parameters

The Tenth Visual Object Tracking VOT2022 Challenge Results

SAGA: Stochastic Whole-Body Grasping with Contact

Tracking Every Thing in the Wild

TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation

Learning Online Multi-sensor Depth Fusion

Video Mask Transfiner for High-Quality Video Instance Segmentation