Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleOctober 2024
LDCNet: Long-Distance Context Modeling for Large-Scale 3D Point Cloud Scene Semantic Segmentation
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 1321–1330https://doi.org/10.1145/3664647.3680716Large-scale point cloud semantic segmentation is a challenging task in 3D computer vision. A key challenge is how to resolve ambiguities arising from locally high inter-class similarity. In this study, we introduce a solution by modeling long-distance ...
- research-articleFebruary 2024
SasWOT: real-time semantic segmentation architecture search without training
AAAI'24/IAAI'24/EAAI'24: Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial IntelligenceArticle No.: 858, Pages 7722–7730https://doi.org/10.1609/aaai.v38i7.28606In this paper, we present SasWOT, the first training-free Semantic segmentation Architecture Search (SAS) framework via an auto-discovery proxy. Semantic segmentation is widely used in many real-time applications. For fast inference and memory efficiency,...
- research-articleFebruary 2023
High-to-low-level feature matching and complementary information fusion for reference-based image super-resolution
The Visual Computer: International Journal of Computer Graphics (VISC), Volume 40, Issue 1Pages 99–108https://doi.org/10.1007/s00371-023-02768-3AbstractThe aim of the reference-based image super-resolution (RefSR) is to reconstruct high-resolution (HR) when a reference (Ref) image with similar content as that of the low-resolution (LR) input is given. In the task, the quality of existing ...
- research-articleNovember 2022
Multilayered stitch generating for random-needle embroidery
The Visual Computer: International Journal of Computer Graphics (VISC), Volume 38, Issue 11Pages 3667–3679https://doi.org/10.1007/s00371-021-02195-2AbstractRandom-needle embroidery is a kind of traditional Chinese embroidery where artists reproduce the original color image using multilayered intersecting stitches and overlapped deliberately. In other words, artists embroider intersecting stitches ...
- research-articleOctober 2022
Active Patterns Perceived for Stochastic Video Prediction
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 5961–5969https://doi.org/10.1145/3503161.3547770Predicting future scenes based on historical frames is challenging, especially when it comes to the complex uncertainty in nature. We observe that there is a divergence between spatial-temporal variations of active patterns and non-active patterns in a ...
-
- research-articleSeptember 2022
Fine-grained traffic video vehicle recognition based orientation estimation and temporal information
Multimedia Tools and Applications (MTAA), Volume 82, Issue 9Pages 13745–13763https://doi.org/10.1007/s11042-022-13811-1AbstractIn this paper, we propose a method for fine-grained vehicle recognition in traffic surveillance video. Compared with general theory about single image fine-grained recognition, this method focuses on multi-frame information combination and the ...
- research-articleJune 2022
Weakly Supervised Fine-grained Recognition based on Combined Learning for Small Data and Coarse Label
ICMR '22: Proceedings of the 2022 International Conference on Multimedia RetrievalPages 194–201https://doi.org/10.1145/3512527.3531419Learning with weak supervision already becomes one of the research trends in fine-grained image recognition. These methods aim to learn feature representation in the case of less manual cost or expert knowledge. Most existing weakly supervised methods ...
- ArticleJune 2022
Category-Sensitive Incremental Learning for Image-Based 3D Shape Reconstruction
AbstractRecovering the three-dimensional shape of an object from a two-dimensional image is an important research topic in computer vision. Traditional methods use stereo vision or inter-image matching to obtain geometric information about the object, but ...
- research-articleMay 2022
Video supervised for 3D reconstruction from single image
Multimedia Tools and Applications (MTAA), Volume 81, Issue 11Pages 15061–15083https://doi.org/10.1007/s11042-022-12459-1AbstractAs a long-standing ill-posed problem, 3D reconstruction from a single image is an important research topic in computer vision. The information in a single image can represent an infinite number of possible three-dimensional shapes. To recover ...
- research-articleJuly 2021
Dyeing creation: a textile pattern discovery and fabric image generation method
Multimedia Tools and Applications (MTAA), Volume 80, Issue 17Pages 26511–26530https://doi.org/10.1007/s11042-021-10902-3AbstractCreating different textile patterns to generate printable fabric images is a difficult image processing task. To accomplish this task, we propose a novel framework for dyeing creation, which allows non-professionals to design individual fabric ...
- research-articleApril 2021
A self-supervised method of single-image depth estimation by feeding forward information using max-pooling layers
The Visual Computer: International Journal of Computer Graphics (VISC), Volume 37, Issue 4Pages 815–829https://doi.org/10.1007/s00371-020-01832-6AbstractWe propose an encoder–decoder CNN framework to predict depth from one single image in a self-supervised manner. To this aim, we design three kinds of encoder based on the recent advanced deep neural network and one kind of decoder which can ...
- research-articleFebruary 2021
Crowd aware summarization of surveillance videos by deep reinforcement learning
Multimedia Tools and Applications (MTAA), Volume 80, Issue 4Pages 6121–6141https://doi.org/10.1007/s11042-020-09888-1AbstractSurveillance videos which record crowd behaviors have dramatically increased due to the wide applications. A quick view of such crowd surveillance video in a constrained time is an increasing demand because it always contain a huge number of ...
- research-articleFebruary 2021
A structural-constraint 3D point clouds segmentation adversarial method
The Visual Computer: International Journal of Computer Graphics (VISC), Volume 37, Issue 2Pages 325–340https://doi.org/10.1007/s00371-020-01801-zAbstractPoint cloud segmentation is a key task of shape analysis with various applications. Existing segmentation methods usually apply a single segmentation network to compute point-wise loss for network training. The point-wise loss, which will lose ...
- research-articleNovember 2020
Saliency based multiple object cosegmentation by ensemble MIML learning
Multimedia Tools and Applications (MTAA), Volume 79, Issue 41-42Pages 31299–31328https://doi.org/10.1007/s11042-020-09458-5AbstractAs an interesting and emerging topic, multiple foreground cosegmentation (MFC) aims at extracting a finite number of common objects from an image collection, which is useful to variety of visual media applications. Although a number of approaches ...
- research-articleOctober 2020
Stable Video Style Transfer Based on Partial Convolution with Depth-Aware Supervision
MM '20: Proceedings of the 28th ACM International Conference on MultimediaPages 2445–2453https://doi.org/10.1145/3394171.3413526As a very important research issue in digital media art, neural learning based video style transfer has attracted more and more attention. A lot of recent works import optical flow method to original image style transfer framework to preserve frame-...
- research-articleMay 2020
Progressive decomposition: a method of coarse-to-fine image parsing using stacked networks
Multimedia Tools and Applications (MTAA), Volume 79, Issue 19-20Pages 13379–13402https://doi.org/10.1007/s11042-019-08288-4AbstractTo parse images into fine-grained semantic parts, the complex elements will put it in trouble when using off-the-shelf semantic segmentation networks, because it is difficult for them to utilize the contextual information of fine-grained parts. In ...
- ArticleJanuary 2020
Single View Depth Estimation via Dense Convolution Network with Self-supervision
AbstractDepth estimation from single image by deep learning is a hot topic of research nowadays. Existing methods mainly focus on learning neural network supervised by ground truth. This paper proposes a method for single view depth estimation based on ...
- research-articleDecember 2019
Direction-aware neural style transfer with texture enhancement
AbstractNeural learning methods have been shown to be effective in style transfer. These methods, which are called NST, aim to synthesize a new image that retains the high-level structure of a content image while keeps the low-level features ...
- research-articleOctober 2019
Co-saliency Detection Based on Hierarchical Consistency
MM '19: Proceedings of the 27th ACM International Conference on MultimediaPages 1392–1400https://doi.org/10.1145/3343031.3351016As an interesting and emerging topic, co-saliency detection aims at discovering common and salient objects in a group of related images, which is useful to variety of visual media applications. Although a number of approaches have been proposed to ...
- ArticleAugust 2019
Detecting robust co-saliency with recurrent co-attention neural network
IJCAI'19: Proceedings of the 28th International Joint Conference on Artificial IntelligencePages 818–825Effective feature representations which should not only express the image's individual properties, but also reflect the interaction among group images are essentially crucial for robust co-saliency detection. This paper proposes a novel deep learning co-...