Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleOctober 2024
Diverse Consensuses Paired with Motion Estimation-Based Multi-Model Fitting
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 9281–9290https://doi.org/10.1145/3664647.3681646Multi-model fitting aims to robustly estimate the parameters of various model instances in data contaminated by noise and outliers. Most previous works employ only a single type of consensus or implicit fusion model to represent the correlation between ...
- research-articleApril 2024
GLOCAL: A self-supervised learning framework for global and local motion estimation
Pattern Recognition Letters (PTRL), Volume 178, Issue CPages 91–97https://doi.org/10.1016/j.patrec.2023.12.024AbstractMotions in videos are typically a mixture of local dynamic object motions and global camera motion, which are inconsistent in some cases, and even interfere with each other, causing difficulties in various downstream applications, such as video ...
Highlights- A unified framework to estimate global and local motion simultaneously.
- An implicit–explicit bottleneck for global motion without irrelevant information.
- Two loss functions to learn global and local motion in a self-supervised ...
Display Omitted
- research-articleOctober 2024
Fast Subpixel Motion Estimation Based on Human Visual System
International Journal of Intelligent Systems (IJIS), Volume 2024https://doi.org/10.1155/2024/6168548More than 80% of video coding times are consumed by motion estimation calculations, which are the most complex aspect of the process. This method eliminates temporal redundancies in a video sequence to achieve maximum compression. Numerous efforts have ...
- research-articleMarch 2024
A search pattern based on the repeated motion vectors components for the fast block matching motion estimation in temporal coding
International Journal of Computational Science and Engineering (IJCSE), Volume 27, Issue 2Pages 133–141https://doi.org/10.1504/ijcse.2024.137281To reduce the amount of unnecessary data in a video's timeline, block-based motion estimate is routinely utilised. However, a significant reduction in the computational complexity of motion estimation remains a significant problem. In this manuscript, a ...
- research-articleSeptember 2023
Scene flow estimation from 3D point clouds based on dual‐branch implicit neural representations
AbstractRecently, online optimisation‐based scene flow estimation has attracted significant attention due to its strong domain adaptivity. Although online optimisation‐based methods have made significant advances, the performance is far from ...
The authors introduce a dual‐branch MLP‐based architecture to encode implicit scene representations from a source 3D point cloud, which can additionally synthesise a target 3D point cloud. Thus, the mapping function between the source and synthesised ...
-
- research-articleJuly 2023
Video Reconstruction Method Based on Spatio-Temporal Correlation
CNIOT '23: Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of ThingsPages 488–493https://doi.org/10.1145/3603781.3603867As one of the underlying tasks of computer vision, image and video reconstruction have an extensive research value. Video is interpreted as a sequence of images arranged based on a timeline. The spatio-temporal correlation leads to adjacent sequences ...
- research-articleMay 2023
SA‐FlowNet: Event‐based self‐attention optical flow estimation with spiking‐analogue neural networks
AbstractInspired by biological vision mechanism, event‐based cameras have been developed to capture continuous object motion and detect brightness changes independently and asynchronously, which overcome the limitations of traditional frame‐based cameras. ...
As event camera outputs are sparse and uneven, dense scene information is difficult to generate and the local receptive fields of the neural network also lead to poor moving objects tracking. To address these issues, we propose an improved event‐based ...
- research-articleFebruary 2023
Video2mesh: 3D human pose and shape recovery by a temporal convolutional transformer network
AbstractFrom a 2D video of a person in action, human mesh recovery aims to infer the 3D human pose and shape frame by frame. Despite progress on video‐based human pose and shape estimation, it is still challenging to guarantee high accuracy and smoothness ...
From a video of a person in action, human mesh recovery aims to infer the 3D human pose and shape. We propose a Video2mesh, a temporal convolutional transformer (TConvTransformer) network which is able to recover accurate and smooth human mesh from 2D ...
- research-articleJanuary 2023
Multi‐task framework of precipitation nowcasting
CAAI Transactions on Intelligence Technology (CIT2), Volume 8, Issue 4Pages 1350–1363https://doi.org/10.1049/cit2.12184AbstractPrecipitation forecasting plays an important role in disaster warning, agricultural production, and other fields. To solve this issue, some deep learning methods are proposed to forecast future radar echo images and convert them into rainfall ...
- research-articleMay 2022
Characterization of surface motion patterns in highly deformable soft tissue organs from dynamic MRI: An application to assess 4D bladder motion
Computer Methods and Programs in Biomedicine (CBIO), Volume 218, Issue Chttps://doi.org/10.1016/j.cmpb.2022.106708Highlights- Non-invasive characterization of bladder dynamics during deep respiratory movements from dynamic Magnetic Resonance Imaging.
Background and objectives: Dynamic Magnetic Resonance Imaging (MRI) may capture temporal anatomical changes in soft tissue organs with high-contrast but the obtained sequences usually suffer from limited volume coverage ...
- research-articleMarch 2022
An Efficient Multilevel Transform-Domain Partial Distortion Search Algorithm
Pattern Recognition and Image Analysis (SPPRIA), Volume 32, Issue 1Pages 45–56https://doi.org/10.1134/S1054661822010102AbstractThis paper proposes a fast partial distortion-based block matching motion estimation algorithm in Walsh–Hadamard. The proposed algorithm divides the current block into subblocks and provides a sequence of fine-partial distortions to reject the ...
- research-articleMarch 2022
Block Matching Algorithms for the Estimation of Motion in Image Sequences: Analysis
Pattern Recognition and Image Analysis (SPPRIA), Volume 32, Issue 1Pages 33–44https://doi.org/10.1134/S1054661822010072AbstractSeveral video coding standards and techniques have been introduced for multimedia applications, particularly h.26x series for video processing. These standards employ motion estimation process for reducing the amount of data that is required to ...
- surveyFebruary 2022
Survey on Digital Video Stabilization: Concepts, Methods, and Challenges
ACM Computing Surveys (CSUR), Volume 55, Issue 3Article No.: 47, Pages 1–37https://doi.org/10.1145/3494525Digital video stabilization is a challenging task that aims to transform a potentially shaky video into a pleasant one by smoothing the camera trajectory. Despite the various works found in the literature addressing this task, their organization and ...
- research-articleJanuary 2022
An improved motion estimation criterion for temporal coding of video
International Journal of Computational Science and Engineering (IJCSE), Volume 25, Issue 3Pages 308–314https://doi.org/10.1504/ijcse.2022.123120The size of video data is growing exponentially worldwide and hence there is a need for better video coding standards. MPEG and H.26X have provided several standards for video coding. The latest and effective video coding standards are AVC, HEVC, and AV1. ...
- research-articleDecember 2021
Neural frame interpolation for rendered content
- Karlis Martins Briedis,
- Abdelaziz Djelouah,
- Mark Meyer,
- Ian McGonigal,
- Markus Gross,
- Christopher Schroers
ACM Transactions on Graphics (TOG), Volume 40, Issue 6Article No.: 239, Pages 1–13https://doi.org/10.1145/3478513.3480553The demand for creating rendered content continues to drastically grow. As it often is extremely computationally expensive and thus costly to render high-quality computer-generated images, there is a high incentive to reduce this computational burden. ...
- research-articleOctober 2021
GLM-Net: Global and Local Motion Estimation via Task-Oriented Encoder-Decoder Structure
MM '21: Proceedings of the 29th ACM International Conference on MultimediaPages 4211–4219https://doi.org/10.1145/3474085.3475556In this work, we study the problem of separating the global camera motion and the local dynamic motion from an optical flow. Previous methods either estimate global motions by a parametric model, such as a homography, or estimate both of them by an ...
- research-articleMay 2021
Accurate localization of moving objects in dynamic environment for small unmanned aerial vehicle platform using global averaging
AbstractSmall unmanned aerial vehicles (UAVs) have developed rapidly and are widely used for disaster relief, traffic monitoring and military surveillance. To perform these tasks better, it is necessary to improve the environmental perception ability of ...
- research-articleJanuary 2021
Methods for forgery detection in digital forensics
International Journal of Electronic Security and Digital Forensics (IJESDF), Volume 13, Issue 5Pages 528–547https://doi.org/10.1504/ijesdf.2021.117310The information present in the footage/video clip is one of the solid evidence of the event at the incidents. Therefore, visual media crime-scene investigation has risen as a crucial research field, which fundamentally promotes techniques to deal with ...
- research-articleDecember 2020
A Fast Compression Framework Based on 3D Point Cloud Data for Telepresence
International Journal of Automation and Computing (SPIJAC), Volume 17, Issue 6Pages 855–866https://doi.org/10.1007/s11633-020-1240-5AbstractIn this paper, a novel compression framework based on 3D point cloud data is proposed for telepresence, which consists of two parts. One is implemented to remove the spatial redundancy, i.e., a robust Bayesian framework is designed to track the ...
- research-articleOctober 2020
Stacked residual blocks based encoder–decoder framework for human motion prediction
Cognitive Computation and Systems (CCS2), Volume 2, Issue 4Pages 242–246https://doi.org/10.1049/ccs.2020.0008Human motion prediction is an important and challenging task in computer vision with various applications. Recurrent neural networks (RNNs) and convolutional neural networks (CNNs) have been proposed to address this challenging task. However, RNNs ...