IJCV: Vol 131, No 6

Volume 131, Issue 6, Jun 2023

Publisher:

Kluwer Academic Publishers
101 Philip Drive Assinippi Park Norwell, MA
United States

ISSN: 0920-5691

research-article

PhysFormer++: Facial Video-Based Physiological Measurement with SlowFast Temporal Difference Transformer

Pages 1307–1330 https://doi.org/10.1007/s11263-023-01758-1

Abstract

Remote photoplethysmography (rPPG), which aims at measuring heart activities and physiological signals from facial video without any contact, has great potential in many applications (e.g., remote healthcare and affective computing). Recent deep ...

research-article

Semantics-Guided Intra-Category Knowledge Transfer for Generalized Zero-Shot Learning

Pages 1331–1345 https://doi.org/10.1007/s11263-023-01767-0

Abstract

Zero-shot learning (ZSL) requires one to associate visual and semantic information observed from data of seen classes, so that test data of unseen classes can be recognized based on the described semantic representation. Aiming at synthesizing ...

research-article

SMG: A Micro-gesture Dataset Towards Spontaneous Body Gestures for Emotional Stress State Analysis

Pages 1346–1366 https://doi.org/10.1007/s11263-023-01761-6

Abstract

We explore using body gestures for hidden emotional state analysis. As an important non-verbal communicative fashion, human body gestures are capable of conveying emotional information during social communication. In previous works, efforts have ...

research-article

Public Access

Context-Driven Detection of Invertebrate Species in Deep-Sea Video

Pages 1367–1388 https://doi.org/10.1007/s11263-023-01755-4

Abstract

Each year, underwater remotely operated vehicles (ROVs) collect thousands of hours of video of unexplored ocean habitats revealing a plethora of information regarding biodiversity on Earth. However, fully utilizing this information remains a ...

research-article

Improved 3D Markerless Mouse Pose Estimation Using Temporal Semi-supervision

Pages 1389–1405 https://doi.org/10.1007/s11263-023-01756-3

Abstract

Three-dimensional markerless pose estimation from multi-view video is emerging as an exciting method for quantifying the behavior of freely moving animals. Nevertheless, scientifically precise 3D animal pose estimation remains challenging, ...

research-article

Semi-supervised Visual Tracking of Marine Animals Using Autonomous Underwater Vehicles

Pages 1406–1427 https://doi.org/10.1007/s11263-023-01762-5

Abstract

In-situ visual observations of marine organisms are crucial to developing behavioural understandings and their relations to their surrounding ecosystem. Typically, these observations are collected via divers, tags, and remotely-operated or human-...

research-article

A Minimal Solution for Image-Based Sphere Estimation

Pages 1428–1447 https://doi.org/10.1007/s11263-023-01766-1

Abstract

We propose a novel minimal solver for sphere fitting via its 2D central projection, i.e., a special ellipse. The input of the presented algorithm consists of contour points detected in a camera image. General ellipse fitting problems require five ...

research-article

Refractive Pose Refinement: Generalising the Geometric Relation between Camera and Refractive Interface

Pages 1448–1476 https://doi.org/10.1007/s11263-023-01763-4

Abstract

In this paper, we investigate absolute and relative pose estimation under refraction, which are essential problems for refractive structure from motion. To cope with refraction effects, we first formulate geometric constraints for establishing ...

research-article

Deep Memory-Augmented Proximal Unrolling Network for Compressive Sensing

Pages 1477–1496 https://doi.org/10.1007/s11263-023-01765-2

Abstract

Mapping a truncated optimization method into a deep neural network, deep proximal unrolling network has attracted attention in compressive sensing due to its good interpretability and high performance. Each stage in such networks corresponds to ...

research-article

Through Hawks’ Eyes: Synthetically Reconstructing the Visual Field of a Bird in Flight

Pages 1497–1531 https://doi.org/10.1007/s11263-022-01733-2

Abstract

Birds of prey rely on vision to execute flight manoeuvres that are key to their survival, such as intercepting fast-moving targets or navigating through clutter. A better understanding of the role played by vision during these manoeuvres is not ...

research-article

Public Access

Multi-view Tracking, Re-ID, and Social Network Analysis of a Flock of Visually Similar Birds in an Outdoor Aviary

Pages 1532–1549 https://doi.org/10.1007/s11263-023-01768-z

Abstract

The ability to capture detailed interactions among individuals in a social group is foundational to our study of animal behavior and neuroscience. Recent advances in deep learning and computer vision are driving rapid progress in methods that can ...

research-article

Extreme Low-Resolution Action Recognition with Confident Spatial-Temporal Attention Transfer

Pages 1550–1565 https://doi.org/10.1007/s11263-023-01771-4

Abstract

Action recognition on extreme low-resolution videos, e.g., a resolution of $12 \times 16$ pixels, plays a vital role in far-view surveillance and privacy-preserving multimedia analysis. As low-resolution videos often only contain limited information, it is ...

research-article

Learning Enriched Hop-Aware Correlation for Robust 3D Human Pose Estimation

Pages 1566–1583 https://doi.org/10.1007/s11263-023-01770-5

Abstract

Graph convolution networks (GCNs) based methods for 3D human pose estimation usually aggregate immediate features of single-hop nodes, which are unaware of the correlation of multi-hop nodes and therefore neglect long-range dependency for ...

research-article

RELAX: Representation Learning Explainability

Pages 1584–1610 https://doi.org/10.1007/s11263-023-01773-2

Abstract

Despite the significant improvements that self-supervised representation learning has led to when learning from unlabeled data, no methods have been developed that explain what influences the learned representation. We address this need through ...

International Journal of Computer Vision
