TPAMI CVPR Special Section
The articles in this special issue include papers from the CVPR'11 conference which was held in Colorado Spring, CO, June 2011.
Efficient Human Pose Estimation from Single Depth Images
- Jamie Shotton,
- Ross Girshick,
- Andrew Fitzgibbon,
- Toby Sharp,
- Mat Cook,
- Mark Finocchio,
- Richard Moore,
- Pushmeet Kohli,
- Antonio Criminisi,
- Alex Kipman,
- Andrew Blake
We describe two new approaches to human pose estimation. Both can quickly and accurately predict the 3D positions of body joints from a single depth image without using any temporal information. The key to both approaches is the use of a large, ...
SfM with MRFs: Discrete-Continuous Optimization for Large-Scale Structure from Motion
Recent work in structure from motion (SfM) has built 3D models from large collections of images downloaded from the Internet. Many approaches to this problem use incremental algorithms that solve progressively larger bundle adjustment problems. These ...
Phrasal Recognition
In this paper, we introduce visual phrases, complex visual composites like "a person riding a horse." Visual phrases often display significantly reduced visual complexity compared to their component objects because the appearance of those objects can ...
Image-Based Separation of Reflective and Fluorescent Components Using Illumination Variant and Invariant Color
Traditionally, researchers tend to exclude fluorescence from color appearance algorithms in computer vision and image processing because of its complexity. In reality, fluorescence is a very common phenomenon observed in many objects, from gems and ...
Articulated Human Detection with Flexible Mixtures of Parts
We describe a method for articulated human detection and human pose estimation in static images based on a new representation of deformable part models. Rather than modeling articulation using a family of warped (rotated and foreshortened) templates, we ...
BabyTalk: Understanding and Generating Simple Image Descriptions
- Girish Kulkarni,
- Visruth Premraj,
- Vicente Ordonez,
- Sagnik Dhar,
- Siming Li,
- Yejin Choi,
- Alexander C. Berg,
- Tamara L. Berg
We present a system to automatically generate natural language descriptions from images. This system consists of two parts. The first part, content planning, smooths the output of computer vision-based detection and recognition algorithms with ...
Intrinsic Image Decomposition Using a Sparse Representation of Reflectance
Intrinsic image decomposition is an important problem that targets the recovery of shading and reflectance components from a single image. While this is an ill-posed problem on its own, we propose a novel approach for intrinsic image decomposition using ...
Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval
This paper addresses the problem of learning similarity-preserving binary codes for efficient similarity search in large-scale image collections. We formulate this problem in terms of finding a rotation of zero-centered data so as to minimize the ...
Localizing Parts of Faces Using a Consensus of Exemplars
We present a novel approach to localizing parts in images of human faces. The approach combines the output of local detectors with a nonparametric set of global models for the part locations based on over 1,000 hand-labeled exemplar images. By assuming ...
On Differential Photometric Reconstruction for Unknown, Isotropic BRDFs
This paper presents a comprehensive theory of photometric surface reconstruction from image derivatives in the presence of a general, unknown isotropic BRDF. We derive precise topological classes up to which the surface may be determined and specify ...
Paired Regions for Shadow Detection and Removal
In this paper, we address the problem of shadow detection and removal from single images of natural scenes. Differently from traditional methods that explore pixel or edge information, we employ a region-based approach. In addition to considering ...
Robust Visual Tracking Using Local Sparse Appearance Model and K-Selection
Online learned tracking is widely used for its adaptive ability to handle appearance changes. However, it introduces potential drifting problems due to the accumulation of errors during the self-updating, especially for occluded scenarios. The recent ...
Toward Wide-Angle Microvision Sensors
- Sanjeev J. Koppal,
- Ioannis Gkioulekas,
- Travis Young,
- Hyunsung Park,
- Kenneth B. Crozier,
- Geoffrey L. Barrows,
- Todd Zickler
Achieving computer vision on microscale devices is a challenge. On these platforms, the power and mass constraints are severe enough for even the most common computations (matrix manipulations, convolution, etc.) to be difficult. This paper proposes and ...
A Minimum Volume Covering Approach with a Set of Ellipsoids
A technique for adjusting a minimum volume set of covering ellipsoids technique is elaborated. Solutions to this problem have potential application in one-class classification and clustering problems. Its main original features are: 1) It avoids the ...
Contextualized Trajectory Parsing with Spatiotemporal Graph
This work investigates how to automatically parse object trajectories in surveillance videos, which aims at jointly solving three subproblems: 1) spatial segmentation, 2) temporal tracking, and 3) object categorization. We present a novel representation ...
Forward Basis Selection for Pursuing Sparse Representations over a Dictionary
The forward greedy selection algorithm of Frank and Wolfehas recently been applied with success to coordinate-wise sparse learning problems, characterized by a tradeoff between sparsity and accuracy. In this paper, we generalize this method to the setup ...
Pose-Robust Recognition of Low-Resolution Face Images
Face images captured by surveillance cameras usually have poor resolution in addition to uncontrolled poses and illumination conditions, all of which adversely affect the performance of face matching algorithms. In this paper, we develop a completely ...
Sparse Canonical Correlation Analysis: New Formulation and Algorithm
In this paper, we study canonical correlation analysis (CCA), which is a powerful tool in multivariate data analysis for finding the correlation between two sets of multidimensional variables. The main contributions of the paper are: 1) to reveal the ...
A Coprime Blur Scheme for Data Security in Video Surveillance
This paper presents a novel coprime blurred pair (CBP) model to improve data security in camera surveillance. While most previous approaches have focused on completely encrypting the video stream, we introduce a spatial encryption scheme by ...