Author: Shrivastava, Abhinav : Search

Applied Filters

Publication Date

32 Results for: Author: Shrivastava, AbhinavEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,855,380 records)|Limit your search to The ACM Full-Text Collection (777,925 records)

Showing 1 - 20of32 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
Open Access
October 2024
Quantifying NBA Shot Quality: A Deep Network Approach
MMSports '24: Proceedings of the 7th ACM International Workshop on Multimedia Content Analysis in SportsPages 91–95https://doi.org/10.1145/3689061.3689068

Since the introduction of player positional tracking data to the NBA in 2013, the field of basketball analytics has been steadily developing. As such, more and more teams utilize data-driven approaches to maximize the potential for their team to score a ...
0
165
Metrics
Total Citations0
Total Downloads165
Last 12 Months165
Last 6 weeks67
View online with eReader
PDF
Article
November 2024
Investigating Style Similarity in Diffusion Models
Computer Vision – ECCV 2024Pages 143–160https://doi.org/10.1007/978-3-031-72848-8_9
Abstract
Generative models are now widely used by graphic designers and artists. Prior works have shown that these models remember and often replicate content from their training data during generation. Hence as their proliferation increases, it has become ...
0
Metrics
Total Citations0
Article
November 2024
Do Text-Free Diffusion Models Learn Discriminative Visual Representations?
Computer Vision – ECCV 2024Pages 253–272https://doi.org/10.1007/978-3-031-73027-6_15
Abstract
Diffusion models have proven to be state-of-the-art methods for generative tasks. These models involve training a U-Net to iteratively predict and remove noise, and the resulting model can synthesize high-fidelity, diverse, novel images. However, ...
0
Metrics
Total Citations0
Article
November 2024
Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models
Computer Vision – ECCV 2024Pages 332–349https://doi.org/10.1007/978-3-031-73024-5_20
Abstract
Image customization has been extensively studied in text-to-image (T2I) diffusion models, leading to impressive outcomes and applications. With the emergence of text-to-video (T2V) diffusion models, its temporal counterpart, motion customization, ...
0
Metrics
Total Citations0
Article
November 2024
Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics
Computer Vision – ECCV 2024Pages 285–302https://doi.org/10.1007/978-3-031-72633-0_16
Abstract
Implicit Neural Networks (INRs) have emerged as powerful representations to encode all forms of data, including images, videos, audios, and scenes. With video, many INRs for video have been proposed for the compression task, and recent methods ...
0
Metrics
Total Citations0
Article
November 2024
EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS
Computer Vision – ECCV 2024Pages 54–71https://doi.org/10.1007/978-3-031-73036-8_4
Abstract
Recently, 3D Gaussian splatting (3D-GS) has gained popularity in novel-view scene synthesis. It addresses the challenges of lengthy training times and slow rendering speeds associated with Neural Radiance Fields (NeRFs). Through rapid, ...
0
Metrics
Total Citations0
Article
October 2024
LEIA: Latent View-Invariant Embeddings for Implicit 3D Articulation
Computer Vision – ECCV 2024Pages 210–227https://doi.org/10.1007/978-3-031-72640-8_12
Abstract
Neural Radiance Fields (NeRFs) have revolutionized the reconstruction of static scenes and objects in 3D, offering unprecedented quality. However, extending NeRFs to model dynamic objects or object articulations remains a challenging problem. ...
0
Metrics
Total Citations0
Article
October 2024
Trajectory-Aligned Space-Time Tokens for Few-Shot Action Recognition
Computer Vision – ECCV 2024Pages 474–493https://doi.org/10.1007/978-3-031-72764-1_27
Abstract
We propose a simple yet effective approach for few-shot action recognition, emphasizing the disentanglement of motion and appearance representations. By harnessing recent progress in tracking, specifically point trajectories and self-supervised ...
0
Metrics
Total Citations0
Article
October 2024
Fast Encoding and Decoding for Implicit Video Representation
Computer Vision – ECCV 2024Pages 402–418https://doi.org/10.1007/978-3-031-72933-1_23
Abstract
Despite the abundant availability and content richness for video data, its high-dimensionality poses challenges for video research. Recent advancements have explored the implicit representation for videos using neural networks, demonstrating ... $^{}$
0
Metrics
Total Citations0
Article
September 2024
LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors
Computer Vision – ECCV 2024Pages 110–128https://doi.org/10.1007/978-3-031-72667-5_7
Abstract
We present a simple self-supervised method to enhance the performance of ViT features for dense downstream tasks. Our Lightweight Feature Transform (LiFT) is a straightforward and compact postprocessing network that can be applied to enhance the ...
0
Metrics
Total Citations0
research-article
December 2023
Video dynamics prior: an internal learning approach for robust video enhancements
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 1483, Pages 34228–34246

In this paper, we present a novel robust framework for low-level vision tasks, including denoising, object removal, frame interpolation, and super-resolution, that does not require any external training data corpus. Our proposed approach directly learns ...
0
Metrics
Total Citations0
research-article
June 2023
Leveraging Hand-Object Interactions in Assistive Egocentric Vision
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 45, Issue 6Pages 6820–6831https://doi.org/10.1109/TPAMI.2021.3123303
Egocentric vision holds great promise for increasing access to visual information and improving the quality of life for blind people. While we strive to improve recognition performance, it remains difficult to identify which object is of interest to the ...
0
Metrics
Total Citations0
Article
October 2022
Neural Space-Filling Curves
Computer Vision – ECCV 2022Pages 418–434https://doi.org/10.1007/978-3-031-20071-7_25
Abstract
We present Neural Space-filling Curves (SFCs), a data-driven approach to infer a context-based scan order for a set of images. Linear ordering of pixels forms the basis for many applications such as video scrambling, compression, and auto-...
0
Metrics
Total Citations0
Article
October 2022
Burn After Reading: Online Adaptation for Cross-domain Streaming Data
Computer Vision – ECCV 2022Pages 404–422https://doi.org/10.1007/978-3-031-19827-4_24
Abstract
In the context of online privacy, many methods propose complex security preserving measures to protect sensitive data. In this paper we note that: not storing any sensitive data is the best form of security. We propose an online framework called “...
0
Metrics
Total Citations0
Article
October 2022
Improving Closed and Open-Vocabulary Attribute Prediction Using Transformers
Computer Vision – ECCV 2022Pages 201–219https://doi.org/10.1007/978-3-031-19806-9_12
Abstract
We study recognizing attributes for objects in visual scenes. We consider attributes to be any phrases that describe an object’s physical and semantic properties, and its relationships with other objects. Existing work studies attribute prediction ...
3
Metrics
Total Citations3
Article
October 2022
Learning Semantic Correspondence with Sparse Annotations
Computer Vision – ECCV 2022Pages 267–284https://doi.org/10.1007/978-3-031-19781-9_16
Abstract
Finding dense semantic correspondence is a fundamental problem in computer vision, which remains challenging in complex scenes due to background clutter, extreme intra-class variation, and a severe lack of ground truth. In this paper, we aim to ...
2
Metrics
Total Citations2
research-article
December 2021
PatchGame: learning to signal mid-level patches in referential games
NIPS '21: Proceedings of the 35th International Conference on Neural Information Processing SystemsArticle No.: 1992, Pages 26015–26027

We study a referential game (a type of signaling game) where two agents communicate with each other via a discrete bottleneck to achieve a common goal. In our referential game, the goal of the speaker is to compose a message or a symbolic representation ...
0
Metrics
Total Citations0
1
Supplementary Material
Additional material
research-article
December 2021
NeRV: neural representations for videos
NIPS '21: Proceedings of the 35th International Conference on Neural Information Processing SystemsArticle No.: 1649, Pages 21557–21568

We propose a novel neural representation for videos (NeRV) which encodes videos in neural networks. Unlike conventional representations that treat videos as frame sequences, we represent videos as neural networks taking frame index as input. Given a ...
0
Metrics
Total Citations0
1
Supplementary Material
Additional material
research-article
May 2021
No-frills Dynamic Planning using Static Planners
2021 IEEE International Conference on Robotics and Automation (ICRA)Pages 2005–2011https://doi.org/10.1109/ICRA48506.2021.9560762
In this paper, we address the task of interacting with dynamic environments where the changes in the environment are independent of the agent. We study this through the context of trapping a moving ball with a UR5 robotic arm. Our key contribution is an ...
0
Metrics
Total Citations0
Article
August 2020
Quantization Guided JPEG Artifact Correction
Computer Vision – ECCV 2020Pages 293–309https://doi.org/10.1007/978-3-030-58598-3_18
Abstract
The JPEG image compression algorithm is the most popular method of image compression because of it’s ability for large compression ratios. However, to achieve such high compression, information is lost. For aggressive quantization settings, this ...
9
Metrics
Total Citations9

Search Results

Applied Filters

Publication Date

People

Authors

Institutions

Publications

Journal/Magazine Names

All Publications

Content Type

Supplemental Material Type

Publisher

Proceedings Series

ACM SIG Sponsors

Results

Quantifying NBA Shot Quality: A Deep Network Approach

Investigating Style Similarity in Diffusion Models

Do Text-Free Diffusion Models Learn Discriminative Visual Representations?

Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models

Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics

EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS

LEIA: Latent View-Invariant Embeddings for Implicit 3D Articulation

Trajectory-Aligned Space-Time Tokens for Few-Shot Action Recognition

Fast Encoding and Decoding for Implicit Video Representation

LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors

Video dynamics prior: an internal learning approach for robust video enhancements

Leveraging Hand-Object Interactions in Assistive Egocentric Vision

Neural Space-Filling Curves

Burn After Reading: Online Adaptation for Cross-domain Streaming Data

Improving Closed and Open-Vocabulary Attribute Prediction Using Transformers

Learning Semantic Correspondence with Sparse Annotations

PatchGame: learning to signal mid-level patches in referential games

NeRV: neural representations for videos

No-frills Dynamic Planning using Static Planners

Quantization Guided JPEG Artifact Correction