Author: Agrawala, Maneesh : Search

research-article

Block and Detail: Scaffolding Sketch-to-Image Generation

UIST '24: Proceedings of the 37th Annual ACM Symposium on User Interface Software and TechnologyArticle No.: 33, Pages 1–13https://doi.org/10.1145/3654777.3676444

We introduce a novel sketch-to-image tool that aligns with the iterative refinement process of artists. Our tool lets users sketch blocking strokes to coarsely represent the placement and form of objects and detail strokes to refine their shape and ...

research-article

ScriptViz: A Visualization Tool to Aid Scriptwriting based on a Large Movie Database

UIST '24: Proceedings of the 37th Annual ACM Symposium on User Interface Software and TechnologyArticle No.: 21, Pages 1–13https://doi.org/10.1145/3654777.3676402

Scriptwriters usually rely on their mental visualization to create a vivid story by using their imagination to see, feel, and experience the scenes they are writing. Besides mental visualization, they often refer to existing images or scenes in movies ...

Article

FlashTex: Fast Relightable Mesh Texturing with LightControlNet

Computer Vision – ECCV 2024Pages 90–107https://doi.org/10.1007/978-3-031-73383-3_6

Abstract

Manually creating textures for 3D meshes is time-consuming, even for expert visual content creators. We propose a fast approach for automatically texturing an input 3D mesh based on a user-provided text prompt. Importantly, our approach ...

Article

SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models

Computer Vision – ECCV 2024Pages 330–348https://doi.org/10.1007/978-3-031-72946-1_19

Abstract

The development of text-to-video (T2V), i.e., generating videos with a given text prompt, has been significantly advanced in recent years. However, relying solely on text prompts often results in ambiguous frame composition due to spatial ...

course

Generative Models for Visual Content Editing and Creation

SIGGRAPH Courses '24: ACM SIGGRAPH 2024 CoursesArticle No.: 13, Pages 1–6https://doi.org/10.1145/3664475.3664553

Welcome to the SIGGRAPH 2024 Course on Generative Models for Visual Content Editing and Creation! In this course, you will embark on an exciting journey into the realm of generative models and their groundbreaking applications in computer graphics. Over ...

research-article

Transparent Image Layer Diffusion using Latent Transparency

ACM Transactions on Graphics (TOG), Volume 43, Issue 4Article No.: 100, Pages 1–15https://doi.org/10.1145/3658150

We present an approach enabling large-scale pretrained latent diffusion models to generate transparent images. The method allows generation of single transparent images or of multiple transparent layers. The method learns a "latent transparency" that ...

research-article

A Unified Differentiable Boolean Operator with Fuzzy Logic

SIGGRAPH '24: ACM SIGGRAPH 2024 Conference PapersArticle No.: 109, Pages 1–9https://doi.org/10.1145/3641519.3657484

This paper presents a unified differentiable boolean operator for implicit solid shape modeling using Constructive Solid Geometry (CSG). Traditional CSG relies on min, max operators to perform boolean operations on implicit shapes. But because these ...

research-article

Open Access

STIVi: Turning Perspective Sketching Videos into Interactive Tutorials

GI '24: Proceedings of the 50th Graphics Interface ConferenceArticle No.: 16, Pages 1–13https://doi.org/10.1145/3670947.3670969

For design and art enthusiasts who seek to enhance their skills through instructional videos, following drawing instructions while practicing can be challenging. STIVi presents perspective drawing demonstrations and commentary of prerecorded ...

research-article

Bridging the Gulf of Envisioning: Cognitive Challenges in Prompt Based Interactions with LLMs

CHI '24: Proceedings of the 2024 CHI Conference on Human Factors in Computing SystemsArticle No.: 1039, Pages 1–19https://doi.org/10.1145/3613904.3642754

Large language models (LLMs) exhibit dynamic capabilities and appear to comprehend complex and ambiguous natural language prompts. However, calibrating LLM interactions is challenging for interface designers and end-users alike. A central issue is our ...

research-article

Editing Motion Graphics Video via Motion Vectorization and Transformation

ACM Transactions on Graphics (TOG), Volume 42, Issue 6Article No.: 229, Pages 1–13https://doi.org/10.1145/3618316

Motion graphics videos are widely used in Web design, digital advertising, animated logos and film title sequences, to capture a viewer's attention. But editing such video is challenging because the video provides a low-level sequence of pixels and ...

research-article

EC: A Tool for Guiding Chart and Caption Emphasis<sc/><sc/>

IEEE Transactions on Visualization and Computer Graphics (ITVC), Volume 30, Issue 1Pages 120–130https://doi.org/10.1109/TVCG.2023.3327150

Recent work has shown that when both the chart and caption emphasize the same aspects of the data, readers tend to remember the doubly-emphasized features as takeaways; when there is a mismatch, readers rely on the chart to form takeaways and can miss ...

research-article

Open Access

TaleStream: Supporting Story Ideation with Trope Knowledge

UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and TechnologyArticle No.: 52, Pages 1–12https://doi.org/10.1145/3586183.3606807

Story ideation is a critical part of the story-writing process. It is challenging to support computationally due to its exploratory and subjective nature. Tropes, which are recurring narrative elements across stories, are essential in stories as they ...

research-article

Automated Conversion of Music Videos into Lyric Videos

UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and TechnologyArticle No.: 13, Pages 1–11https://doi.org/10.1145/3586183.3606757

Musicians and fans often produce lyric videos, a form of music videos that showcase the song’s lyrics, for their favorite songs. However, making such videos can be challenging and time-consuming as the lyrics need to be added in synchrony and visual ...

article

Open Access

Seminal Paper

Interactive digital photomontage

ACM Transactions on Graphics (TOG), Volume 23, Issue 3Pages 294–302https://doi.org/10.1145/1015706.1015718

We describe an interactive, computer-assisted framework for combining parts of a set of photographs into a single composite picture, a process we call "digital photomontage." Our framework makes use of two techniques primarily: graph-cut optimization, ...

Also Published in:

Seminal Graphics Papers: Pushing the Boundaries, Volume 2: ISBN 9798400708978, August 2023

research-article

Open Access

SlideSpecs: Automatic and Interactive Presentation Feedback Collation

IUI '23: Proceedings of the 28th International Conference on Intelligent User InterfacesPages 695–709https://doi.org/10.1145/3581641.3584035

Presenters often collect audience feedback through practice talks to refine their presentations. In formative interviews, we find that although text feedback and verbal discussions allow presenters to receive feedback, organizing that feedback into ...

research-article

Open Access

Sketch-Based Design of Foundation Paper Pieceable Quilts

UIST '22: Proceedings of the 35th Annual ACM Symposium on User Interface Software and TechnologyArticle No.: 21, Pages 1–11https://doi.org/10.1145/3526113.3545643

Foundation paper piecing is a widely used quilt-making technique in which fabric pieces are sewn onto a paper guide to facilitate construction. But, designing paper pieceable quilt patterns is challenging because the sewing process imposes constraints ...

proceeding

UIST '22 Adjunct: Adjunct Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology

proceeding

UIST '22: Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology

research-article

Modular information flow through ownership

PLDI 2022: Proceedings of the 43rd ACM SIGPLAN International Conference on Programming Language Design and ImplementationPages 1–14https://doi.org/10.1145/3519939.3523445

Statically analyzing information flow, or how data influences other data within a program, is a challenging task in imperative languages. Analyzing pointers and mutations requires access to a program's complete source. However, programs often use pre-...

research-article

Open Access

Automated Accessory Rigs for Layered 2D Character Illustrations

UIST '21: The 34th Annual ACM Symposium on User Interface Software and TechnologyPages 1100–1108https://doi.org/10.1145/3472749.3474809

Mix-and-match character creation tools enable users to quickly produce 2D character illustrations by combining various predefined accessories, like clothes and hairstyles, which are represented as separate, interchangeable artwork layers. However, these ...

Applied Filters

People

Names

Institutions

Authors

Editors

Advisors

Reviewers

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Paper Award

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Reproducibility Badges

Publication Date

Save to Binder

Upcoming Conferences

Also Published in: