Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleOctober 2024
Block and Detail: Scaffolding Sketch-to-Image Generation
UIST '24: Proceedings of the 37th Annual ACM Symposium on User Interface Software and TechnologyArticle No.: 33, Pages 1–13https://doi.org/10.1145/3654777.3676444We introduce a novel sketch-to-image tool that aligns with the iterative refinement process of artists. Our tool lets users sketch blocking strokes to coarsely represent the placement and form of objects and detail strokes to refine their shape and ...
- research-articleOctober 2024
ScriptViz: A Visualization Tool to Aid Scriptwriting based on a Large Movie Database
UIST '24: Proceedings of the 37th Annual ACM Symposium on User Interface Software and TechnologyArticle No.: 21, Pages 1–13https://doi.org/10.1145/3654777.3676402Scriptwriters usually rely on their mental visualization to create a vivid story by using their imagination to see, feel, and experience the scenes they are writing. Besides mental visualization, they often refer to existing images or scenes in movies ...
- ArticleNovember 2024
FlashTex: Fast Relightable Mesh Texturing with LightControlNet
AbstractManually creating textures for 3D meshes is time-consuming, even for expert visual content creators. We propose a fast approach for automatically texturing an input 3D mesh based on a user-provided text prompt. Importantly, our approach ...
- ArticleOctober 2024
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
AbstractThe development of text-to-video (T2V), i.e., generating videos with a given text prompt, has been significantly advanced in recent years. However, relying solely on text prompts often results in ambiguous frame composition due to spatial ...
- courseAugust 2024
Generative Models for Visual Content Editing and Creation
SIGGRAPH Courses '24: ACM SIGGRAPH 2024 CoursesArticle No.: 13, Pages 1–6https://doi.org/10.1145/3664475.3664553Welcome to the SIGGRAPH 2024 Course on Generative Models for Visual Content Editing and Creation! In this course, you will embark on an exciting journey into the realm of generative models and their groundbreaking applications in computer graphics. Over ...
-
- research-articleJuly 2024
Transparent Image Layer Diffusion using Latent Transparency
ACM Transactions on Graphics (TOG), Volume 43, Issue 4Article No.: 100, Pages 1–15https://doi.org/10.1145/3658150We present an approach enabling large-scale pretrained latent diffusion models to generate transparent images. The method allows generation of single transparent images or of multiple transparent layers. The method learns a "latent transparency" that ...
- research-articleJuly 2024
A Unified Differentiable Boolean Operator with Fuzzy Logic
- Hsueh-Ti Derek Liu,
- Maneesh Agrawala,
- Cem Yuksel,
- Tim Omernick,
- Vinith Misra,
- Stefano Corazza,
- Morgan Mcguire,
- Victor Zordan
SIGGRAPH '24: ACM SIGGRAPH 2024 Conference PapersArticle No.: 109, Pages 1–9https://doi.org/10.1145/3641519.3657484This paper presents a unified differentiable boolean operator for implicit solid shape modeling using Constructive Solid Geometry (CSG). Traditional CSG relies on min, max operators to perform boolean operations on implicit shapes. But because these ...
- research-articleSeptember 2024
STIVi: Turning Perspective Sketching Videos into Interactive Tutorials
- Capucine Nghiem,
- Adrien Bousseau,
- Mark Sypesteyn,
- Jan Willem Hoftijzer,
- Maneesh Agrawala,
- Theophanis Tsandilas
GI '24: Proceedings of the 50th Graphics Interface ConferenceArticle No.: 16, Pages 1–13https://doi.org/10.1145/3670947.3670969For design and art enthusiasts who seek to enhance their skills through instructional videos, following drawing instructions while practicing can be challenging. STIVi presents perspective drawing demonstrations and commentary of prerecorded ...
- research-articleMay 2024
Bridging the Gulf of Envisioning: Cognitive Challenges in Prompt Based Interactions with LLMs
CHI '24: Proceedings of the 2024 CHI Conference on Human Factors in Computing SystemsArticle No.: 1039, Pages 1–19https://doi.org/10.1145/3613904.3642754Large language models (LLMs) exhibit dynamic capabilities and appear to comprehend complex and ambiguous natural language prompts. However, calibrating LLM interactions is challenging for interface designers and end-users alike. A central issue is our ...
- research-articleDecember 2023
Editing Motion Graphics Video via Motion Vectorization and Transformation
ACM Transactions on Graphics (TOG), Volume 42, Issue 6Article No.: 229, Pages 1–13https://doi.org/10.1145/3618316Motion graphics videos are widely used in Web design, digital advertising, animated logos and film title sequences, to capture a viewer's attention. But editing such video is challenging because the video provides a low-level sequence of pixels and ...
- research-articleNovember 2023
EC: A Tool for Guiding Chart and Caption Emphasis<sc/><sc/>
IEEE Transactions on Visualization and Computer Graphics (ITVC), Volume 30, Issue 1Pages 120–130https://doi.org/10.1109/TVCG.2023.3327150Recent work has shown that when both the chart and caption emphasize the same aspects of the data, readers tend to remember the doubly-emphasized features as takeaways; when there is a mismatch, readers rely on the chart to form takeaways and can miss ...
- research-articleOctober 2023
TaleStream: Supporting Story Ideation with Trope Knowledge
UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and TechnologyArticle No.: 52, Pages 1–12https://doi.org/10.1145/3586183.3606807Story ideation is a critical part of the story-writing process. It is challenging to support computationally due to its exploratory and subjective nature. Tropes, which are recurring narrative elements across stories, are essential in stories as they ...
- research-articleOctober 2023
Automated Conversion of Music Videos into Lyric Videos
UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and TechnologyArticle No.: 13, Pages 1–11https://doi.org/10.1145/3586183.3606757Musicians and fans often produce lyric videos, a form of music videos that showcase the song’s lyrics, for their favorite songs. However, making such videos can be challenging and time-consuming as the lyrics need to be added in synchrony and visual ...
- articleAugust 2004Seminal Paper
Interactive digital photomontage
- Aseem Agarwala,
- Mira Dontcheva,
- Maneesh Agrawala,
- Steven Drucker,
- Alex Colburn,
- Brian Curless,
- David Salesin,
- Michael Cohen
ACM Transactions on Graphics (TOG), Volume 23, Issue 3Pages 294–302https://doi.org/10.1145/1015706.1015718We describe an interactive, computer-assisted framework for combining parts of a set of photographs into a single composite picture, a process we call "digital photomontage." Our framework makes use of two techniques primarily: graph-cut optimization, ...
Also Published in:
Seminal Graphics Papers: Pushing the Boundaries, Volume 2: ISBN 9798400708978, August 2023 - research-articleMarch 2023
SlideSpecs: Automatic and Interactive Presentation Feedback Collation
IUI '23: Proceedings of the 28th International Conference on Intelligent User InterfacesPages 695–709https://doi.org/10.1145/3581641.3584035Presenters often collect audience feedback through practice talks to refine their presentations. In formative interviews, we find that although text feedback and verbal discussions allow presenters to receive feedback, organizing that feedback into ...
- research-articleOctober 2022
Sketch-Based Design of Foundation Paper Pieceable Quilts
UIST '22: Proceedings of the 35th Annual ACM Symposium on User Interface Software and TechnologyArticle No.: 21, Pages 1–11https://doi.org/10.1145/3526113.3545643Foundation paper piecing is a widely used quilt-making technique in which fabric pieces are sewn onto a paper guide to facilitate construction. But, designing paper pieceable quilt patterns is challenging because the sewing process imposes constraints ...
Modular information flow through ownership
PLDI 2022: Proceedings of the 43rd ACM SIGPLAN International Conference on Programming Language Design and ImplementationPages 1–14https://doi.org/10.1145/3519939.3523445Statically analyzing information flow, or how data influences other data within a program, is a challenging task in imperative languages. Analyzing pointers and mutations requires access to a program's complete source. However, programs often use pre-...
- research-articleOctober 2021
Automated Accessory Rigs for Layered 2D Character Illustrations
UIST '21: The 34th Annual ACM Symposium on User Interface Software and TechnologyPages 1100–1108https://doi.org/10.1145/3472749.3474809Mix-and-match character creation tools enable users to quickly produce 2D character illustrations by combining various predefined accessories, like clothes and hairstyles, which are represented as separate, interchangeable artwork layers. However, these ...