More Web Proxy on the site http://driver.im/

research-article

A Framework for content-adaptive photo manipulation macros: Application to face, landscape, and global manipulations

Authors:

Floraine Berthouzoz,

Mira Dontcheva,

Maneesh AgrawalaAuthors Info & Claims

ACM Transactions on Graphics (TOG), Volume 30, Issue 5

Article No.: 120, Pages 1 - 14

https://doi.org/10.1145/2019627.2019639

Published: 22 October 2011 Publication History

Abstract

We present a framework for generating content-adaptive macros that can transfer complex photo manipulations to new target images. We demonstrate applications of our framework to face, landscape, and global manipulations. To create a content-adaptive macro, we make use of multiple training demonstrations. Specifically, we use automated image labeling and machine learning techniques to learn the dependencies between image features and the parameters of each selection, brush stroke, and image processing operation in the macro. Although our approach is limited to learning manipulations where there is a direct dependency between image features and operation parameters, we show that our framework is able to learn a large class of the most commonly used manipulations using as few as 20 training demonstrations. Our framework also provides interactive controls to help macro authors and users generate training demonstrations and correct errors due to incorrect labeling or poor parameter estimation. We ask viewers to compare images generated using our content-adaptive macros with and without corrections to manually generated ground-truth images and find that they consistently rate both our automatic and corrected results as close in appearance to the ground truth. We also evaluate the utility of our proposed macro generation workflow via a small informal lab study with professional photographers. The study suggests that our workflow is effective and practical in the context of real-world photo editing.

Supplementary Material

berthouzoz (berthouzoz.zip)

Supplemental movie and image files for, A Framework for content-adaptive photo manipulation macros: Application to face, landscape, and global manipulations

Download
22.88 MB

MP4 File (tp210_12.mp4)

Download
23.54 MB

References

[1]

Amini, A., Curwen, R., and Gore, J. 1996. Snakes and splines for tracking non-rigid heart motion. In Proceedings of ECCV. 249--261.

Digital Library

[2]

Bae, S., Paris, S., and Durand, F. 2006. Two-scale tone management for photographic look. In Proceedings of ACM Trans. Graph. 25, 3, 637--645.

Digital Library

[3]

Bitouk, D., Kumar, N., Dhillon, S., Belhumeur, P., and Nayar, S. 2008. Face swapping: Automatically replacing faces in photographs. Trans. graph. 27, 3.

Digital Library

[4]

Bolin, M., Webber, M., Rha, P., Wilson, T., and Miller, R. C. 2005. Automation and customization of rendered web pages. In Proceedings of the UIST Symposium. 163--172.

Digital Library

[5]

Cypher, A. and Halbert, D. 1993. Watch What I Do: Programming by Demonstration. MIT Press.

Digital Library

[6]

Dewdney, A. 1989. A potpourri of programmed prose and prosody. Scientific Amer.

[7]

Drori, I., Cohen-Or, D., and Yeshurun, H. 2003. Example-based style synthesis. In In Proceedings of the Conference on Computer Vision and Pattern Recognition. 143--150.

[8]

Efron, B., Hastie, T., Johnstone, I., and Tibshirani, R. 2004. Least angle regression. In Annals of Statistics, 407--451.

[9]

Efros, A. and Freeman, W. 2001. Image quilting for texture synthesis and transfer. In Proceedings of the SIGGRAPH Conference. 341--346.

Digital Library

[10]

Felzenszwalb, P., McAllester, D., and Ramanan, D. 2008. A discriminatively trained, multiscale, deformable part model. In Proceedings of the CVPR Conference.

[11]

Grabler, F., Agrawala, M., Li, W., Dontcheva, M., and Igarashi, T. 2009. Generating photo manipulation tutorials by demonstration. ACM Trans. Graph. 28, 3, 66.

Digital Library

[12]

Guo, D. and Sim, T. 2009. Digital face makeup by example. In Proceedings of the Computer Vision and Pattern Recognition Conference. IEEE Computer Society, 73--79.

[13]

Hasinoff, S., Józwiak, M., Durand, F., and Freeman, W. 2010. Search-and-replace editing for personal photo collections. In Proceedings of the ICCP. 2. 8.

[14]

Hertzmann, A., Jacobs, C., Oliver, N., Curless, B., and Salesin, D. 2001. Image analogies. In Proceedings of the SIGGRAPH Conference. 327--340.

Digital Library

[15]

Hertzmann, A., Oliver, N., Curless, B., and Seitz, S. 2002. Curve analogies. In Proceedings of the Eurographics Workshop on Rendering. 233--246.

Digital Library

[16]

Hoiem, D., Efros, A., and Hebert, M. 2005. Geometric context from a single image. In Proceedings of the ICCV. 654--661.

Digital Library

[17]

Huggins, B. 2005. Photoshop: Retouching Cookbook for Digital Photographers. O'Reilly.

Digital Library

[18]

Jones, M. and Rehg, J. 2002. Statistical color models with application to skin detection. Int. J. Comput. Vision 46, 1, 81--96.

Digital Library

[19]

Kalnins, R., Markosian, L., Meier, B., Kowalski, M., Lee, J., Davidson, P., Webb, M., Hughes, J., and Finkelstein, A. 2002. WYSIWYG NPR: Drawing strokes directly on 3D models. ACM Trans. Graph. 21, 3, 755--762.

Digital Library

[20]

Kang, S., Kapoor, A., and Lischinski, D. 2010. Personalization of image enhancement. In Proceedings of the CVPR.

[21]

Kass, M., Witkin, A., and Terzopoulos, D. 1988. Snakes: Active contour models. Int. J. comput. Vis. 1, 4, 321--331.

[22]

Kelby, S. 2007. The Adobe Photoshop CS3 Book for Digital Photographers. Voices That Matter.

Digital Library

[23]

Kurlander, D. and Feiner, S. 1992. A history-based macro by example system. In Proceedings of the UIST Symposium. 99--106.

Digital Library

[24]

Lau, T., Bergman, L., Castelli, V., and Oblinger, D. 2004. Sheepdog: Learning procedures for technical support. In Proceedings of the IUI Conference. 109--116.

Digital Library

[25]

Lewis, D. 1998. Naive (Bayes) at forty: The independence assumption in information retrieval. In Proceedings of the ECML Conference. 8, 4--15.

Digital Library

[26]

Lieberman, H. 1993. Mondrian: A teachable graphical editor. In Watch What I Do: Programming by Demonstration, 341--358.

Digital Library

[27]

Lieberman, H. 2001. Your Wish is My Command: Giving Users the Power to Instruct their Software. Morgan Kaufmann.

[28]

Little, G., Lau, T., Cypher, A., Lin, J., Haber, E., and Kandogan, E. 2007. Koala: Capture, share, automate, personalize business processes on the web. In Proceedings of the CHI. 943--946.

Digital Library

[29]

Liu, Z., Shan, Y., and Zhang, Z. 2001. Expressive expression mapping with ratio images. In Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques. ACM, 276.

Digital Library

[30]

Modugno, F. and Myers, B. 1994. Pursuit: Graphically representing programs in a demonstrational visual shell. In Proceedings of the CHI. 455--456.

Digital Library

[31]

Nguyen, M., Lalonde, J., Efros, A., and De la Torre, F. 2008. Image-based shaving. Comput. Graph. Forum. 27, 627--635.

[32]

Reinhard, E., Ashikhmin, M., Gooch, B., and Shirley, P. 2001. Color transfer between images. IEEE Comput. Graph. Appl. 34--41.

Digital Library

[33]

Schwarz, D. 2005. Current research in concatenative sound synthesis. In Proceedings of the ICMC. 9--12.

[34]

Simhon, S. and Dudek, G. 2003. Curve Synthesis from Learned Refinement Models. http://www.clm.mcgill.ca/saol/pubs/eq03.pdf.

[35]

Zhou, Y., Gu, L., and Zhang, H. 2003. Bayesian tangent shape model: Estimating shape and pose parameters via bayesian inference. In Proceedings of the CVPR Conference. 109--116.

Digital Library

Cited By

Kaur ANoori Hoshyar ASaikrishna VFirmin SXia F(2024)Deepfake video detection: challenges and opportunitiesArtificial Intelligence Review10.1007/s10462-024-10810-657:6Online publication date: 29-May-2024
https://doi.org/10.1007/s10462-024-10810-6
Akhtar Z(2023)Deepfakes Generation and Detection: A Short SurveyJournal of Imaging10.3390/jimaging90100189:1(18)Online publication date: 13-Jan-2023
https://doi.org/10.3390/jimaging9010018
Riyaz MIgnacimuthu S(2023)Smart phone-macro lens setup (SPMLS): a low-cost and portable photography device for amateur taxonomists, biodiversity researchers, and citizen enthusiastsBulletin of the National Research Centre10.1186/s42269-023-01120-y47:1Online publication date: 6-Oct-2023
https://doi.org/10.1186/s42269-023-01120-y
Show More Cited By

Index Terms

A Framework for content-adaptive photo manipulation macros: Application to face, landscape, and global manipulations
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Image and video acquisition
        3D imaging
  2. Computer graphics
    1. Animation

Recommendations

Generating photo manipulation tutorials by demonstration

We present a demonstration-based system for automatically generating succinct step-by-step visual tutorials of photo manipulations. An author first demonstrates the manipulation using an instrumented version of GIMP that records all changes in interface ...
Generating photo manipulation tutorials by demonstration
SIGGRAPH '09: ACM SIGGRAPH 2009 papers

We present a demonstration-based system for automatically generating succinct step-by-step visual tutorials of photo manipulations. An author first demonstrates the manipulation using an instrumented version of GIMP that records all changes in interface ...
Photo Restoration and Retouching Using Corel Paint Shop Pro Photo

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Graphics

ACM Transactions on Graphics Volume 30, Issue 5

October 2011

198 pages

ISSN:0730-0301

EISSN:1557-7368

DOI:10.1145/2019627

Issue’s Table of Contents

Copyright © 2011 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 October 2011

Accepted: 01 April 2011

Revised: 01 February 2011

Received: 01 September 2010

Published in TOG Volume 30, Issue 5

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed

Funding Sources

National Science Foundation

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

45
Total Citations
View Citations
2,684
Total Downloads

Downloads (Last 12 months)14
Downloads (Last 6 weeks)2

Reflects downloads up to 16 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Kaur ANoori Hoshyar ASaikrishna VFirmin SXia F(2024)Deepfake video detection: challenges and opportunitiesArtificial Intelligence Review10.1007/s10462-024-10810-657:6Online publication date: 29-May-2024
https://doi.org/10.1007/s10462-024-10810-6
Akhtar Z(2023)Deepfakes Generation and Detection: A Short SurveyJournal of Imaging10.3390/jimaging90100189:1(18)Online publication date: 13-Jan-2023
https://doi.org/10.3390/jimaging9010018
Riyaz MIgnacimuthu S(2023)Smart phone-macro lens setup (SPMLS): a low-cost and portable photography device for amateur taxonomists, biodiversity researchers, and citizen enthusiastsBulletin of the National Research Centre10.1186/s42269-023-01120-y47:1Online publication date: 6-Oct-2023
https://doi.org/10.1186/s42269-023-01120-y
Gokbudak FOztireli A(2023)One-shot Detail Retouching with Patch Space Neural Transformation BlendingProceedings of the 20th ACM SIGGRAPH European Conference on Visual Media Production10.1145/3626495.3626499(1-10)Online publication date: 30-Nov-2023
https://dl.acm.org/doi/10.1145/3626495.3626499
Karacan LAkata ZErdem AErdem E(2019)Manipulating Attributes of Natural Scenes via HallucinationACM Transactions on Graphics10.1145/336831239:1(1-17)Online publication date: 26-Nov-2019
https://dl.acm.org/doi/10.1145/3368312
Ma SWei ZTian FFan XZhang JShen XLin ZHuang JMěch RSamaras DWang HBrewster SFitzpatrick GCox AKostakos V(2019)SmartEyeProceedings of the 2019 CHI Conference on Human Factors in Computing Systems10.1145/3290605.3300701(1-12)Online publication date: 2-May-2019
https://dl.acm.org/doi/10.1145/3290605.3300701
Yang HWang BVesdapunt NGuo MKang S(2019)Personalized Exposure Control Using Adaptive Metering and Reinforcement LearningIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2018.286555525:10(2953-2968)Online publication date: 1-Oct-2019
https://doi.org/10.1109/TVCG.2018.2865555
Hennessey JLi WRussell BShechtman EMitra N(2017)Transferring image-based edits for multi-channel compositingACM Transactions on Graphics10.1145/3130800.313084236:6(1-16)Online publication date: 20-Nov-2017
https://dl.acm.org/doi/10.1145/3130800.3130842
Chandakkar PLi B(2017)Joint Regression and Ranking for Image Enhancement2017 IEEE Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV.2017.33(235-243)Online publication date: Mar-2017
https://doi.org/10.1109/WACV.2017.33
LIU YLIU MLIU BSUN JLIU X(2016)Laplace operator based multi-channel image filters learningJournal of Advanced Mechanical Design, Systems, and Manufacturing10.1299/jamdsm.2016jamdsm009810:8(JAMDSM0098-JAMDSM0098)Online publication date: 2016
https://doi.org/10.1299/jamdsm.2016jamdsm0098
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Issue’s Table of Contents