[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3474085.3475711acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Learning to Compose Stylistic Calligraphy Artwork with Emotions

Published: 17 October 2021 Publication History

Abstract

Emotion plays a critical role in calligraphy composition, which makes the calligraphy artwork impressive and have a soul. However, previous research on calligraphy generation all neglected the emotion as a major contributor to the artistry of calligraphy. Such defects prevent them from generating aesthetic, stylistic, and diverse calligraphy artworks, but only static handwriting font library instead. To address this problem, we propose a novel cross-modal approach to generate stylistic and diverse Chinese calligraphy artwork driven by different emotions automatically. We firstly detect the emotions in the text by a classifier, then generate the emotional Chinese character images via a novel modified Generative Adversarial Network (GAN) structure, finally we predict the layout for all character images with a recurrent neural network. We also collect a large-scale stylistic Chinese calligraphy image dataset with rich emotions. Experimental results demonstrate that our model outperforms all baseline image translation models significantly for different emotional styles in terms of content accuracy and style discrepancy. Besides, our layout algorithm can also learn the patterns and habits of calligrapher, and makes the generated calligraphy more artistic. To the best of our knowledge, we are the first to work on emotion-driven discourse-level Chinese calligraphy artwork composition.

References

[1]
Bo Chang, Qiong Zhang, Shenyi Pan, and Lili Meng. 2018. Generating handwritten chinese characters using cyclegan. In 2018 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, 199--207.
[2]
Jie Chang, Yujun Gu, and Ya Zhang. 2017. Chinese typeface transformation with hierarchical adversarial network. arXiv preprint arXiv:1711.06448 (2017).
[3]
Huimin Chen, Xiaoyuan Yi, Maosong Sun, Wenhao Li, Cheng Yang, and Zhipeng Guo. 2019. Sentiment-controllable Chinese poetry generation. In Proceedings of the 28th International Joint Conference on Artificial Intelligence. AAAI Press, 4925--4931.
[4]
Yunjey Choi, Minje Choi, Munyoung Kim, Jung-Woo Ha, Sunghun Kim, and Jaegul Choo. 2018. Stargan: Unified generative adversarial networks for multidomain image-to-image translation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 8789--8797.
[5]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.
[6]
Vincent Dumoulin, Jonathon Shlens, and Manjunath Kudlur. 2016. A learned representation for artistic style. arXiv preprint arXiv:1610.07629 (2016).
[7]
Yiming Gao and Jiangqin Wu. 2018. CalliGAN: Unpaired Mutli-chirography Chinese Calligraphy Image Translation. In Asian Conference on Computer Vision. Springer, 334--348.
[8]
Yiming Gao and Jiangqin Wu. 2020. GAN-Based Unpaired Chinese Character Image Translation via Skeleton Transformation and Stroke Rendering. In Proceedings of the AAAI Conference on Artificial Intelligence.
[9]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in neural information processing systems. 2672--2680.
[10]
Xun Huang, Ming-Yu Liu, Serge Belongie, and Jan Kautz. 2018. Multimodal unsupervised image-to-image translation. In Proceedings of the European Conference on Computer Vision (ECCV). 172--189.
[11]
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, and Alexei A Efros. 2017. Image-toimage translation with conditional adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1125--1134.
[12]
Yue Jiang, Zhouhui Lian, Yingmin Tang, and Jianguo Xiao. 2017. DCFont: an endto- end deep Chinese font generation system. In SIGGRAPH Asia 2017 Technical Briefs. 1--4.
[13]
Yue Jiang, Zhouhui Lian, Yingmin Tang, and Jianguo Xiao. 2019. SCFont: Structure-Guided Chinese Font Generation via Deep Stacked Networks. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 4015--4022.
[14]
Junho Kim, Minjae Kim, Hyeonwoo Kang, and Kwang Hee Lee. 2020. U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation. In International Conference on Learning Representations. https://openreview.net/forum?id=BJlZ5ySKPH
[15]
Diederik P Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. International Conference on Learning Representations (ICLR) (2015).
[16]
Hsin-Ying Lee, Hung-Yu Tseng, Jia-Bin Huang, Maneesh Singh, and Ming-Hsuan Yang. 2018. Diverse image-to-image translation via disentangled representations. In Proceedings of the European Conference on Computer Vision (ECCV). 35--51.
[17]
Ming-Yu Liu, Thomas Breuel, and Jan Kautz. 2017. Unsupervised image-to-image translation networks. In Advances in neural information processing systems. 700-- 708.
[18]
Pengyuan Lyu, Xiang Bai, Cong Yao, Zhen Zhu, Tengteng Huang, andWenyu Liu. 2017. Auto-encoder guided GAN for Chinese calligraphy synthesis. In 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Vol. 1. IEEE, 1095--1100.
[19]
Pavlo Melnyk, Zhiqiang You, and Keqin Li. 2018. A high-performance CNN method for offline handwritten Chinese character recognition and visualization. Soft Computing (2018), 1--11.
[20]
Yiping Meng, Fan Tang, Weiming Dong, and Xiaopeng Zhang. 2016. Optimal character composing for Chinese calligraphic artwork. In SIGGRAPH ASIA 2016 Posters. 1--2.
[21]
Mehdi Mirza and Simon Osindero. 2014. Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784 (2014).
[22]
Wanqiong Pan, Zhouhui Lian, Rongju Sun, Yingmin Tang, and Jianguo Xiao. 2014. Flexifont: a flexible system to generate personal font libraries. In Proceedings of the 2014 ACM symposium on Document engineering. ACM, 17--20.
[23]
Réjean Plamondon, Wacef Guerfali, and Xiaolin Li. 1998. The generation of oriental characters: new perspectives for automatic handwriting processing. International journal of pattern recognition and artificial intelligence 12, 01 (1998), 31--44.
[24]
Maxim Shcherbakov, Adriaan Brebels, N.L. Shcherbakova, Anton Tyukov, T.A. Janovsky, and V.A. Kamaev. 2013. A survey of forecast error measures. World Applied Sciences Journal 24 (01 2013), 171--176. https://doi.org/10.5829/idosi.wasj. 2013.24.itmies.80032
[25]
Xiongbo Shi. 2017. The embodied art: an aesthetics of Chinese calligraphy. (2017).
[26]
Danyang Sun, Tongzheng Ren, Chongxuan Li, Hang Su, and Jun Zhu. 2018. Learning to write stylized Chinese characters by reading a handful of examples. In Proceedings of the 27th International Joint Conference on Artificial Intelligence. 920--927.
[27]
Donghui Sun, Qing Zhang, and Jun Yang. 2018. Pyramid Embedded Generative Adversarial Network for Automated Font Generation. In 2018 24th International Conference on Pattern Recognition (ICPR). IEEE, 976--981.
[28]
Vincent Tam and KW Yeung. 2010. Learning to write Chinese characters with correct stroke sequences on mobile devices. In 2010 2nd International Conference on Education Technology and Computer, Vol. 4. IEEE, V4--395.
[29]
Yuchen Tian. 2016. Rewrite: Neural Style Transfer For Chinese Fonts. https: //github.com/kaonashi-tyc/Rewrite.
[30]
Yuchen Tian. 2017. Master chinese calligraphy with conditional adversarial networks. https://github.com/kaonashi-tyc/zi2zi.
[31]
Dmitry Ulyanov, Andrea Vedaldi, and Victor Lempitsky. 2017. Improved texture networks: Maximizing quality and diversity in feed-forward stylization and texture synthesis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 6924--6932.
[32]
Helena TFWong and Horace HS Ip. 2000. Virtual brush: a model-based synthesis of Chinese calligraphy. Computers & Graphics 24, 1 (2000), 99--113.
[33]
Shan-Jean Wu, Chih-Yuan Yang, and Jane Yung-jen Hsu. 2020. CalliGAN: Style and Structure-aware Chinese Calligraphy Character Generator. arXiv preprint arXiv:2005.12500 (2020).
[34]
Songhua Xu, Tao Jin, Hao Jiang, and Francis CM Lau. 2009. Automatic generation of personal chinese handwriting by capturing the characteristics of personal handwriting. In Twenty-First IAAI Conference.
[35]
Songhua Xu, Francis CM Lau, William K Cheung, and Yunhe Pan. 2005. Automatic generation of artistic Chinese calligraphy. IEEE Intelligent Systems 20, 3 (2005), 32--39.
[36]
Zili Yi, Hao Zhang, Ping Tan, and Minglun Gong. 2017. Dualgan: Unsupervised dual learning for image-to-image translation. In Proceedings of the IEEE international conference on computer vision. 2849--2857.
[37]
Jinhui Yu and Qunsheng Peng. 2005. Realistic synthesis of cao shu of Chinese calligraphy. Computers & Graphics 29, 1 (2005), 145--153.
[38]
Xiaoming Yu, Yuanqi Chen, Shan Liu, Thomas Li, and Ge Li. 2019. Multi-mapping Image-to-Image Translation via Learning Disentanglement. In Advances in Neural Information Processing Systems. 2990--2999.
[39]
Yexun Zhang, Ya Zhang, and Wenbin Cai. 2018. Separating style and content for generalized style transfer. In Proceedings of the IEEE conference on computer vision and pattern recognition. 8447--8455.
[40]
Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba. 2016. Learning deep features for discriminative localization. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2921--2929.
[41]
Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A Efros. 2017. Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE international conference on computer vision. 2223--2232.
[42]
Jun-Yan Zhu, Richard Zhang, Deepak Pathak, Trevor Darrell, Alexei A Efros, Oliver Wang, and Eli Shechtman. 2017. Toward multimodal image-to-image translation. In Advances in Neural Information Processing Systems. 465--476.
[43]
Alfred Zong and Yuke Zhu. 2014. Strokebank: Automating personalized chinese handwriting generation. In Twenty-Sixth IAAI Conference.

Cited By

View all
  • (2024)The Application of Geometric Figures in Traditional Calligraphic Art and Its Visual Communication EffectApplied Mathematics and Nonlinear Sciences10.2478/amns-2024-35739:1Online publication date: 27-Nov-2024
  • (2024)Future Ink: The Collision of AI and Chinese CalligraphyJournal on Computing and Cultural Heritage 10.1145/3700882Online publication date: 26-Nov-2024
  • (2023)Calligraphy's Reflective Expression: Intelligent Chinese Calligraphy Affective and Style Recognition2023 15th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC)10.1109/IHMSC58761.2023.00027(80-83)Online publication date: Aug-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '21: Proceedings of the 29th ACM International Conference on Multimedia
October 2021
5796 pages
ISBN:9781450386517
DOI:10.1145/3474085
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. ai and art
  2. calligraphy dataset
  3. calligraphy generation
  4. generative adversarial network

Qualifiers

  • Research-article

Conference

MM '21
Sponsor:
MM '21: ACM Multimedia Conference
October 20 - 24, 2021
Virtual Event, China

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)90
  • Downloads (Last 6 weeks)11
Reflects downloads up to 22 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)The Application of Geometric Figures in Traditional Calligraphic Art and Its Visual Communication EffectApplied Mathematics and Nonlinear Sciences10.2478/amns-2024-35739:1Online publication date: 27-Nov-2024
  • (2024)Future Ink: The Collision of AI and Chinese CalligraphyJournal on Computing and Cultural Heritage 10.1145/3700882Online publication date: 26-Nov-2024
  • (2023)Calligraphy's Reflective Expression: Intelligent Chinese Calligraphy Affective and Style Recognition2023 15th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC)10.1109/IHMSC58761.2023.00027(80-83)Online publication date: Aug-2023
  • (2022)SE-GAN: Skeleton Enhanced Gan-Based Model for Brush Handwriting Font Generation2022 IEEE International Conference on Multimedia and Expo (ICME)10.1109/ICME52920.2022.9859964(1-6)Online publication date: 18-Jul-2022
  • (2022)The Doctrine of the Mean: Chinese Calligraphy with Moderate Visual Complexity Elicits High Aesthetic PreferenceInternational Journal of Human–Computer Interaction10.1080/10447318.2022.214486440:6(1355-1368)Online publication date: 17-Nov-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media