research-article

Open access

Living Papers: A Language Toolkit for Augmented Scholarly Communication

Authors:

Matthew Conlen,

Vishal Devireddy,

Joshua Horowitz

UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology

Article No.: 42, Pages 1 - 13

https://doi.org/10.1145/3586183.3606791

Published: 29 October 2023

Abstract

Computing technology has deeply shaped how academic articles are written and produced, yet article formats and affordances have changed little over centuries. The status quo consists of digital files optimized for printed paper—ill-suited to interactive reading aids, accessibility, dynamic figures, or easy information extraction and reuse. Guided by formative discussions with scholarly communication researchers and publishing tool developers, we present Living Papers, a language toolkit for producing augmented academic articles that span print, interactive, and computational media. Living Papers articles may include formatted text, references, executable code, and interactive components. Articles are parsed into a standardized document format from which a variety of outputs are generated, including static PDFs, dynamic web pages, and extraction APIs for paper content and metadata. We describe Living Papers’ architecture, document model, and reactive runtime, and detail key aspects such as citation processing and conversion of interactive components to static content. We demonstrate the use and extension of Living Papers through examples spanning traditional research papers, explorable explanations, information extraction, and reading aids such as enhanced citations, cross-references, and equations. Living Papers is available as an extensible, open source platform intended to support both article authors and researchers of augmented reading and writing experiences.

Cited By

Suh S, Dang H, Yen R, Pollock J, Arawjo I, Kazi R, Subramonyam H, Li J, Saquib N, and Satyanarayan A. 2024. Dynamic Abstractions: Building the Next Generation of Cognitive Tools and Interfaces. Adjunct Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, 1-3. Online publication date: 13-Oct-2024. https://dl.acm.org/doi/10.1145/3672539.3686706

Han C and Isaacs K. 2024. A Deixis-Centered Approach for Documenting Remote Synchronous Communication Around Data Visualizations. IEEE Transactions on Visualization and Computer Graphics 31, 1, 930-940. Online publication date: 10-Sep-2024. https://dl.acm.org/doi/10.1109/TVCG.2024.3456351

Lee M and Chen B. 2024. MaugVLink: Augmenting Mathematical Formulas with Visual Links. 2024 IEEE 17th Pacific Visualization Conference (PacificVis), 337-342. Online publication date: 23-Apr-2024. https://doi.org/10.1109/PacificVis60374.2024.00048

Index Terms

Living Papers: A Language Toolkit for Augmented Scholarly Communication
1. Human-centered computing
  1. Human computer interaction (HCI)
    1. Interactive systems and tools

Information & Contributors

Information

Published In

UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology

October 2023

1825 pages

ISBN:9798400701320

DOI:10.1145/3586183

Editors:
Sean Follmer (Stanford University, USA),
Jeff Han,
Jürgen Steimle (Saarland University, Germany),
Nathalie Henry Riche (Microsoft Research, USA)

Copyright © 2023 Owner/Author.

This work is licensed under a Creative Commons Attribution International 4.0 License.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 October 2023

Badges

Honorable Mention

Qualifiers

Research-article
Research
Refereed limited

Conference

UIST '23

UIST '23: The 36th Annual ACM Symposium on User Interface Software and Technology

October 29 - November 1, 2023

San Francisco, CA, USA

Acceptance Rates

Overall Acceptance Rate 561 of 2,567 submissions, 22%

Bibliometrics & Citations

Article Metrics

Total Citations: 3
Total Downloads: 1,532
Downloads (Last 12 months): 1,203
Downloads (Last 6 weeks): 146

Reflects downloads up to 13 Jan 2025
